srivatsa

srivatsa92

devsrivatsa

AI & ML interests

rag, agents, fine-tuning

Recent Activity

liked a model 3 days ago

unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF

Text Generation • 121B • Updated about 6 hours ago • 56.4k • 85

liked a Space 9 days ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

liked a dataset 15 days ago

google/mobile-actions

Viewer • Updated Dec 18, 2025 • 9.65k • 1.64k • 258

liked a model 2 months ago

ai21labs/AI21-Jamba2-3B

Text Generation • Updated Feb 2 • 3.36k • 40

liked a Space 4 months ago

The Smol Training Playbook

📚

3.05k

The secrets to building world-class LLMs

upvoted a collection 4 months ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 42

liked a dataset 4 months ago

bigcode/the-stack

Viewer • Updated Apr 13, 2023 • 546M • 14.4k • 968

upvoted an article 4 months ago

Article

Let's talk about LLM evaluation

May 23, 2024 • 207

liked a Space 4 months ago

Open ASR Leaderboard

🏆

1.25k

Explore and compare speech recognition model benchmarks

liked a dataset 7 months ago

neerajaabhyankar/hindustani-raag-small

Viewer • Updated Mar 20, 2024 • 1.25k • 266 • 3

upvoted 2 articles 7 months ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16, 2025

Article

Efficient Request Queueing – Optimizing LLM Performance

Apr 2, 2025

updated a Space 7 months ago

GPU VRAM Estimator

🚀

Estimate VRAM and training time for LLMs

published a Space 7 months ago

GPU VRAM Estimator

🚀

Estimate VRAM and training time for LLMs

liked a model 9 months ago

Comfy-Org/Wan_2.1_ComfyUI_repackaged

Updated Jan 28 • 5.44M • 862

liked 2 datasets 9 months ago

vidore/colpali_train_set

Viewer • Updated Jun 20, 2025 • 119k • 5.96k • 91

llamaindex/vdr-multilingual-train

Viewer • Updated Jan 10, 2025 • 424k • 2.43k • 28

liked 2 models 9 months ago

unsloth/Nanonets-OCR-s-GGUF

Image-Text-to-Text • 3B • Updated Jul 3, 2025 • 2.99k • 59

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 48.4k • 1.59k

upvoted an article 10 months ago

Article

The Transformers Library: standardizing model definitions

May 15, 2025 • 121
