Rajdeep Borgohain's picture

Rajdeep Borgohain

rbgo

·

RajdeepBorgohain

AI & ML interests

Solving language barriers.

Recent Activity

updated a model 2 days ago

Inferless/gpt-oss-20b

published a model 2 days ago

Inferless/gpt-oss-20b

upvoted a paper 19 days ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

View all activity

Organizations

upvoted a paper 19 days ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published 23 days ago • 41

upvoted an article about 1 month ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

Jul 8

• 614

upvoted a collection 4 months ago

Gemma 3 Release

24 items • Updated 29 days ago • 426

upvoted an article 4 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 449

upvoted 2 articles 5 months ago

Article

Inside the family of Smol models

By

and 1 other •

Feb 27

• 13

Article

SmolLM - blazingly fast and remarkably powerful

By

and 2 others •

Jul 16, 2024

• 404

upvoted a paper 5 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

upvoted a collection 5 months ago

Phi-4

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated 29 days ago • 176

upvoted a paper 5 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 200

upvoted a paper 6 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 154

upvoted 2 articles 6 months ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

Jan 23

• 69

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 876

upvoted 2 collections 6 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 18 days ago • 522

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 18 days ago • 120

upvoted 2 collections 7 months ago

DeepSeek-V2

8 items • Updated Jan 3 • 31

DeepSeek-LLM

DeepSeek LLM series • 5 items • Updated Aug 16, 2024 • 21

upvoted a paper 7 months ago

KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models

Paper • 2412.06071 • Published Dec 8, 2024 • 9

upvoted an article 7 months ago

Article

Timm ❤️ Transformers: Use any timm model with transformers

By

and 4 others •

Jan 16

• 51

upvoted a paper 7 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 121

upvoted a collection 8 months ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 29 days ago • 149