Kashif Rasul's picture

Kashif Rasul

kashif

·

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

new activity 1 day ago

Datadog/BOOM:add arrow path

updated a model 2 days ago

HuggingFaceH4/Qwen2.5-1.5B-Instruct-gkd

published a model 2 days ago

HuggingFaceH4/Qwen2.5-1.5B-Instruct-gkd

View all activity

Organizations

kashif's activity

upvoted 2 articles 3 days ago

Article

xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy

By

•

3 days ago

• 12

Article

KV Cache from scratch in nanoVLM

By

and 4 others •

4 days ago

• 56

upvoted an article 4 days ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By

and 5 others •

5 days ago

• 37

upvoted an article 13 days ago

Article

🐯 Liger GRPO meets TRL

By

and 5 others •

14 days ago

• 36

upvoted an article 15 days ago

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

By

and 3 others •

16 days ago

• 122

upvoted 2 articles 18 days ago

Article

Building an Open Ecosystem for Time Series Forecasting: Introducing TimesFM in Hugging Face

By

and 1 other •

19 days ago

• 16

Article

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

By

and 5 others •

18 days ago

• 26

upvoted an article 2 months ago

Article

Open R1: Update #4

By

and 3 others •

Mar 26

• 48

upvoted a paper 3 months ago

MONSTER: Monash Scalable Time Series Evaluation Repository

Paper • 2502.15122 • Published Feb 21 • 3

upvoted 2 articles 4 months ago

Article

Open R1: Update #2

By

and 6 others •

Feb 10

• 214

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

By

•

Jan 31

• 50

upvoted a paper 5 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 280

upvoted an article 5 months ago

Article

Process Reinforcement through Implicit Rewards

By

and 1 other •

Jan 3

• 27

upvoted 2 papers 6 months ago

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Paper • 2407.00079 • Published Jun 24, 2024 • 5

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20, 2024 • 5

upvoted a paper 8 months ago

A Rate-Distortion View of Uncertainty Quantification

Paper • 2406.10775 • Published Jun 16, 2024 • 1

upvoted 2 papers 9 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 139

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 13

upvoted a collection 9 months ago

Power-LM

Dense & MoE LLMs trained with power learning rate scheduler. • 4 items • Updated Oct 17, 2024 • 15

upvoted a paper 10 months ago

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13, 2024 • 21