Shubham Parashar

shubhamprshr

AI & ML interests

Computer Vision, Multi-Modal Learning

Recent Activity

authored a paper 11 days ago

Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning

updated a model 12 days ago

divelab/DAPO_E2H-countdown-gaussian_0p5_0p5

updated a model 12 days ago

divelab/DAPO_E2H-math-gaussian_0p5_0p5

Organizations

authored a paper 11 days ago

Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning

Paper • 2506.06632 • Published Mar 16

updated 4 models 12 days ago

published 4 models 12 days ago

divelab/DAPO_E2H-gsm8k-gaussian_0p25_0p75

Text Generation • 2B • Updated 12 days ago • 254

divelab/DAPO_E2H-countdown-gaussian_0p5_0p5

Text Generation • 2B • Updated 12 days ago • 241

divelab/DAPO_E2H-math-gaussian_0p5_0p5

Text Generation • 2B • Updated 12 days ago • 272

divelab/DAPO_E2H-math-cosine

Text Generation • 2B • Updated 12 days ago • 300

commented 2 papers 3 months ago

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Paper • 2601.20614 • Published Jan 28 • 120

updated 2 models 5 months ago

shubhamprshr/Qwen2.5-1.5B-Instruct_gsm8k_grpo_gaussian_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 20, 2025 • 9

shubhamprshr/Qwen2.5-1.5B-Instruct_gsm8k_grpo_gaussian_0.25_0.75_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 20, 2025 • 3

published 2 models 5 months ago

shubhamprshr/Qwen2.5-1.5B-Instruct_gsm8k_grpo_gaussian_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 20, 2025 • 9

shubhamprshr/Qwen2.5-1.5B-Instruct_gsm8k_grpo_gaussian_0.25_0.75_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 20, 2025 • 3

updated 4 models 5 months ago

shubhamprshr/Qwen2.5-1.5B-Instruct_countdown2345_grpo_cosine_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 18, 2025 • 5

shubhamprshr/Qwen2.5-1.5B-Instruct_countdown2345_grpo_gaussian_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 18, 2025 • 8

shubhamprshr/Qwen2.5-1.5B-Instruct_math_grpo_gaussian_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 18, 2025 • 3

shubhamprshr/Qwen2.5-1.5B-Instruct_math_grpo_cosine_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 18, 2025 • 5

published a model 5 months ago

shubhamprshr/Qwen2.5-1.5B-Instruct_countdown2345_grpo_cosine_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 2B • Updated Nov 18, 2025 • 5
