Salman Rahman

salmannyu

https://salmanrahman.net/

AI & ML interests

Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation

Recent Activity

updated a model 4 days ago

salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step150

4B • Updated 4 days ago • 28

published a model 4 days ago

salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step150

4B • Updated 4 days ago • 28

updated a model 4 days ago

salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step100

4B • Updated 4 days ago • 19

published a model 4 days ago

salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step100

4B • Updated 4 days ago • 19

updated a model 13 days ago

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-non-think-nopack-lr1.5e5-ep3

3B • Updated 13 days ago • 12

published a model 13 days ago

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-non-think-nopack-lr1.5e5-ep3

3B • Updated 13 days ago • 12

updated a model 13 days ago

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-nopack-lr1.5e5-ep3

3B • Updated 13 days ago • 14

published a model 13 days ago

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-nopack-lr1.5e5-ep3

3B • Updated 13 days ago • 14

updated a model 17 days ago

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full

Text Generation • 3B • Updated 17 days ago • 41

published a model 17 days ago

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full

Text Generation • 3B • Updated 17 days ago • 41

updated a model 21 days ago

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-140K-Step

3B • Updated 21 days ago • 7

published a model 21 days ago

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-140K-Step

3B • Updated 21 days ago • 7

updated a model about 1 month ago

salmannyu/Qwen2.5-1.5B-Nemotron-Math-52B-Mid-Train-8

Text Generation • 2B • Updated Feb 8 • 5

published a model about 1 month ago

salmannyu/Qwen2.5-1.5B-Nemotron-Math-52B-Mid-Train-8

Text Generation • 2B • Updated Feb 8 • 5

upvoted 3 papers 3 months ago

WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment

Paper • 2512.12692 • Published Dec 14, 2025 • 14

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 39

SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning

Paper • 2512.03244 • Published Dec 2, 2025 • 17

submitted a paper to Daily Papers 3 months ago

SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning

Paper • 2512.03244 • Published Dec 2, 2025 • 17

updated a model 4 months ago

salmannyu/nemotron-train8-52B-Token

2B • Updated Nov 8, 2025 • 2

published a model 4 months ago

salmannyu/nemotron-train8-52B-Token

2B • Updated Nov 8, 2025 • 2
