Xinyu Zhu

TianHongZXY

https://zhuxinyu.top

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

upvoted a paper 16 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

updated a model 19 days ago

TianHongZXY/Qwen3-4B-NSR

published a model about 2 months ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

View all activity

Organizations

upvoted a paper 16 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 16 days ago • 125

updated a model 19 days ago

TianHongZXY/Qwen3-4B-NSR

4B • Updated 19 days ago • 16

published a model about 2 months ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

0.5B • Updated Nov 5

updated a model about 2 months ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

0.5B • Updated Nov 5

upvoted a paper 3 months ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 55

authored a paper 3 months ago

RAST: Reasoning Activation in LLMs via Small-model Transfer

Paper • 2506.15710 • Published May 30

updated a dataset 4 months ago

TianHongZXY/similar_problems_with_three_in_context_problems

Viewer • Updated Sep 4 • 2.16k • 6.47k

published a dataset 4 months ago

TianHongZXY/similar_problems_with_three_in_context_problems

Viewer • Updated Sep 4 • 2.16k • 6.47k

upvoted a paper 4 months ago

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25 • 346

updated a dataset 4 months ago

TianHongZXY/Top_5_similar_question-NVIDIA-OpenScienceReasoning-2

Viewer • Updated Aug 28 • 2.16k • 8.79k

published a dataset 4 months ago

TianHongZXY/Top_5_similar_question-NVIDIA-OpenScienceReasoning-2

Viewer • Updated Aug 28 • 2.16k • 8.79k

liked 2 datasets 4 months ago

cais/hle

Viewer • Updated Sep 10 • 2.5k • 24k • 579

nvidia/OpenScienceReasoning-2

Viewer • Updated Jul 31 • 803k • 554 • 51

liked a model 4 months ago

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17 • 38.5k • • 389

liked a dataset 5 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25 • 25.7M • 12.8k • 169

upvoted a collection 5 months ago

RLVR-Decomposed

Collection

The collection for the Paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning" • 9 items • Updated Jun 1 • 3

updated a model 5 months ago

TianHongZXY/Qwen2.5-Math-7B-GRPO

8B • Updated Jul 28 • 9

updated a model 6 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-Math-7B-RoPE-40K-GRPO-use_guide-clip_ratio_upper_0.28

Updated Jul 12

published a model 6 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-Math-7B-RoPE-40K-GRPO-use_guide-clip_ratio_upper_0.28

Updated Jul 12

updated a model 6 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28

Updated Jul 8

Xinyu Zhu

AI & ML interests

Recent Activity

Organizations

TianHongZXY's activity