Runzhe Zhan's picture

Runzhe Zhan

rzzhan

·

https://runzhe.me/

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

AI Can Learn Scientific Taste

upvoted a paper about 1 month ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

upvoted a paper about 2 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

View all activity

Organizations

None yet

Collections 2

Papers 2

arxiv:2510.20780

arxiv:2510.02245

models 10

rzzhan/ThinMQM-8B

Text Generation • 8B • Updated Oct 28, 2025 • 2

rzzhan/ExGRPO-Llama3.1-8B-Instruct

Text Generation • 8B • Updated Oct 24, 2025 • 2

rzzhan/ExGRPO-Llama3.1-8B-Zero

Text Generation • 8B • Updated Oct 24, 2025 • 3

rzzhan/ExGRPO-Qwen2.5-Math-1.5B-Zero

Text Generation • 2B • Updated Oct 24, 2025 • 3

rzzhan/ExGRPO-Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Oct 24, 2025 • 2

rzzhan/ExGRPO-LUFFY-7B-Continual

Text Generation • 8B • Updated Oct 24, 2025 • 5 • 1

rzzhan/ExGRPO-Qwen2.5-Math-7B-Zero

Text Generation • 8B • Updated Oct 24, 2025 • 6

rzzhan/ThinMQM-7B

8B • Updated Oct 24, 2025 • 16

rzzhan/ThinMQM-32B

33B • Updated Oct 24, 2025 • 2

rzzhan/tiny-llama-stories-42m

Updated Sep 17, 2024 • 1

datasets 1

rzzhan/ThinMQM-12k

Viewer • Updated Oct 24, 2025 • 23.9k • 52