Xiaohang Tang
timxiaohangt
AI & ML interests
Reinforcement Learning, Game Theory
Recent Activity
published a model 2 days ago: timxiaohangt/Qwen2.5-1.5B-Open-R1-GRPO
updated a model 24 days ago: timxiaohangt/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
published a model about 2 months ago: timxiaohangt/DeepSeek-R1-Distill-Qwen-1.5B-GRPO