Hejian Sang
pb09204048
AI & ML interests
None yet
Recent Activity
upvoted a paper 15 days ago
On-Policy Self-Distillation for Reasoning Compression
submitted a paper 16 days ago
On-Policy Self-Distillation for Reasoning Compression
authored a paper 18 days ago
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning