arxiv:2601.21244
YijuGuo
AI & ML interests
LLM Alignment
Recent Activity
liked
a dataset 21 minutes ago
LulaCola/AgentProcessBench upvoted a paper about 1 month ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper about 1 month ago
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research