arxiv:2509.15207
zhu
xuekai
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
9 days ago
P1: Mastering Physics Olympiads with Reinforcement Learning
commented on
a paper
about 1 month ago
FlowRL: Matching Reward Distributions for LLM Reasoning
updated
a model
about 1 month ago
xuekai/FlowRL-DeepSeek-7B-code