zhu's picture

zhu

xuekai

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling

upvoted a paper about 2 months ago

LLM-in-Sandbox Elicits General Agentic Intelligence

upvoted a paper 4 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

View all activity

Organizations

Papers 16

arxiv:2509.15207

arxiv:2509.09674

arxiv:2509.08827

arxiv:2509.04419

models 3

xuekai/FlowRL-DeepSeek-7B-code

8B • Updated Oct 27, 2025

xuekai/FlowRL-Qwen2.5-32B-math

33B • Updated Oct 27, 2025

xuekai/FlowRL-Qwen2.5-7B-math

8B • Updated Oct 27, 2025 • 14

datasets 2

xuekai/flowrl-data-collection

Preview • Updated Sep 28, 2025 • 149

xuekai/pad_train

Viewer • Updated Mar 21, 2024 • 184k • 34