arxiv:2407.13048
Yu Meng
yumeng5
AI & ML interests
None yet
Recent Activity
upvoted a paper about 21 hours ago
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
upvoted a paper 5 months ago
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
upvoted a paper about 1 year ago
Efficient Test-Time Scaling via Self-Calibration