arxiv:2409.12568
Yiqi Wang
Yi-Qi638
AI & ML interests: None yet
Recent Activity
Liked a model about 12 hours ago: inclusionAI/Ring-2.5-1T
Upvoted a paper 10 months ago: Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Upvoted a paper 10 months ago: QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining