On Non-interactive Evaluation of Animal Communication Translators Paper • 2510.15768 • Published Oct 17 • 2
Don't Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting Paper • 2510.08696 • Published Oct 9 • 14
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels Paper • 2510.06499 • Published Oct 7 • 31
Awesome papers from 臺大李宏毅 (Hung-yi Lee) Collection • Recent papers authored by Hung-yi Lee. Sorted by ID • 8 items • Updated Oct 24 • 17
Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition Paper • 2510.08047 • Published Oct 9 • 7
IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling Paper • 2506.00736 • Published May 31 • 10
Vibe Checker: Aligning Code Evaluation with Human Preference Paper • 2510.07315 • Published Oct 8 • 32
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Paper • 2507.02768 • Published Jul 3 • 18
SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models Paper • 2510.06917 • Published Oct 8 • 34
Game-Time: Evaluating Temporal Dynamics in Spoken Language Models Paper • 2509.26388 • Published Sep 30 • 26
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation Paper • 2509.25849 • Published Sep 30 • 47
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29 • 140
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning Paper • 2509.25760 • Published Sep 30 • 55