Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning Paper • 2605.02913 • Published Apr 8 • 8
SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion Paper • 2605.01466 • Published 10 days ago • 6
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 9 days ago • 152
sjin4861/asap-7shot-sim-option3-grpo-qwen3.5.9b-fold0-20260501-202559 Text Generation • 9B • Updated 10 days ago • 32 • 1
On the Robustness of LLM-Based Dense Retrievers: A Systematic Analysis of Generalizability and Stability Paper • 2604.16576 • Published 25 days ago • 2
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published 29 days ago • 101
AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors Paper • 2601.20524 • Published Apr 9 • 6
Qualixar OS: A Universal Operating System for AI Agent Orchestration Paper • 2604.06392 • Published Apr 7 • 17
waxal-benchmarking/mms-300m-wal-aki Automatic Speech Recognition • 0.3B • Updated 18 days ago • 99 • 1
A Neural Score-Based Particle Method for the Vlasov-Maxwell-Landau System Paper • 2603.25832 • Published Mar 26 • 4
Diffutron: A Masked Diffusion Language Model for Turkish Language Paper • 2603.20466 • Published Mar 20 • 9