arxiv:2603.20105
xiaotong
xtongji
AI & ML interests
None yet
Recent Activity
authored a paper about 11 hours ago
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
authored a paper about 11 hours ago
Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers
authored a paper about 11 hours ago
The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus
Organizations
None yet