Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization Paper • 2605.13641 • Published 3 days ago • 2
SparTerm: Learning Term-based Sparse Representation for Fast Text Retrieval Paper • 2010.00768 • Published Oct 2, 2020
Libra: Assessing and Improving Reward Model by Learning to Think Paper • 2507.21645 • Published Jul 29, 2025 • 3
Efficient Context Scaling with LongCat ZigZag Attention Paper • 2512.23966 • Published Dec 30, 2025 • 7