Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published 20 days ago
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Paper • 2505.15141 • Published May 21