EvalTalker: Learning to Evaluate Real-Portrait-Driven Multi-Subject Talking Humans Paper • 2512.01340 • Published Dec 1, 2025
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 22 days ago • 144
LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment Paper • 2604.11689 • Published 8 days ago • 11
LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment Paper • 2604.11689 • Published 8 days ago • 11
UNO-Bench: A Unified Benchmark for Exploring the Compositional Law Between Uni-modal and Omni-modal in OmniModels Paper • 2510.18915 • Published Oct 21, 2025 • 7