Submitted by PhoenixZ 103 MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization · 14 authors 64 3
Submitted by binxia 72 DreamOmni2: Multimodal Instruction-based Editing and Generation · 13 authors 3
Submitted by taesiri 65 UniVideo: Unified Understanding, Generation, and Editing for Videos · 8 authors 3
Submitted by taesiri 60 VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning Kuaishou Visual Generation and Interaction Center 55 2
Submitted by yjyjyj98 54 Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning KAIST AI 7 4
Submitted by Blue-Giant 48 From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning · 5 authors 2
Submitted by Carlanlarkk 43 Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward Tencent 16 2
Submitted by jackzhang 39 The Alignment Waltz: Jointly Training Agents to Collaborate for Safety AI at Meta 2
Submitted by UML 31 ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation shanghai ailab 98 2
Submitted by Kylin-ll 30 Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense AI at Meta 2
Submitted by tqfang229 27 NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents · 13 authors 120 2
Submitted by olafyiii 24 First Try Matters: Revisiting the Role of Reflection in Reasoning Models · 6 authors 2 2
Submitted by tsq2000 23 DeepPrune: Parallel Scaling without Inter-trace Redundancy Knowledge Engineer Group @ Tsinghua University 13 2
Submitted by Foreshhh 22 LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions Fudan University 2
Submitted by Wayne-lc 22 Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks KnowledgeXLab@Shanghai AI Lab 2
Submitted by YOKIMIYA 20 UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution Kuaishou Visual Generation and Interaction Center 11 3
Submitted by SoroushMehraban 20 PickStyle: Video-to-Video Style Transfer with Context-Style Adapters Pickford 2
Submitted by Changyao 19 NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints OpenGVLab 81 2
Submitted by xxyQwQ 18 CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards · 10 authors 13 2
Submitted by canqin001 15 UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG Salesforce 10 4
Submitted by ZetangForward 14 LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling Soochow University 16 2
Submitted by Luo-Yihong 10 Reinforcing Diffusion Models by Direct Group Preference Optimization · 3 authors 16 2
Submitted by Guan123 10 Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction Apple 2
Submitted by xiangh 9 Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window · 14 authors 2
Submitted by taesiri 8 SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models · 8 authors 8 3
Submitted by worstcoder 8 Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency · 10 authors 2
Submitted by Co2y 8 UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections · 7 authors 99 3
Submitted by hyc2026 7 Memory Retrieval and Consolidation in Large Language Models through Function Tokens ByteDance Seed 2
Submitted by lliutianc 7 OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment OpenRubrics 2
Submitted by ChonghuaLiao 6 Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints · 4 authors 13 2
Submitted by xymeow7 5 DexNDM: Closing the Reality Gap for Dexterous In-Hand Rotation via Joint-Wise Neural Dynamics Model · 3 authors 2
Submitted by Mr-Philo 5 Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training · 8 authors 2
Submitted by cfahlgren1 5 OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction · 9 authors 2
Submitted by xuxw98 4 R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation · 7 authors 2
Submitted by andreasengelhardt 4 SViM3D: Stable Video Material Diffusion for Single Image 3D Generation Stability AI 2
Submitted by zfj1998 3 A^2Search: Ambiguity-Aware Question Answering with Reinforcement Learning City University of Hong Kong 4 3
Submitted by Franck-Dernoncourt 3 Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs · 7 authors 2
Submitted by ytgui 3 Search-R3: Unifying Reasoning and Embedding Generation in Large Language Models · 2 authors 10 2
Submitted by paischer101 2 GyroSwin: 5D Surrogates for Gyrokinetic Plasma Turbulence Simulations Johannes Kepler University 2
Submitted by jiahaoplus 2 Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models · 14 authors 2
Submitted by Saleh 2 Beyond Outliers: A Study of Optimizers Under Quantization Scalable Parallel Computing Laboratory (SPCL) 2
Submitted by ahmedhendawy19 1 Use the Online Network If You Can: Towards Fast and Stable Reinforcement Learning · 6 authors 2
Submitted by ryancll118 1 Fidelity-Aware Data Composition for Robust Robot Generalization · 9 authors 2