Submitted by akhaliq · 221 · OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models · 5 authors · 21
Submitted by Myashka · 115 · The Differences Between Direct Alignment Algorithms are a Blur · 5 authors · 2
Submitted by ahmed-masry · 39 · AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding · 22 authors · 5
Submitted by jimi888 · 31 · SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model · 11 authors · 5
Submitted by RohitGandikota · 26 · SliderSpace: Decomposing the Visual Capabilities of Diffusion Models · 6 authors · 99 · 8
Submitted by xinyan233333 · 24 · DeepRAG: Thinking to Retrieval Step by Step for Large Language Models · 9 authors · 2
Submitted by huanqia · 24 · MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models · 3 authors · 2
Submitted by yiren98 · 21 · MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation · 3 authors · 2
Submitted by akhaliq · 18 · ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning · 7 authors · 2
Submitted by dongwonjo · 17 · FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation · 4 authors · 22 · 2
Submitted by akhaliq · 14 · The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles · 4 authors · 2
Submitted by hba123 · 11 · Almost Surely Safe Alignment of Large Language Models at Inference-Time · 6 authors · 2
Submitted by arjunguha · 10 · PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models · 8 authors · 6
Submitted by PAlbert31 · 9 · RandLoRA: Full-rank parameter-efficient fine-tuning of large models · 6 authors · 3
Submitted by akshat57 · 5 · Lifelong Sequential Knowledge Editing without Model Degradation · 6 authors · 2
Submitted by Bowen232 · 4 · LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information · 6 authors · 2
Submitted by vshrivas · 4 · Language Models Prefer What They Know: Relative Confidence Estimation via Confidence Preferences · 3 authors · 2
Submitted by moein99 · 3 · A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation · 8 authors · 3
Submitted by EdwinDdeJong · 2 · Current Pathology Foundation Models are unrobust to Medical Center Differences · 3 authors · 2