Submitted by jymcc 65 ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation · 8 authors 254 3
Submitted by guipenedo 64 FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language · 10 authors 1
Submitted by SinclairWang 46 OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling · 4 authors 162 1
Submitted by affjljoo3581 44 Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models · 5 authors 29 5
Submitted by ai-alanov 42 Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models · 3 authors 28 1
Submitted by tellarin 22 DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning · 12 authors 15 2
Submitted by msadat97 18 HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling · 4 authors 6
Submitted by TianxingChen 17 RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation · 26 authors 1.37k 1
Submitted by JuliaKreutzerCohere 10 When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs · 5 authors 1
Submitted by fanhongxing 10 Use Property-Based Testing to Bridge LLM Code Generation and Validation · 6 authors 4 1
Submitted by Ningyu 8 ReCode: Updating Code API Knowledge with Reinforcement Learning · 5 authors 12 1
Submitted by gonzmart 8 Is There a Case for Conversation Optimized Tokenizers in Large Language Models? · 4 authors 1
Submitted by JonasGeiping 7 GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching · 6 authors 11 1
Submitted by rntc 5 Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content · 3 authors 1
Submitted by AleksandrAlgazinov 5 MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications · 3 authors 1
Submitted by Epiphqny 5 FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation · 9 authors 1
Submitted by adnaan525 3 The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs · 2 authors 1