Daily Papers

by AK and the research community

Jun 26

Submitted by

jymcc

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

·
8 authors

Submitted by

guipenedo

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

·
10 authors

Submitted by

SinclairWang

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

·
4 authors

Submitted by

affjljoo3581

Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

·
5 authors

Submitted by

ai-alanov

Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models

·
3 authors

Submitted by

tellarin

DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning

·
12 authors

Submitted by

msadat97

HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling

·
4 authors

Submitted by

TianxingChen

RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation

·
26 authors

Submitted by

uzaymacar

Thought Anchors: Which LLM Reasoning Steps Matter?

·
4 authors

Submitted by

JuliaKreutzerCohere

When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs

·
5 authors

Submitted by

fanhongxing

Use Property-Based Testing to Bridge LLM Code Generation and Validation

·
6 authors

Submitted by

Ningyu

ReCode: Updating Code API Knowledge with Reinforcement Learning

·
5 authors

Submitted by

gonzmart

Is There a Case for Conversation Optimized Tokenizers in Large Language Models?

·
4 authors

Submitted by

JonasGeiping

GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching

·
6 authors

Submitted by

rntc

Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content

·
3 authors

Submitted by

AleksandrAlgazinov

MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications

·
3 authors

Submitted by

Epiphqny

FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation

·
9 authors

Submitted by

adnaan525

The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs

·
2 authors
