How I contributed a new model to the Transformers library using Codex Article • about 15 hours ago • 17
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation Paper • 2603.25804 • Published 5 days ago • 19
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published 5 days ago • 138
Representation Alignment for Just Image Transformers is not Easier than You Think Paper • 2603.14366 • Published 16 days ago • 9
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models Paper • 2603.25502 • Published 5 days ago • 50
Vega: Learning to Drive with Natural Language Instructions Paper • 2603.25741 • Published 5 days ago • 4
FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol Paper • 2603.24943 • Published 5 days ago • 8
Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration Paper • 2603.24800 • Published 5 days ago • 60
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks Paper • 2603.24755 • Published 6 days ago • 25
PixelSmile: Toward Fine-Grained Facial Expression Editing Paper • 2603.25728 • Published 5 days ago • 114
Cohere Multilingual ASR 🎙: Transcribe audio clips to text in many languages Space • Featured • Running on CPU Upgrade • 68
Qworld: Question-Specific Evaluation Criteria for LLMs Paper • 2603.23522 • Published 25 days ago • 9
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis Paper • 2603.20176 • Published 11 days ago • 8
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published 6 days ago • 89
Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments Paper • 2603.23638 • Published 7 days ago • 9
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published 6 days ago • 41