dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model Paper • 2512.02498 • Published Dec 2, 2025 • 2
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery Paper • 2602.08990 • Published 9 days ago • 69
DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 13 days ago • 41
Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch Paper • 2602.03183 • Published 15 days ago • 11
Qwen3-ASR Demo 🎙 • Transcribe audio to text with multi-language timestamps • 101