MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues Paper • 2512.03046 • Published 1 day ago • 9
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 1 day ago • 100
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 1 day ago • 64
Back to Basics: Let Denoising Generative Models Denoise Paper • 2511.13720 • Published 17 days ago • 61
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated 18 days ago • 68
view article Article We’re open-sourcing our text-to-image model and the process behind it 22 days ago • 73
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16 • 65
Phoenix-VAD: Streaming Semantic Endpoint Detection for Full-Duplex Speech Interaction Paper • 2509.20410 • Published Sep 24 • 2
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Paper • 2510.02283 • Published Oct 2 • 94
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing Paper • 2509.26346 • Published Sep 30 • 18