ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper ⢠2505.24864 ⢠Published 8 days ago ⢠115
SANA-1.5 Collection SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer ⢠6 items ⢠Updated Apr 17 ⢠6
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing Paper ⢠2503.13434 ⢠Published Mar 17 ⢠27
BrushEdit: All-In-One Image Inpainting and Editing Paper ⢠2412.10316 ⢠Published Dec 13, 2024 ⢠36
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper ⢠2411.07975 ⢠Published Nov 12, 2024 ⢠31
Running on Zero 155 155 Chat With Janus 1.3B š A unified multimodal understanding and generation model.
Running on Zero 155 155 Chat With Janus 1.3B š A unified multimodal understanding and generation model.
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper ⢠2410.13848 ⢠Published Oct 17, 2024 ⢠35 ⢠4
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper ⢠2410.13848 ⢠Published Oct 17, 2024 ⢠35
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper ⢠2410.13848 ⢠Published Oct 17, 2024 ⢠35
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation Paper ⢠2410.13848 ⢠Published Oct 17, 2024 ⢠35 ⢠4
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots Paper ⢠2405.07990 ⢠Published May 13, 2024 ⢠21 ⢠4
What matters when building vision-language models? Paper ⢠2405.02246 ⢠Published May 3, 2024 ⢠104