baidu/ERNIE-4.5-VL-28B-A3B-PT Image-Text-to-Text • 29B • Updated 16 days ago • 99.4k • • 93
How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective Paper • 2505.21505 • Published May 27, 2025 • 18
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer Paper • 2503.09277 • Published Mar 12, 2025 • 1
Running 189 Video Generation Leaderboard 📊 189 Text to Video and Image to Video Arena & Leaderboard
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection Paper • 2409.08513 • Published Sep 13, 2024 • 14