view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality +2 saurabhdash, olivernan, ArashAhmadian, johndang-cohere β’ Mar 4, 2025 β’ 78
view article Article FastRTC: The Real-Time Communication Library for Python freddyaboulton, abidlabs β’ Feb 25, 2025 β’ 172
view article Article SmolVLM2: Bringing Video Understanding to Every Device +5 orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova β’ Feb 20, 2025 β’ 338
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark wolfram β’ Jan 2, 2025 β’ 41
Adding Conditional Control to Text-to-Image Diffusion Models Paper β’ 2302.05543 β’ Published Feb 10, 2023 β’ 58
Training Large Language Models to Reason in a Continuous Latent Space Paper β’ 2412.06769 β’ Published Dec 9, 2024 β’ 95
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper β’ 2412.05271 β’ Published Dec 6, 2024 β’ 161
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper β’ 2411.16489 β’ Published Nov 25, 2024 β’ 45
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π manu β’ Jul 5, 2024 β’ 317
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper β’ 2406.08464 β’ Published Jun 12, 2024 β’ 72
view article Article Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages +7 Quent-01, nilabhra, rcojocaru, Mughaira, gcampesan, SanathNarayan, griffintaur, clefourrier, SaylorTwift β’ May 24, 2024 β’ 28
view article Article Hugging Face x LangChain : A new partner package +1 Jofthomas, kkondratenko, efriis β’ May 14, 2024 β’ 161
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model +1 merve, andsteing, pcuenq β’ May 14, 2024 β’ 287
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model Paper β’ 2405.04434 β’ Published May 7, 2024 β’ 25
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Dec 6, 2024 β’ 964