EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 3 items • Updated about 12 hours ago • 24
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU Paper • 2604.05091 • Published 4 days ago • 37
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper • 2604.04323 • Published 4 days ago • 29
Distilled Models Collection Kimi-2.5, Gemini 3.1 Pro, Qwen3.6-plus • 5 items • Updated 3 days ago • 4
khazarai/Qwen3-4B-Qwen3.6-plus-Reasoning-Distilled-GGUF Text Generation • 4B • Updated about 15 hours ago • 11.8k • 11
Meta CLIP 1 Collection Scaling CLIP data with transparent training distribution from an end-to-end pipeline. • 7 items • Updated Nov 24, 2025 • 23
cwm Collection Collection for Code World Model, an agentic coding model from FAIR. • 3 items • Updated Sep 24, 2025 • 20
MobileLLM-R1 Collection MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21, 2025 • 30