Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated 7 days ago • 578k • 2.6k
ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12, 2024 • 72
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Paper • 2401.08417 • Published Jan 16, 2024 • 37
Running on CPU Upgrade Featured 3.1k The Smol Training Playbook 📚 3.1k The secrets to building world-class LLMs
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 292
Holo1.5 Collection Holo1.5 - Open Foundation Models for Computer Use Agents • 5 items • Updated Sep 15, 2025 • 35