Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs Paper • 2309.05516 • Published Sep 11, 2023 • 11
Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 24 items • Updated 1 day ago • 90
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 507
Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face Jul 30, 2025 • 200
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9, 2025 • 769
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated Dec 23, 2025 • 21
Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK Nov 21, 2024 • 35