ndl-core-collection Collection A collection of UK government structured datasets and textual sources for research, analysis, and AI applications. • 6 items • Updated Jan 12 • 3
Article Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets 3 days ago • 8
Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding 5 days ago • 41
Datasets of AI Ecosystem Data Collection Datasets shared on the Hub to support research and investigation of the AI ecosystem • 3 items • Updated 7 days ago • 1
Visualizations of AI Ecosystem Data Collection Spaces and demos showing the evolution of the AI ecosystem • 6 items • Updated 7 days ago • 1
Research on AI Ecosystem Data Collection Research papers leveraging AI ecosystem data • 6 items • Updated 7 days ago • 1
Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 15 days ago • 76
Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 14 days ago • 182
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published 25 days ago • 88
Article easytranscriber: Speech Recognition with Accurate Timestamps in the HF Ecosystem 21 days ago • 5
Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 64