OpenReasoning-Nemotron Collection Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. β’ 6 items β’ Updated 8 days ago β’ 38
π Optimized Models: torchao & Pruna Quantization Collection Quantized Models using torchao & Pruna for efficient inference and deployment. β’ 8 items β’ Updated 5 days ago β’ 1
view article Article Creating custom kernels for the AMD MI300 By ror and 1 other β’ 21 days ago β’ 43
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others β’ 22 days ago β’ 590
π June 2025 - Open works from the Chinese community Collection 29 items β’ Updated 1 day ago β’ 7
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. β’ 25 items β’ Updated 18 days ago β’ 156
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper β’ 2506.11763 β’ Published Jun 13 β’ 69
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others β’ Jun 26 β’ 113
view article Article Transformers backend integration in SGLang By marcsun13 and 4 others β’ Jun 23 β’ 49
view article Article MCP is at a Tipping Point: Here's Why You Should Care By fdaudens β’ Jun 10 β’ 17
view article Article Sensitivity Aware Mixed Precision Quantization V1 By badaoui and 1 other β’ Jun 13 β’ 18
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others β’ Jun 3 β’ 77
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann β’ 8 items β’ Updated Jun 13 β’ 152
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others β’ May 23 β’ 151