MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 12 items • Updated 1 day ago • 40
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning Paper • 2505.24298 • Published 9 days ago • 21
Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • 5 days ago • 96
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published 5 days ago • 75
One-RL-to-See-Them-All Collection One RL to See Them All: Visual Triple Unified Reinforcement Learning. GitHub: https://github.com/MiniMax-AI/One-RL-to-See-Them-All • 5 items • Updated 11 days ago • 27
SynLogic Collection Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond • 5 items • Updated 5 days ago • 8
ZeroGUI Collection ZeroGUI: Automating Online GUI Learning at Zero Human Cost • 3 items • Updated 9 days ago • 1
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Paper • 2505.19457 • Published 13 days ago • 61
My MCP-ready spaces [WIP] Collection Progressive list of MCP server ready trending spaces maintained by fffiloni • 24 items • Updated about 20 hours ago • 4
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published 13 days ago • 64