sean29's Collections

todo

updated 7 days ago

  • Less is More: Recursive Reasoning with Tiny Networks

    Paper • 2510.04871 • Published 16 days ago • 427

  • Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

    Paper • 2509.25541 • Published 23 days ago • 136

  • Agent Learning via Early Experience

    Paper • 2510.08558 • Published 13 days ago • 235

  • DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

    Paper • 2509.25454 • Published 23 days ago • 133

  • MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

    Paper • 2509.24002 • Published 24 days ago • 165

  • A Survey of Reinforcement Learning for Large Reasoning Models

    Paper • 2509.08827 • Published Sep 10 • 183

  • VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

    Paper • 2509.09372 • Published Sep 11 • 230

  • Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

    Paper • 2508.01191 • Published Aug 2 • 236

  • Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

    Paper • 2510.09201 • Published 12 days ago • 46