Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xlalex 's Collections
synthesis
perception
survey
RL
critic
speech full duplex
agent
self-paly

RL

updated 8 days ago
Upvote
-

  • Language Models that Think, Chat Better

    Paper • 2509.20357 • Published 30 days ago

  • Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

    Paper • 2505.21457 • Published May 27 • 15
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs