Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
khazarai 's Collections
Benchmarks
CoT
Az-Language
GRPO
Text-to-Speech Models
RLHF
SFT

GRPO

updated about 11 hours ago

Group Relative Policy Optimization

Upvote
1

  • khazarai/HeisenbergQ-0.5B-RL

    Text Generation • Updated 27 days ago • 2 • 1

  • khazarai/Math-RL

    Text Generation • Updated 27 days ago • 8 • 1
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs