Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

OpenRLHF

community
https://github.com/OpenRLHF
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

chuyi777  updated a model 4 days ago
OpenRLHF/Llama-3-8b-rm-700k
catqaq  new activity 11 days ago
OpenRLHF/Llama-3-8b-rm-700k:Improve model card: add tags, paper/code links, and usage example
chuyi777  authored a paper about 2 months ago
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
View all activity

Xianyu's profile picture AtsushiSaito's profile picture Jian Hu's profile picture Zhang Ruichong's profile picture Qing Wang's profile picture Longhui Yu's profile picture Chenhe Gu's profile picture

OpenRLHF 's models 10

OpenRLHF/Llama-3-8b-rm-700k

Text Ranking • 8B • Updated 4 days ago • 922 • 3

OpenRLHF/Llama-3-8b-rm-mixture

8B • Updated Nov 30, 2024 • 216 • 1

OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt

7B • Updated Nov 30, 2024 • 7 • 1

OpenRLHF/Mistral-7b-PRM-Math-Shepherd

7B • Updated Oct 30, 2024 • 8 • 1

OpenRLHF/Llama-3-8b-iter-dpo-179k

Text Generation • 8B • Updated Jul 28, 2024 • 22

OpenRLHF/Llama-3-8b-rlhf-100k

Text Generation • 8B • Updated Jun 24, 2024 • 286 • 4

OpenRLHF/Llama-3-8b-sft-mixture

Text Generation • 8B • Updated Jun 14, 2024 • 4.62k • • 1

OpenRLHF/Llama-2-7b-sft-model-ocra-500k

Text Generation • 7B • Updated Jun 9, 2024 • 6

OpenRLHF/Llama-2-13b-rm-anthropic_hh-lmsys-oasst-webgpt

13B • Updated Jan 24, 2024 • 4

OpenRLHF/Llama-2-13b-sft-model-ocra-500k

Text Generation • 13B • Updated Jan 5, 2024 • 7 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs