Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OpenRLHF
community
https://github.com/OpenRLHF
Activity Feed
Follow
47
AI & ML interests
None defined yet.
Recent Activity
chuyi777
updated
a model
4 days ago
OpenRLHF/Llama-3-8b-rm-700k
catqaq
new
activity
11 days ago
OpenRLHF/Llama-3-8b-rm-700k:
Improve model card: add tags, paper/code links, and usage example
chuyi777
authored
a paper
about 2 months ago
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
View all activity
Team members
7
OpenRLHF
's models
10
Sort: Recently updated
OpenRLHF/Llama-3-8b-rm-700k
Text Ranking
•
8B
•
Updated
4 days ago
•
922
•
3
OpenRLHF/Llama-3-8b-rm-mixture
8B
•
Updated
Nov 30, 2024
•
216
•
1
OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt
7B
•
Updated
Nov 30, 2024
•
7
•
1
OpenRLHF/Mistral-7b-PRM-Math-Shepherd
7B
•
Updated
Oct 30, 2024
•
8
•
1
OpenRLHF/Llama-3-8b-iter-dpo-179k
Text Generation
•
8B
•
Updated
Jul 28, 2024
•
22
OpenRLHF/Llama-3-8b-rlhf-100k
Text Generation
•
8B
•
Updated
Jun 24, 2024
•
286
•
4
OpenRLHF/Llama-3-8b-sft-mixture
Text Generation
•
8B
•
Updated
Jun 14, 2024
•
4.62k
•
•
1
OpenRLHF/Llama-2-7b-sft-model-ocra-500k
Text Generation
•
7B
•
Updated
Jun 9, 2024
•
6
OpenRLHF/Llama-2-13b-rm-anthropic_hh-lmsys-oasst-webgpt
13B
•
Updated
Jan 24, 2024
•
4
OpenRLHF/Llama-2-13b-sft-model-ocra-500k
Text Generation
•
13B
•
Updated
Jan 5, 2024
•
7
•
1