Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RLHFlow
university
RLHFlow
RLHFlow
Activity Feed
Follow
135
AI & ML interests
Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/
Team members
8
RLHFlow
's datasets
83
Sort: Recently updated
RLHFlow/pair-preference-dataset-700K
Viewer
•
Updated
May 26, 2024
•
699k
•
10
•
3
RLHFlow/test_generation_2k
Viewer
•
Updated
May 12, 2024
•
2k
•
89
RLHFlow/SHP-standard
Viewer
•
Updated
May 9, 2024
•
93.3k
•
10
RLHFlow/HH-RLHF-Harmless-and-RedTeam-standard
Viewer
•
Updated
May 8, 2024
•
42.3k
•
28
•
3
RLHFlow/prompt-collection-v0.1
Viewer
•
Updated
May 8, 2024
•
179k
•
37
•
9
RLHFlow/pair-preference-dataset-mix1
Viewer
•
Updated
May 6, 2024
•
548k
•
13
•
3
RLHFlow/Prometheus2-preference-standard
Viewer
•
Updated
May 5, 2024
•
200k
•
28
•
2
RLHFlow/iterative-prompt-v1-iter3-20K
Viewer
•
Updated
May 3, 2024
•
20k
•
52
•
3
RLHFlow/iterative-prompt-v1-iter2-20K
Viewer
•
Updated
May 3, 2024
•
20k
•
68
•
3
RLHFlow/iterative-prompt-v1-iter1-20K
Viewer
•
Updated
May 3, 2024
•
20k
•
192
•
2
RLHFlow/Argilla-Math-DPO-standard
Viewer
•
Updated
Apr 30, 2024
•
2.42k
•
14
•
3
RLHFlow/PKU-SafeRLHF-30K-standard
Viewer
•
Updated
Apr 29, 2024
•
26.9k
•
72
•
3
RLHFlow/prm80k-phase2
Viewer
•
Updated
Apr 28, 2024
•
79.5k
•
13
•
4
RLHFlow/mix3
Preview
•
Updated
Apr 28, 2024
•
2
•
1
RLHFlow/UltraInteract-filtered-standard
Viewer
•
Updated
Apr 28, 2024
•
162k
•
13
•
2
RLHFlow/Capybara-distibalel-Filter-standard
Viewer
•
Updated
Apr 28, 2024
•
14.8k
•
30
RLHFlow/Orca-distibalel-standard
Viewer
•
Updated
Apr 28, 2024
•
6.93k
•
14
•
1
RLHFlow/Helpsteer-preference-standard
Viewer
•
Updated
Apr 27, 2024
•
37.1k
•
28
•
6
RLHFlow/UltraFeedback-preference-standard
Viewer
•
Updated
Apr 27, 2024
•
340k
•
65
•
13
RLHFlow/HH-RLHF-Helpful-standard
Viewer
•
Updated
Apr 27, 2024
•
115k
•
175
•
1
RLHFlow/CodeUltraFeedback-standard
Viewer
•
Updated
Apr 27, 2024
•
50.2k
•
39
•
5
RLHFlow/SFT-OpenHermes-2.5-Standard
Viewer
•
Updated
Apr 24, 2024
•
1M
•
23
•
3
RLHFlow/pair_preference_model_dataset
Viewer
•
Updated
Apr 20, 2024
•
699k
•
28
•
5
Previous
1
2
3
Next