Viewer
•
Updated
•
3.83k
•
19.7k
trl-lib/documentation-images
Viewer
•
Updated
•
11
•
61.3k
Viewer
•
Updated
•
103k
•
4.11k
•
5
trl-lib/llava-instruct-mix
Viewer
•
Updated
•
228k
•
1.55k
•
2
trl-lib/OpenMathReasoning
Viewer
•
Updated
•
3.2M
•
355
trl-lib/chatbot_arena_completions
Viewer
•
Updated
•
33k
•
294
•
1
Viewer
•
Updated
•
83.1k
•
227
•
3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
•
16.6k
•
254
•
4
trl-lib/ultrafeedback-prompt
Viewer
•
Updated
•
39.8k
•
319
•
9
Viewer
•
Updated
•
179k
•
1.13k
•
3
Viewer
•
Updated
•
130k
•
2.89k
•
30
Viewer
•
Updated
•
41.2k
•
408
•
2
Viewer
•
Updated
•
445k
•
2.13k
•
11
trl-lib/lm-human-preferences-sentiment
Viewer
•
Updated
•
6.26k
•
220
trl-lib/lm-human-preferences-descriptiveness
Viewer
•
Updated
•
6.26k
•
34
•
1
trl-lib/hh-rlhf-helpful-base
Viewer
•
Updated
•
46.2k
•
1.2k
•
3
Viewer
•
Updated
•
51.8k
•
32
trl-lib/Capybara-Preferences
Viewer
•
Updated
•
15.4k
•
10
Viewer
•
Updated
•
16k
•
3.7k
•
17
trl-lib/ultrafeedback_binarized
Viewer
•
Updated
•
63.1k
•
4.07k
•
23
trl-lib/capybara-preferencces-7k
Viewer
•
Updated
•
7.56k
•
4
Viewer
•
Updated
•
15k
•
346
•
9
trl-lib/ultrachat_200k_chatml
Viewer
•
Updated
•
231k
•
17
•
3