Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
5
6
Zhaolin Gao
GitBag
Follow
kirankc's profile picture
dark-pen's profile picture
LeroyDyer's profile picture
3 followers
·
2 following
https://zhaolingao.github.io/
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
updated
a dataset
24 days ago
GitBag/qwen2.5-1.5b-1.5b-math500-value
published
a dataset
about 1 month ago
GitBag/qwen2.5-1.5b-1.5b-math500-value
updated
a dataset
2 months ago
GitBag/math_qwen3_1.7B_8192_n_128_eval_len
View all activity
Organizations
GitBag
's datasets
468
Sort: Recently updated
GitBag/1743824500
Viewer
•
Updated
Apr 5
•
7.1k
•
3
GitBag/1743824510
Viewer
•
Updated
Apr 5
•
7.1k
•
2
GitBag/1743824507
Viewer
•
Updated
Apr 5
•
7.1k
•
1
GitBag/1743824512
Viewer
•
Updated
Apr 5
•
7.1k
•
1
GitBag/1743824497
Viewer
•
Updated
Apr 5
•
7.1k
GitBag/1743824506
Viewer
•
Updated
Apr 5
•
7.1k
•
1
GitBag/1743727597
Viewer
•
Updated
Apr 4
•
7.47k
•
1
GitBag/1743727721
Viewer
•
Updated
Apr 4
•
7.47k
•
1
GitBag/1743727695
Viewer
•
Updated
Apr 4
•
7.47k
•
2
GitBag/1743724600
Viewer
•
Updated
Apr 3
•
256
GitBag/llama3-uf-dp-from1735956551-token-rfst-1k3k_harvard
Viewer
•
Updated
Jan 25
•
91.9k
•
5
GitBag/llama3-uf-dp-from1735956551-token-rfst-1k3k
Viewer
•
Updated
Jan 25
•
91.9k
•
2
GitBag/llama3-uf-dp-from1735956551-token-st-1k3k_harvard
Viewer
•
Updated
Jan 25
•
50.5k
•
5
GitBag/llama3-uf-dp-from1735956551-token-rf-1k3k_harvard
Viewer
•
Updated
Jan 25
•
41.4k
•
3
GitBag/llama3-uf-dp-from1735956551-token-oa-1k3k
Viewer
•
Updated
Jan 25
•
45.2k
•
4
GitBag/llama3-uf-dp-from1735956551-token-st-1k3k
Viewer
•
Updated
Jan 25
•
50.5k
•
4
GitBag/llama3-uf-dp-from1735956551-token-rf-1k3k
Viewer
•
Updated
Jan 25
•
41.4k
•
2
GitBag/regenerated_responses_from_base_harvard
Viewer
•
Updated
Jan 17
•
55.1k
•
4
GitBag/llama3-uf-dp-from1735956551-same-turn
Viewer
•
Updated
Jan 13
•
56.6k
•
1
GitBag/llama3-uf-dp-from1735956551-reinforce
Viewer
•
Updated
Jan 13
•
57.9k
•
4
GitBag/llama3-uf-dp-from1735956551-token-st-1k1k_mosaic
Viewer
•
Updated
Jan 13
•
22.4k
•
2
GitBag/llama3-uf-dp-from1735956551-token-rf-1k1k_mosaic
Viewer
•
Updated
Jan 13
•
24.5k
•
3
GitBag/llama3-uf-dp-from1735956551-token-oa-1k1k_mosaic
Viewer
•
Updated
Jan 13
•
16.5k
•
1
GitBag/llama3-uf-dp-from1735956551-token-st-1k1k_harvard
Viewer
•
Updated
Jan 12
•
22.4k
•
1
GitBag/llama3-uf-dp-from1735956551-token-rf-1k1k_harvard
Viewer
•
Updated
Jan 12
•
24.5k
•
3
GitBag/llama3-uf-dp-from1735956551-token-oa-1k1k
Viewer
•
Updated
Jan 12
•
16.5k
•
2
GitBag/llama3-uf-dp-from1735956551-token-st-1k1k
Viewer
•
Updated
Jan 12
•
22.4k
•
2
GitBag/llama3-uf-dp-from1735956551-token-rf-1k1k
Viewer
•
Updated
Jan 12
•
24.5k
•
2
GitBag/llama3-uf-dp-from1735956551-armo
Viewer
•
Updated
Jan 12
•
60.8k
•
1
GitBag/1735961901_eval
Viewer
•
Updated
Jan 4
•
1k
•
2
Previous
1
...
4
5
6
7
8
...
16
Next