Nathan Lambert
natolambert
263 followers · 36 following
https://www.natolambert.com/
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
updated a dataset about 20 hours ago: allenai/Dolci-Instruct-SFT-Tool-Use-SA
liked a model about 24 hours ago: Qwen/Qwen3.5-4B
upvoted a paper 11 days ago: Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
natolambert's datasets (66)
natolambert/rlhf-library • Viewer • Updated Sep 17, 2025 • 864 • 9 • 3
natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 6
natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 11
natolambert/rlhf-library-tulu-2-dpo-7b • Viewer • Updated Sep 15, 2025 • 48 • 7
natolambert/rlhf-library-OLMo-2-0425-1B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 8
natolambert/rlhf-library-OLMo-2-0425-1B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 6
natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 14
natolambert/rlhf-library-tulu-2-7b • Viewer • Updated Sep 15, 2025 • 48 • 21
natolambert/rlhf-library-OLMo-7B-0424-Instruct-hf • Viewer • Updated Sep 15, 2025 • 48 • 9
natolambert/rlhf-library-OLMo-7B-0424-SFT-hf • Viewer • Updated Sep 15, 2025 • 48 • 11
natolambert/rlhf-library-OLMo-7B-Instruct-hf • Viewer • Updated Sep 15, 2025 • 48 • 5
natolambert/rlhf-library-OLMo-7B-SFT-hf • Viewer • Updated Sep 15, 2025 • 48 • 4
natolambert/rlhf-library-OLMo-2-0325-32B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 5
natolambert/rlhf-library-OLMo-2-0325-32B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 5
natolambert/rlhf-library-OLMo-2-1124-13B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 4
natolambert/rlhf-library-OLMo-2-1124-13B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 4
natolambert/rlhf-library-OLMo-2-1124-7B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 3
natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 6
natolambert/rlhf-library-OLMo-2-1124-7B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 3
natolambert/rlhf-book-prompts-v2 • Viewer • Updated Sep 14, 2025 • 16 • 6
natolambert/coconot-r1-debug-debug • Viewer • Updated Jun 30, 2025 • 10 • 8
natolambert/tulu_v3.9_wildchat_100k_english-r1 • Viewer • Updated Jun 30, 2025 • 57.4k • 6
natolambert/acecoder-r1 • Viewer • Updated Jun 29, 2025 • 63.6k • 8
natolambert/rlvr-code-data-python-r1 • Viewer • Updated Jun 29, 2025 • 80k • 17
natolambert/tulu_v3.9_wildchat_100k_english-r1-debug • Viewer • Updated Jun 29, 2025 • 9 • 6
natolambert/hardcoded-test • Viewer • Updated Jun 29, 2025 • 24 • 7
natolambert/rlvr_acecoder_filtered-r1 • Updated Jun 28, 2025 • 6
natolambert/the-algorithm-python-r1 • Viewer • Updated Jun 28, 2025 • 608 • 14
natolambert/the-algorithm-python-r1-debug • Viewer • Updated Jun 28, 2025 • 10 • 16
natolambert/GeneralThought-430K-filtered • Viewer • Updated Mar 26, 2025 • 338k • 261 • 34