Nathan Lambert

natolambert

https://www.natolambert.com/

AI & ML interests

Reinforcement learning, Ethics, Robotics, Dynamics Models

Recent Activity

liked a model 17 days ago

inclusionAI/Ling-2.6-flash

liked a model 27 days ago

openai/privacy-filter

authored a paper about 1 month ago

The ATOM Report: Measuring the Open Language Model Ecosystem

natolambert's datasets 66

natolambert/rlhf-library

Viewer • Updated Sep 17, 2025 • 864 • 73 • 3

natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-DPO

Viewer • Updated Sep 15, 2025 • 48 • 15

natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-SFT

Viewer • Updated Sep 15, 2025 • 48 • 16

natolambert/rlhf-library-tulu-2-dpo-7b

Viewer • Updated Sep 15, 2025 • 48 • 22

natolambert/rlhf-library-OLMo-2-0425-1B-DPO

Viewer • Updated Sep 15, 2025 • 48 • 14

natolambert/rlhf-library-OLMo-2-0425-1B-SFT

Viewer • Updated Sep 15, 2025 • 48 • 15

natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-DPO

Viewer • Updated Sep 15, 2025 • 48 • 26

natolambert/rlhf-library-tulu-2-7b

Viewer • Updated Sep 15, 2025 • 48 • 12

natolambert/rlhf-library-OLMo-7B-0424-Instruct-hf

Viewer • Updated Sep 15, 2025 • 48 • 13

natolambert/rlhf-library-OLMo-7B-0424-SFT-hf

Viewer • Updated Sep 15, 2025 • 48 • 20

natolambert/rlhf-library-OLMo-7B-Instruct-hf

Viewer • Updated Sep 15, 2025 • 48 • 13

natolambert/rlhf-library-OLMo-7B-SFT-hf

Viewer • Updated Sep 15, 2025 • 48 • 7

natolambert/rlhf-library-OLMo-2-0325-32B-DPO

Viewer • Updated Sep 15, 2025 • 48 • 15

natolambert/rlhf-library-OLMo-2-0325-32B-SFT

Viewer • Updated Sep 15, 2025 • 48 • 16

natolambert/rlhf-library-OLMo-2-1124-13B-DPO

Viewer • Updated Sep 15, 2025 • 48 • 8

natolambert/rlhf-library-OLMo-2-1124-13B-SFT

Viewer • Updated Sep 15, 2025 • 48 • 13

natolambert/rlhf-library-OLMo-2-1124-7B-DPO

Viewer • Updated Sep 15, 2025 • 48 • 11

natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-SFT

Viewer • Updated Sep 15, 2025 • 48 • 11

natolambert/rlhf-library-OLMo-2-1124-7B-SFT

Viewer • Updated Sep 15, 2025 • 48 • 25

natolambert/rlhf-book-prompts-v2

Viewer • Updated Sep 14, 2025 • 16 • 10

natolambert/coconot-r1-debug-debug

Viewer • Updated Jun 30, 2025 • 10 • 13

natolambert/tulu_v3.9_wildchat_100k_english-r1

Viewer • Updated Jun 30, 2025 • 57.4k • 15

natolambert/acecoder-r1

Viewer • Updated Jun 29, 2025 • 63.6k • 21

natolambert/rlvr-code-data-python-r1

Viewer • Updated Jun 29, 2025 • 80k • 30

natolambert/tulu_v3.9_wildchat_100k_english-r1-debug

Viewer • Updated Jun 29, 2025 • 9 • 25

natolambert/hardcoded-test

Viewer • Updated Jun 29, 2025 • 24 • 32

natolambert/rlvr_acecoder_filtered-r1

Updated Jun 28, 2025 • 9

natolambert/the-algorithm-python-r1

Viewer • Updated Jun 28, 2025 • 608 • 37

natolambert/the-algorithm-python-r1-debug

Viewer • Updated Jun 28, 2025 • 10 • 17

natolambert/GeneralThought-430K-filtered

Viewer • Updated Mar 26, 2025 • 338k • 487 • 35