Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
113
47
615
Nathan Lambert
natolambert
Follow
samforeman's profile picture
DrDrunkenstein22's profile picture
nicoboou's profile picture
283 followers
·
37 following
https://www.natolambert.com/
natolambert
natolambert
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
liked
a model
9 days ago
openbmb/BitCPM-CANN-3B-unquantized
liked
a model
15 days ago
perplexity-ai/pplx-embed-v1-late-0.6b
liked
a model
about 1 month ago
inclusionAI/Ling-2.6-flash
View all activity
Organizations
natolambert
's datasets
66
Sort: Recently updated
natolambert/rlhf-library
Viewer
•
Updated
Sep 17, 2025
•
864
•
81
•
3
natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-DPO
Viewer
•
Updated
Sep 15, 2025
•
48
•
22
natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-SFT
Viewer
•
Updated
Sep 15, 2025
•
48
•
26
natolambert/rlhf-library-tulu-2-dpo-7b
Viewer
•
Updated
Sep 15, 2025
•
48
•
22
natolambert/rlhf-library-OLMo-2-0425-1B-DPO
Viewer
•
Updated
Sep 15, 2025
•
48
•
22
natolambert/rlhf-library-OLMo-2-0425-1B-SFT
Viewer
•
Updated
Sep 15, 2025
•
48
•
25
natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-DPO
Viewer
•
Updated
Sep 15, 2025
•
48
•
21
natolambert/rlhf-library-tulu-2-7b
Viewer
•
Updated
Sep 15, 2025
•
48
•
17
natolambert/rlhf-library-OLMo-7B-0424-Instruct-hf
Viewer
•
Updated
Sep 15, 2025
•
48
•
21
natolambert/rlhf-library-OLMo-7B-0424-SFT-hf
Viewer
•
Updated
Sep 15, 2025
•
48
•
35
natolambert/rlhf-library-OLMo-7B-Instruct-hf
Viewer
•
Updated
Sep 15, 2025
•
48
•
18
natolambert/rlhf-library-OLMo-7B-SFT-hf
Viewer
•
Updated
Sep 15, 2025
•
48
•
12
natolambert/rlhf-library-OLMo-2-0325-32B-DPO
Viewer
•
Updated
Sep 15, 2025
•
48
•
25
natolambert/rlhf-library-OLMo-2-0325-32B-SFT
Viewer
•
Updated
Sep 15, 2025
•
48
•
23
natolambert/rlhf-library-OLMo-2-1124-13B-DPO
Viewer
•
Updated
Sep 15, 2025
•
48
•
12
natolambert/rlhf-library-OLMo-2-1124-13B-SFT
Viewer
•
Updated
Sep 15, 2025
•
48
•
14
natolambert/rlhf-library-OLMo-2-1124-7B-DPO
Viewer
•
Updated
Sep 15, 2025
•
48
•
15
natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-SFT
Viewer
•
Updated
Sep 15, 2025
•
48
•
17
natolambert/rlhf-library-OLMo-2-1124-7B-SFT
Viewer
•
Updated
Sep 15, 2025
•
48
•
25
natolambert/rlhf-book-prompts-v2
Viewer
•
Updated
Sep 14, 2025
•
16
•
14
natolambert/coconot-r1-debug-debug
Viewer
•
Updated
Jun 30, 2025
•
10
•
30
natolambert/tulu_v3.9_wildchat_100k_english-r1
Viewer
•
Updated
Jun 30, 2025
•
57.4k
•
230
natolambert/acecoder-r1
Viewer
•
Updated
Jun 29, 2025
•
63.6k
•
30
natolambert/rlvr-code-data-python-r1
Viewer
•
Updated
Jun 29, 2025
•
80k
•
123
natolambert/tulu_v3.9_wildchat_100k_english-r1-debug
Viewer
•
Updated
Jun 29, 2025
•
9
•
30
natolambert/hardcoded-test
Viewer
•
Updated
Jun 29, 2025
•
24
•
25
natolambert/rlvr_acecoder_filtered-r1
Updated
Jun 28, 2025
•
10
natolambert/the-algorithm-python-r1
Viewer
•
Updated
Jun 28, 2025
•
608
•
22
natolambert/the-algorithm-python-r1-debug
Viewer
•
Updated
Jun 28, 2025
•
10
•
20
natolambert/GeneralThought-430K-filtered
Viewer
•
Updated
Mar 26, 2025
•
338k
•
2.32k
•
35
Previous
1
2
3
Next