Hugging Face
Jin Zhu
mamba413
2 followers · 2 following
https://mamba413.github.io/
Mamba413
AI & ML interests
reinforcement learning
Recent Activity
New activity 6 days ago: AyoubChLin/CNN_News_Articles_2011-2022: Request: DOI
Liked a dataset 8 days ago: euirim/goodwiki
Liked a dataset 9 days ago: toloka/beemo
Organizations
None yet
mamba413's models (10)
mamba413/Qwen2.5-1.5B-PPO-DR-HH-Seed1 • 2B • Updated Mar 21
mamba413/Qwen2.5-1.5B-PPO-BENCH-HH-Seed1 • 2B • Updated Mar 21
mamba413/Qwen2.5-1.5B-Instruct-Reward-BENCH-HH-Seed1 • 2B • Updated Mar 21
mamba413/Qwen2.5-1.5B-Instruct-Reward-BENCH-HH-Seed0 • Updated Mar 20
mamba413/Qwen2.5-1.5B-Instruct-Reward-DR-HH-Seed0 • Updated Mar 20
mamba413/Qwen2-0.5B-Reward-DR-HH-Seed0 • Text Classification • 0.5B • Updated Mar 19
mamba413/Qwen2.5-1.5B-Reward-DR-IMDB-Seed0 • Updated Mar 18
mamba413/Qwen2.5-1.5B-Reward-DR-SIMU-Seed0 • Updated Mar 18
mamba413/Qwen2-0.5B-Reward-DR-SIMU-Seed0 • Text Classification • 0.5B • Updated Mar 16
mamba413/Qwen2-0.5B-Reward-DR-SIMU • Text Classification • 0.5B • Updated Mar 15