Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
9
1
11
Phung Van Duy
pvduy
Follow
zonemercy's profile picture
21world's profile picture
OmbelineM's profile picture
33 followers
·
15 following
PhungVanDuy1
phungvanduy
AI & ML interests
Code Generation, HITL, RLHF
Recent Activity
updated
a model
21 days ago
Intelligent-Internet/II-Medical-8B
updated
a model
21 days ago
Intelligent-Internet/II-Medical-8B-1706
updated
a model
21 days ago
Intelligent-Internet/II-Medical-32B-Preview
View all activity
Organizations
pvduy
's datasets
91
Sort: Recently updated
pvduy/qwen14b_samples
Viewer
•
Updated
Feb 24
•
4.5k
•
1
pvduy/sft-450b-llama3.1_pred
Viewer
•
Updated
Jan 18
•
500
•
4
pvduy/big_math_remaining_rollouts_01142025
Viewer
•
Updated
Jan 16
•
52.8k
•
19
pvduy/orpo-dpo-mix-40k
Viewer
•
Updated
Jan 2
•
45.2k
•
4
pvduy/ppo_verl_math
Viewer
•
Updated
Dec 31, 2024
•
117k
•
3
pvduy/my-distiset-6f4967e8
Viewer
•
Updated
Dec 24, 2024
•
10
•
13
pvduy/merged-master-signals-train-responses
Viewer
•
Updated
Dec 7, 2024
•
300k
•
43
pvduy/web_questions
Viewer
•
Updated
Nov 7, 2024
•
3.78k
•
4
pvduy/simpleqa
Viewer
•
Updated
Nov 7, 2024
•
4.33k
•
115
pvduy/captioning-dummy
Viewer
•
Updated
Oct 21, 2024
•
300
•
2
pvduy/ultrafeedback_binarized_rationalizer_iter1_70b_boostrap
Viewer
•
Updated
Sep 16, 2024
•
39.4k
pvduy/ultrainteract_pair
Viewer
•
Updated
Sep 14, 2024
•
220k
•
7
pvduy/code-feedback-80k-maxrm-critic-iter2-cls
Viewer
•
Updated
Sep 6, 2024
•
157k
•
4
pvduy/train_prefs_ultrafeedback_binarized_critic_llama3_8b
Viewer
•
Updated
Sep 5, 2024
•
91.4k
•
1
pvduy/code-feedback-80k-maxrm-critic-iter2
Viewer
•
Updated
Aug 29, 2024
•
83.2k
•
5
pvduy/argilla-dpo-mix-7k-refined-critic-reformat
Viewer
•
Updated
Aug 27, 2024
•
6.75k
•
1
pvduy/code-feedback-80k-maxrm-critic
Viewer
•
Updated
Aug 25, 2024
•
80.1k
•
5
pvduy/code-feedback-deepseekv2-critic
Viewer
•
Updated
Aug 24, 2024
•
132k
•
6
•
1
pvduy/lmsys-train
Viewer
•
Updated
Aug 23, 2024
•
39.7k
•
4
pvduy/code-feedback-50k-maxrm-critic
Viewer
•
Updated
Aug 23, 2024
•
50.1k
pvduy/code-feedback-10k-deepseekv2-critic
Viewer
•
Updated
Aug 21, 2024
•
51.4k
•
13
pvduy/code-feedback-10k-maxrm-critic
Viewer
•
Updated
Aug 20, 2024
•
10.1k
•
2
pvduy/argilla-dpo-mix-7k-gpt4o-qwen2-ensemble
Viewer
•
Updated
Aug 19, 2024
•
6.07k
•
4
pvduy/argilla-dpo-mix-7k-gpt4o-refined-remove-same
Viewer
•
Updated
Aug 19, 2024
•
4.67k
pvduy/m-a-p-codefeedback-mistral-large
Viewer
•
Updated
Aug 14, 2024
•
157k
•
1
pvduy/reward_bench
Viewer
•
Updated
Aug 14, 2024
•
2.99k
•
6
pvduy/argilla-dpo-mix-7k-gpt4o-refined
Viewer
•
Updated
Aug 13, 2024
•
6.75k
•
1
pvduy/SWE-bench_Verified_oracle
Viewer
•
Updated
Aug 13, 2024
•
1k
•
16
pvduy/exp_dpo_func
Viewer
•
Updated
Jul 25, 2024
•
11.8k
•
8
pvduy/nestar-ppo
Viewer
•
Updated
May 14, 2024
•
183k
•
2
Previous
1
2
3
4
Next