Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xu Huang's picture
3

Xu Huang

XuHuang
  • XuHuang441

AI & ML interests

None yet

Organizations

None yet

models 41

XuHuang/inpo_iter1

Updated Dec 4, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_2_athene_beta1_ratio0.85_eta0.005_weights0.75-0.25

606k • Updated Nov 30, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_1_athene_beta1_ratio0.33_eta0.005

606k • Updated Nov 30, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_2_skywork_beta3_ratio0.85_eta0.01_weights1-0_3pl_fixed

606k • Updated Nov 30, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_2_skywork_ratio0.85_eta0.01

606k • Updated Nov 30, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_1_skywork_inpo_iter1_20k

606k • Updated Nov 30, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_3_armo_beta5_ratio0.33_eta0.0075_weights0.25-0.75_td

606k • Updated Nov 30, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_2_armo_beta1_ratio0.8_eta0.005_weights0.75-0.25_3pl

606k • Updated Nov 30, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_2_armo_puredpoloss

606k • Updated Nov 30, 2025

XuHuang/gemma-2-9b-it_mnpo_stage_2_armo_puredpoloss_beta5

606k • Updated Nov 30, 2025
View 41 models

datasets 3

XuHuang/po_ready_data_10k

Preview • Updated Oct 13, 2025 • 3

XuHuang/inpo_iter2_20k_pref

Updated Sep 15, 2025 • 1

XuHuang/inpo_iter1

Viewer • Updated Aug 18, 2025 • 56.9k • 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs