
Kexin Huang

737443h
https://kexinhuang02.github.io

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization
authored a paper 1 day ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
authored a paper 1 day ago
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Organizations

None yet

authored 3 papers 1 day ago

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 11 days ago • 109

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 8 days ago • 27

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 8 days ago • 6
authored 3 papers 6 months ago

RePO: ReLU-based Preference Optimization

Paper • 2503.07426 • Published Mar 10, 2025 • 2

SPRec: Self-Play to Debias LLM-based Recommendation

Paper • 2412.09243 • Published Dec 12, 2024

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 119