Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yinxu Pan's picture
24 59 230

Yinxu Pan

cppowboy
21world's profile picture Lynncc6's profile picture davanstrien's profile picture
·
https://github.com/Cppowboy
  • pnynx3
  • Cppowboy

AI & ML interests

RL for LLM, Code&Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity

upvoted a paper about 2 hours ago
A Survey of Reinforcement Learning for Large Reasoning Models
upvoted a paper about 2 hours ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
upvoted a paper 2 days ago
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
View all activity

Organizations

Diffusers Pipelines Library for Stable Diffusion's profile picture OpenBMB's profile picture XAgentCommunity's profile picture

cppowboy 's models 2

cppowboy/XAgentLLaMa-7B-preview

Text Generation • Updated Nov 21, 2023 • 13

cppowboy/XAgentLLaMa-34B-preview

Updated Nov 20, 2023
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs