
Andrew Zhao

andrewzh
https://andrewzh112.github.io/
  • _AndrewZhao
  • Andrewzh112
  • andrewqzhao

AI & ML interests

Reinforcement Learning, Agents

Recent Activity

upvoted a paper about 1 month ago
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
upvoted a paper 2 months ago
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
upvoted a paper 3 months ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Organizations

None yet

andrewzh's datasets

None public yet