Ru Peng

RuPeng

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago
Group Sequence Policy Optimization
upvoted a paper 14 days ago
Agentic Reinforced Policy Optimization
upvoted a paper 30 days ago
Reinforcement Learning with Rubric Anchors

Organizations

None yet

Collections 1

DataMan
  • RuPeng/DataMan-1.5B-EN

    2B • Updated Aug 7 • 56
  • RuPeng/DataMan-1.5B-ZH

    2B • Updated Aug 8 • 15
  • RuPeng/DataMan-MoE-A2.7B-EN

    14B • Updated Aug 8 • 5
  • RuPeng/DataMan-MoE-A2.7B-ZH

    14B • Updated Aug 9 • 4

Papers 6

arxiv:2502.19363
arxiv:2408.10764
arxiv:2407.10671
arxiv:2407.04078

Models 4

RuPeng/DataMan-MoE-A2.7B-ZH

14B • Updated Aug 9 • 4

RuPeng/DataMan-MoE-A2.7B-EN

14B • Updated Aug 8 • 5

RuPeng/DataMan-1.5B-ZH

2B • Updated Aug 8 • 15

RuPeng/DataMan-1.5B-EN

2B • Updated Aug 7 • 56

Datasets 1

RuPeng/DataPajama

Updated May 8 • 7