R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She
kevinpro
AI & ML interests
Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization