Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted a paper 13 minutes ago
Composer 2 Technical Report upvoted a paper about 10 hours ago
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training upvoted a paper about 10 hours ago
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward
Modeling and LLM AlignmentOrganizations
None yet