Datasets and trained checkpoints of Composition-RL
xuxin
xx18
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
submitted
a paper
3 days ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models