From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning Paper • 2512.01970 • Published Dec 1, 2025 • 1
DifferentiableEvolutionaryRL/DERL-Meta-Optimizer-Init-Qwen2.5-0.5B-Instruct Text Generation • 0.5B • Updated 12 days ago • 16