Latent Agents: A Post-Training Procedure for Internalized Multi-Agent Debate Paper • 2604.24881 • Published 15 days ago
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 +1 eliebak, lvwerra, lewtun • Jan 28, 2025 • 888