Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models
Paper
•
2407.03181
•
Published
•
1
Models from the ACL 2025 paper "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs" "