DCoT - a haritzpuerto Collection

haritzpuerto 's Collections

DCoT

DCoT

updated Jun 10

Models from the ACL 2025 paper "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs" "

Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models

Paper • 2407.03181 • Published Jul 3, 2024 • 1
haritzpuerto/LLaMA2-7B-dcot

Text Generation • Updated Jul 16, 2024 • 8 • 2
haritzpuerto/LLaMA2-13B-dcot

Text Generation • Updated Jul 16, 2024 • 3
haritzpuerto/LLaMA2-70B-dcot

Text Generation • Updated Jul 16, 2024 • 2
haritzpuerto/phi-2-dcot

Text Generation • Updated Jul 16, 2024 • 3 • 1
haritzpuerto/phi-1.5-dcot

Text Generation • Updated Jul 16, 2024 • 3