This model was trained on a Claude Sonnet 4 (non-reasoning) dataset and a Claude Sonnet 3.7 (reasoning) dataset. It is a reasoning model.
Chat template
4-bit
Base model