TODO: improve model card

Trained from Apertus-8B Base using liger kernel with a two step Anchored SFT > LD-DPO recipe.

Alpaca template, no system.

.7 temp, top_p .95, no rep pen or dry

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ConicCat/Cap-ertus-8B

Base model

Finetuned

(14)

this model

Quantizations

ConicCat
/

Cap-ertus-8B