TODO: improve model card
Trained from Apertus-8B Base using the Liger kernel, with a two-step recipe: Anchored SFT, then LD-DPO.
Prompt format: Alpaca template, no system prompt.
Recommended sampling: temperature 0.7, top_p 0.95, no repetition penalty or DRY.
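A rough sketch of the prompt format and sampler settings above, in plain Python. Assumptions not stated in this card: that "no system" means dropping the usual Alpaca preamble sentence, that the headers are the common `### Instruction:` / `### Response:` pair, and that the settings are passed under the keyword names used by Hugging Face `generate`. `build_prompt` and `SAMPLING` are illustrative names only.

```python
# Hypothetical helper: Alpaca-style prompt with no system/preamble text.
# The exact header strings are an assumption based on the common Alpaca format.
def build_prompt(instruction: str) -> str:
    return f"### Instruction:\n{instruction.strip()}\n\n### Response:\n"


# Sampler settings from the card, using Hugging Face `generate` keyword names.
# repetition_penalty=1.0 means "off"; DRY is simply left unset.
SAMPLING = {
    "do_sample": True,
    "temperature": 0.7,
    "top_p": 0.95,
    "repetition_penalty": 1.0,
}

if __name__ == "__main__":
    print(build_prompt("Summarize the plot of Hamlet in two sentences."))
    print(SAMPLING)
```

With `transformers`, the dict would typically be unpacked into the generation call, for example `model.generate(**inputs, **SAMPLING)`; other backends may use different option names.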