Model with No Refusal Tokens

This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B on UltraChat SFT, CoCoNoT refusals, and CoCoNoT's contrast data as SFT data. Note that this model is not the model found in the paper the original models are not able to be released due to corporate legalities.

Suggusted Method For Generations

For generating a output from this model, please refer to the code found in repo in the coconot_eval folder. However, for this model, model.generate or pipeline are also sufficient.

Downloads last month
4
Safetensors
Model size
8.03B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast