This collection contains models described in the refusal token paper published in COLM 2025.
-
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast
8B • Updated -
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens
8B • Updated -
tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-single-token
8B • Updated -
tomg-group-umd/zephyr-llama3-8b-sft-no-refusal-messages
8B • Updated