chchen/Mistral-7B-Instruct-v0.2-ORPO

PEFT
Safetensors
llama-factory
lora
trl
dpo
Generated from Trainer
Mistral-7B-Instruct-v0.2-ORPO / .ipynb_checkpoints
  • 1 contributor
History: 2 commits
chchen
Training in progress, step 500
a144b89 verified over 1 year ago
  • lora_orpo-checkpoint.yaml
    832 Bytes
    Training in progress, step 500 over 1 year ago
  • tokenizer_config-checkpoint.json
    1.41 kB
    Training in progress, step 500 over 1 year ago
  • training_loss-checkpoint.png
    45.1 kB
    Training in progress, step 500 over 1 year ago
  • training_rewards_accuracies-checkpoint.png
    54.9 kB
    Training in progress, step 500 over 1 year ago
  • training_sft_loss-checkpoint.png
    45.6 kB
    Training in progress, step 500 over 1 year ago