Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pyamy
/
llama3-dpo-llm-judge
like
0
PEFT
TensorBoard
Safetensors
dpo
llama
preference-learning
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
main
llama3-dpo-llm-judge
/
runs
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
pyamy
Upload DPO LLM Judge fine-tuned model
f582c27
verified
16 days ago
Aug10_17-19-08_Cheddar
Upload DPO LLM Judge fine-tuned model
16 days ago