Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
pyamy
/
llama3-dpo-llm-judge
like
0
PEFT
TensorBoard
Safetensors
dpo
llama
preference-learning
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
main
llama3-dpo-llm-judge
Ctrl+K
Ctrl+K
1 contributor
History:
13 commits
pyamy
Upload README.md with huggingface_hub
0699fd0
verified
14 days ago
checkpoint-100
Upload DPO LLM Judge fine-tuned model
15 days ago
checkpoint-150
Upload DPO LLM Judge fine-tuned model
15 days ago
checkpoint-200
Upload DPO LLM Judge fine-tuned model
15 days ago
checkpoint-250
Upload DPO LLM Judge fine-tuned model
15 days ago
checkpoint-50
Upload DPO LLM Judge fine-tuned model
15 days ago
checkpoint-500
Upload DPO LLM Judge fine-tuned model
15 days ago
runs
Upload DPO LLM Judge fine-tuned model
15 days ago
.gitattributes
Safe
1.97 kB
Upload DPO LLM Judge fine-tuned model
15 days ago
README.md
1.32 kB
Upload README.md with huggingface_hub
14 days ago
adapter_config.json
Safe
932 Bytes
Upload DPO LLM Judge fine-tuned model
15 days ago
adapter_model.safetensors
Safe
6.83 MB
LFS
Upload DPO LLM Judge fine-tuned model
15 days ago
chat_template.jinja
Safe
3.92 kB
Upload DPO LLM Judge fine-tuned model
15 days ago
special_tokens_map.json
Safe
342 Bytes
Upload DPO LLM Judge fine-tuned model
15 days ago
tokenizer.json
Safe
17.2 MB
LFS
Upload DPO LLM Judge fine-tuned model
15 days ago
tokenizer_config.json
Safe
52.6 kB
Upload DPO LLM Judge fine-tuned model
15 days ago
training_args.bin
pickle
Detected Pickle imports (11)
"accelerate.state.PartialState"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.trainer_utils.SchedulerType"
,
"trl.trainer.dpo_config.FDivergenceType"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.trainer_utils.SaveStrategy"
,
"trl.trainer.dpo_config.DPOConfig"
,
"transformers.training_args.OptimizerNames"
,
"torch.device"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"accelerate.utils.dataclasses.DistributedType"
How to fix it?
6.26 kB
LFS
Upload DPO LLM Judge fine-tuned model
15 days ago
training_history.json
Safe
219 Bytes
Upload DPO LLM Judge fine-tuned model
15 days ago
training_metrics.json
Safe
13 kB
Upload DPO LLM Judge fine-tuned model
15 days ago