CodCodingCode/llama-3.1-8b-grpo-v1.2


1 contributor

History: 3 commits

Create handler.py

61d1ef9 verified about 2 months ago

File                              Size       Storage  Last commit                             Updated
.gitattributes                    1.57 kB             GRPO-trained model from checkpoint-670  about 2 months ago
config.json                       867 Bytes           GRPO-trained model from checkpoint-670  about 2 months ago
generation_config.json            180 Bytes           GRPO-trained model from checkpoint-670  about 2 months ago
handler.py                        2.21 kB             Create handler.py                       about 2 months ago
model-00001-of-00004.safetensors  4.98 GB    xet      GRPO-trained model from checkpoint-670  about 2 months ago
model-00002-of-00004.safetensors  5 GB       xet      GRPO-trained model from checkpoint-670  about 2 months ago
model-00003-of-00004.safetensors  4.92 GB    xet      GRPO-trained model from checkpoint-670  about 2 months ago
model-00004-of-00004.safetensors  1.17 GB    xet      GRPO-trained model from checkpoint-670  about 2 months ago
model.safetensors.index.json      24 kB               GRPO-trained model from checkpoint-670  about 2 months ago
special_tokens_map.json           449 Bytes           GRPO-trained model from checkpoint-670  about 2 months ago
tokenizer.json                    17.2 MB    xet      GRPO-trained model from checkpoint-670  about 2 months ago
tokenizer_config.json             50.6 kB             GRPO-trained model from checkpoint-670  about 2 months ago