This model is intended for testing the conversion between Megatron-LM and transformers. It is a small GPT-2-like model that was used to debug the conversion script. Use it only for integration tests.
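As a rough illustration of the kind of integration test this checkpoint is meant for, the sketch below loads the model and runs one forward pass. It assumes that `transformers` and `torch` are installed, that the checkpoint resolves through `AutoModelForCausalLM`, and that small token IDs are valid for its vocabulary. The helper names `load_test_model` and `smoke_test` are hypothetical and not part of any library.

```python
MODEL_ID = "bigscience/bigscience-small-testing"


def load_test_model(model_id: str = MODEL_ID):
    """Load the tiny checkpoint (assumption: AutoModelForCausalLM can resolve it)."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.eval()  # disable dropout for deterministic checks
    return model


def smoke_test():
    """Run one forward pass and check the logits shape (hypothetical helper)."""
    import torch

    model = load_test_model()
    # Arbitrary small token IDs; assumed to be within the model's vocabulary.
    input_ids = torch.tensor([[1, 2, 3, 4]])
    with torch.no_grad():
        logits = model(input_ids=input_ids).logits
    # Logits are expected to have shape (batch, sequence_length, vocab_size).
    assert logits.shape[:2] == input_ids.shape
    return tuple(logits.shape)
```

Calling `smoke_test()` downloads the checkpoint on first use, so it needs network access. The outputs are only useful for checking that the conversion and loading path works, not for judging model quality.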
Safetensors: model size 16.2M params, tensor type BF16
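The figures above give a quick estimate of how much space the weights take. The sketch below assumes the rounded count of 16.2M parameters and 2 bytes per BF16 value. It ignores the safetensors header and any non-tensor files, so the real download size will differ slightly.

```python
# Rough size estimate for the weights, using the figures from the model card.
NUM_PARAMS = 16_200_000  # "16.2M params" (rounded on the card)
BYTES_PER_BF16 = 2       # bfloat16 is a 16-bit format


def weights_size_mib(num_params: int, bytes_per_param: int) -> float:
    """Return the raw tensor payload size in MiB (file headers excluded)."""
    return num_params * bytes_per_param / (1024 ** 2)


size_mib = weights_size_mib(NUM_PARAMS, BYTES_PER_BF16)
print(f"~{size_mib:.1f} MiB")  # prints "~30.9 MiB"
```

At roughly 31 MiB, the checkpoint is small enough to download and load inside a test suite.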
Model tree for bigscience/bigscience-small-testing