Code datasets for pretraining
Orion
Orion-zhen
AI & ML interests
Eco-friendly training using Tesla P4. Prefers (FSDP+)QLoRA.
Qwen3-Dense-AWQ
AWQ quantizations of the Qwen3 Dense series, released on day 0!
🤯Emoji datasets
All for emoji!
Calibration datasets
Datasets used for various calibrations
Unalignments
Datasets used to unalign models
Free Spaces
Powerful apps built on free HF Spaces
Qwen2.5 Series
Llama3-Orion
My Llama 3 models
Orion-zhen/Llama3-70B-Orion-Chinese
Text Generation • 71B • Updated • 51 • 14
Orion-zhen/Llama3-70B-Orion-Chinese-SE
Text Generation • 71B • Updated • 13
Orion-zhen/Llama3-70B-Orion-Chinese-Plus
Text Generation • 71B • Updated • 9
Orion-zhen/Llama3-70B-Orion-Chinese-Ultra
Text Generation • 71B • Updated • 7 • 1
Reasoning
Datasets focused on reasoning
Code4Pretrain
Code datasets for pretraining