Base models trained on 1T high-quality tokens, demonstrating strong competitiveness among existing SOTA small models (<2B).
ParScale
community
AI & ML interests
None defined yet.
models
67

ParScale/ParScale-1.8B-P1-Inst
Text Generation
•
2B
•
Updated
•
74
•
1

ParScale/ParScale-1.8B-P2-Inst
Text Generation
•
2B
•
Updated
•
9

ParScale/ParScale-1.8B-P4-Inst
Text Generation
•
2B
•
Updated
•
12
•
1

ParScale/ParScale-1.8B-P8-Inst
Text Generation
•
2B
•
Updated
•
64
•
1

ParScale/ParScale-1.8B-P1
Text Generation
•
2B
•
Updated
•
15
•
1

ParScale/ParScale-1.8B-P2
Text Generation
•
2B
•
Updated
•
18

ParScale/ParScale-1.8B-P4
Text Generation
•
2B
•
Updated
•
40
•
1

ParScale/ParScale-Qwen-3B-P2-Python
Text Generation
•
3B
•
Updated
•
9

ParScale/ParScale-Qwen-3B-P4-Python
Text Generation
•
3B
•
Updated
•
13

ParScale/ParScale-Qwen-3B-P8-Python
Text Generation
•
3B
•
Updated
•
40
datasets
0
None public yet