Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ZhenghaiXue
/
Qwen2.5-7B-SimpleTIR
like
0
Reinforcement Learning
Safetensors
hkust-nlp/SimpleRL-Zoo-Data
agentica-org/DeepScaleR-Preview-Dataset
English
qwen2
License:
apache-2.0
Model card
Files
Files and versions
Community
README.md exists but content is empty.
Downloads last month
14
Safetensors
Model size
7.62B params
Tensor type
BF16
·
Chat template
Files info
Video Preview
Reinforcement Learning
loading
Model tree for
ZhenghaiXue/Qwen2.5-7B-SimpleTIR
Base model
Qwen/Qwen2.5-7B
Finetuned
(
608
)
this model
Datasets used to train
ZhenghaiXue/Qwen2.5-7B-SimpleTIR
agentica-org/DeepScaleR-Preview-Dataset
Viewer
•
Updated
Feb 10
•
40.3k
•
6.79k
•
149
hkust-nlp/SimpleRL-Zoo-Data
Viewer
•
Updated
Mar 25
•
53.1k
•
511
•
6
Collection including
ZhenghaiXue/Qwen2.5-7B-SimpleTIR
SimpleTIR
Collection
2 items
•
Updated
30 days ago