Files changed (1) hide show
  1. README.md +38 -24
README.md CHANGED
@@ -1,24 +1,38 @@
1
- ---
2
- pipeline_tag: text-generation
3
- inference: true
4
- license: apache-2.0
5
- datasets:
6
- - simplescaling/s1K-1.1
7
- base_model:
8
- - Qwen/Qwen2.5-0.5B-Instruct
9
- library_name: transformers
10
- ---
11
-
12
- # Model Summary
13
-
14
- > s1.1-0.5B is a sucessor of [s1](https://huggingface.co/2stacks/s1-0.5B) with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer grade GPUs.
15
-
16
- - **Logs:** https://wandb.ai/2stacks-sms/s1/runs/ishervdt?nw=nwuser2stacks
17
- - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
18
- - **Paper:** https://arxiv.org/abs/2501.19393
19
-
20
- Thanks to [Ryan Marten](https://huggingface.co/ryanmarten) for helping generate r1 traces for s1K.
21
-
22
- # Use
23
-
24
- The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: text-generation
3
+ inference: true
4
+ license: apache-2.0
5
+ datasets:
6
+ - simplescaling/s1K-1.1
7
+ base_model:
8
+ - Qwen/Qwen2.5-0.5B-Instruct
9
+ library_name: transformers
10
+ language:
11
+ - zho
12
+ - eng
13
+ - fra
14
+ - spa
15
+ - por
16
+ - deu
17
+ - ita
18
+ - rus
19
+ - jpn
20
+ - kor
21
+ - vie
22
+ - tha
23
+ - ara
24
+ ---
25
+
26
+ # Model Summary
27
+
28
+ > s1.1-0.5B is a sucessor of [s1](https://huggingface.co/2stacks/s1-0.5B) with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer grade GPUs.
29
+
30
+ - **Logs:** https://wandb.ai/2stacks-sms/s1/runs/ishervdt?nw=nwuser2stacks
31
+ - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
32
+ - **Paper:** https://arxiv.org/abs/2501.19393
33
+
34
+ Thanks to [Ryan Marten](https://huggingface.co/ryanmarten) for helping generate r1 traces for s1K.
35
+
36
+ # Use
37
+
38
+ The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference).