mlfoundations-dev/stackexchange_cogsci
Text Generation
•
8B
•
Updated
•
5
•
1
mlfoundations-dev/stackexchange_cseducators
Text Generation
•
8B
•
Updated
•
5
•
1
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_quant
8B
•
Updated
•
5
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_reverseengineering
Text Generation
•
8B
•
Updated
•
5
•
1
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_puzzling
Text Generation
•
8B
•
Updated
•
5
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_quantumcomputing
Text Generation
•
8B
•
Updated
•
6
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_politics
Text Generation
•
8B
•
Updated
•
5
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_physics
8B
•
Updated
•
5
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_proofassistants
Text Generation
•
8B
•
Updated
•
4
mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_poker
Text Generation
•
8B
•
Updated
•
5
mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr
Text Generation
•
7B
•
Updated
•
37
mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
36
mlfoundations-dev/oh-mistral-bs1024_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr
Text Generation
•
7B
•
Updated
•
37
mlfoundations-dev/oh-mistral-bs1024_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
34
mlfoundations-dev/oh-mistral-bs2048_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr
Text Generation
•
Updated
•
38
mlfoundations-dev/oh-mistral-bs2048_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
34
mlfoundations-dev/oh-mistral-bs2048_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
36
mlfoundations-dev/oh-mistral-bs4096_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr
Text Generation
•
7B
•
Updated
•
9
mlfoundations-dev/oh-mistral-bs4096_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
10
mlfoundations-dev/oh-mistral-bs4096_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
35
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_1.0
Text Generation
•
7B
•
Updated
•
5
mlfoundations-dev/oh-mistral-bs512_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
29
mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
•
Updated
•
35
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.9
Text Generation
•
7B
•
Updated
•
5
mlfoundations-dev/oh-mistral-bs1024_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
36
mlfoundations-dev/oh-mistral-bs1024_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
37
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.8
Text Generation
•
7B
•
Updated
•
4
mlfoundations-dev/oh-mistral-bs2048_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
38
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.7
Text Generation
•
7B
•
Updated
•
5
mlfoundations-dev/oh-mistral-bs4096_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
•
7B
•
Updated
•
37