Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Girinath11
/
MixtureofRecursionwithRouter
like
1
Text Generation
Transformers
recursive-transformer
technical-content
code-generation
math
conversation
bpe-tokenizer
adaptive-routing
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
MixtureofRecursionwithRouter
Ctrl+K
Ctrl+K
1 contributor
History:
23 commits
Girinath11
Update README.md
5a4d89b
verified
10 days ago
checkpoints
Rename best_model.pt to checkpoints/best_model.pt
10 days ago
split_data
Rename slm_training_complete_chat_val (1).txt to split_data/slm_training_complete_chat_val.txt
10 days ago
tokenizer
Rename merges.txt to tokenizer/merges.txt
10 days ago
.gitattributes
2.22 kB
Rename slm_training_complete_chat_val (1).txt to split_data/slm_training_complete_chat_val.txt
10 days ago
README.md
9.05 kB
Update README.md
10 days ago
custom_tokenizer.py
21.2 kB
Create custom_tokenizer.py
10 days ago
embeddings.py
13.8 kB
Create embeddings.py
10 days ago
model_slm.py
15.7 kB
Create model_slm.py
10 days ago
requirements.txt
75 Bytes
Create requirements.txt
10 days ago
slm_training_complete_chat.txt
143 MB
xet
Upload slm_training_complete_chat.txt
10 days ago
train.py
18.1 kB
Create train.py
10 days ago
ultra_fast_results .json
2.09 kB
Rename ultra_fast_results (1).json to ultra_fast_results .json
10 days ago