Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Girinath11
/
MixtureofRecursionwithRouter
like
1
Text Generation
Transformers
PyTorch
mixture_of_recursions
feature-extraction
recursive-transformer
technical-content
code-generation
math
conversation
bpe-tokenizer
adaptive-routing
custom_code
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
MixtureofRecursionwithRouter
947 MB
1 contributor
History:
39 commits
Girinath11
Update model_slm.py
9c59aeb
verified
about 2 months ago
checkpoints
Rename best_model.pt to checkpoints/best_model.pt
4 months ago
split_data
Rename slm_training_complete_chat_val (1).txt to split_data/slm_training_complete_chat_val.txt
4 months ago
tokenizer
Rename merges.txt to tokenizer/merges.txt
4 months ago
.gitattributes
Safe
2.22 kB
Rename slm_training_complete_chat_val (1).txt to split_data/slm_training_complete_chat_val.txt
4 months ago
README.md
Safe
9.05 kB
Update README.md
4 months ago
config.json
Safe
725 Bytes
Update config.json
about 2 months ago
configuration_mixture_of_recursions.py
Safe
2.9 kB
Create configuration_mixture_of_recursions.py
about 2 months ago
custom_tokenizer.py
Safe
38.6 kB
Update custom_tokenizer.py
3 months ago
model_slm.py
24.9 kB
Update model_slm.py
about 2 months ago
modeling_mixture_of_recursions.py
Safe
6.99 kB
Create modeling_mixture_of_recursions.py
about 2 months ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
166 MB
xet
Upload pytorch_model.bin with huggingface_hub
2 months ago
requirements.txt
Safe
75 Bytes
Create requirements.txt
4 months ago
slm_training_complete_chat.txt
Safe
143 MB
xet
Upload slm_training_complete_chat.txt
4 months ago
train.py
Safe
26.1 kB
Update train.py
3 months ago
ultra_fast_results .json
Safe
2.09 kB
Rename ultra_fast_results (1).json to ultra_fast_results .json
4 months ago