zary0/mohrm
Tags: Text Generation, PyTorch, hrm, Mixture of Experts, mixture-of-experts
License: apache-2.0
Branch: main | Repository size: 3.71 GB | 1 contributor | History: 503 commits
Latest commit: "Update README for epoch 5" by zary0 (268da61, verified), 2 days ago

.gitattributes | 1.52 kB | Safe
Last commit: "initial commit", 7 days ago

README.md | 2.23 kB
Last commit: "Update README for epoch 5", 2 days ago

pytorch_model.bin | 1.51 GB | xet | pickle
Detected pickle imports (3): "collections.OrderedDict", "torch.FloatStorage", "torch._utils._rebuild_tensor_v2"
Last commit: "Epoch 5: val_loss=3.5680, perplexity=35.44", 2 days ago

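The val_loss and perplexity figures in the pytorch_model.bin commit message appear to be related by the usual definition, perplexity = exp(mean cross-entropy loss). The sketch below checks that relationship. It assumes the loss is a mean per-token cross-entropy measured in nats; the page does not state this, and the function name `perplexity` is only illustrative.

```python
import math


def perplexity(mean_nll: float) -> float:
    """Perplexity for a mean per-token cross-entropy loss (assumed to be in nats)."""
    return math.exp(mean_nll)


# Value copied from the commit message "Epoch 5: val_loss=3.5680, perplexity=35.44".
val_loss = 3.5680
print(f"perplexity from val_loss: {perplexity(val_loss):.4f}")
```

The computed value should land very close to the reported 35.44; a small difference in the last digit is expected because the loss in the commit message is itself rounded to four decimals.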
training_state.pt | 2.2 GB | xet | pickle
Detected pickle imports (3): "collections.OrderedDict", "torch.FloatStorage", "torch._utils._rebuild_tensor_v2"
Last commit: "Training state at Epoch 4 (global_step 38670)", 2 days ago
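Both checkpoint files are flagged as pickle files with three detected imports. As a rough illustration of how such a list can be produced without unpickling anything, the sketch below walks a pickle stream with the standard-library `pickletools` module and collects the names referenced by `GLOBAL` and `STACK_GLOBAL` opcodes. This is not the scanner the Hub uses; the helper names `list_pickle_imports` and `pickle_payloads` are hypothetical, the `STACK_GLOBAL` handling is a heuristic that can miss names fetched from the pickle memo, and the zip layout (members ending in `.pkl`) is an assumption about how these particular checkpoints were saved.

```python
import io
import pickle
import pickletools
import zipfile
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> set:
    """Collect 'module.name' strings referenced by a pickle stream.

    The stream is only disassembled, never executed.
    """
    found = set()
    strings = []  # string constants seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Heuristic: assume the two most recent string constants are
            # the module and attribute name pushed for this opcode.
            if len(strings) >= 2:
                found.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return found


def pickle_payloads(source):
    """Yield raw pickle streams from a zip-based checkpoint (assumed layout)."""
    with zipfile.ZipFile(source) as zf:
        for member in zf.namelist():
            if member.endswith(".pkl"):
                yield zf.read(member)


# Demo on a small in-memory archive rather than the real 1.51 GB file.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("archive/data.pkl", pickle.dumps(OrderedDict(a=1), protocol=2))
buf.seek(0)

for payload in pickle_payloads(buf):
    print(sorted(list_pickle_imports(payload)))
```

For a real checkpoint, `pickle_payloads` would be given the path to the downloaded file instead of the in-memory buffer. A file saved in an older non-zip format would need different handling, which this sketch does not attempt.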