Update README.md
README.md CHANGED

@@ -183,7 +183,7 @@ The models were trained on 1 trillion tokens, following the pre-training recipe
 ### Model
 
 - Architecture: Llama
-- Pretraining tokens:
+- Pretraining tokens: 1 trillion tokens
 - Precision: bfloat16
 
 ### Hardware
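For context, the two fields adjacent to this hunk, a Llama architecture and bfloat16 precision, map directly onto how such a checkpoint is typically loaded. Below is a minimal sketch, assuming the model is distributed as a Hugging Face `transformers` checkpoint; the repo id is a placeholder, not taken from this diff:

```python
# Hedged sketch: load a Llama-architecture checkpoint in bfloat16,
# matching the "Architecture: Llama" and "Precision: bfloat16" fields
# in the model card. "org/model-name" is hypothetical; substitute the
# repository this README belongs to.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "org/model-name"  # placeholder repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
# torch_dtype=torch.bfloat16 keeps weights in the precision the card documents.
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
```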