PowerInfer/TurboSparse-Mistral-Instruct

Yixin Song committed on Jun 7, 2024

Commit 56e658e (verified) · 1 Parent(s): 70b61f3

Update README.md

Files changed (1)

README.md +1 -1

@@ -27,7 +27,7 @@ As we merged the predictors for FFN neurons in models, you can finetune TurboSpa
 ## Limitations
 * TurboSparse, having just undergone training with 150B tokens, may still exhibit performance gaps in certain tasks.
 * The TurboSparse model has only been trained on English-language datasets, hence its capabilities in other languages are still lacking.
-* The model may produce unexpected outputs due to its small size and probabilistic generation paradigm.
+* The model may produce unexpected outputs due to its small size, limited training tokens and probabilistic generation paradigm.
 ## License