Yixin Song
commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -27,7 +27,7 @@ As we merged the predictors for FFN neurons in models, you can finetune TurboSpa
|
|
| 27 |
## Limitations
|
| 28 |
* TurboSparse, having just undergone training with 150B tokens, may still exhibit performance gaps in certain tasks.
|
| 29 |
* The TurboSparse model has only been trained on English-language datasets, hence its capabilities in other languages are still lacking.
|
| 30 |
-
* The model may produce unexpected outputs due to its small size and probabilistic generation paradigm.
|
| 31 |
|
| 32 |
## License
|
| 33 |
|
|
|
|
| 27 |
## Limitations
|
| 28 |
* TurboSparse, having just undergone training with 150B tokens, may still exhibit performance gaps in certain tasks.
|
| 29 |
* The TurboSparse model has only been trained on English-language datasets, hence its capabilities in other languages are still lacking.
|
| 30 |
+
* The model may produce unexpected outputs due to its small size, limited training tokens and probabilistic generation paradigm.
|
| 31 |
|
| 32 |
## License
|
| 33 |
|