tomaarsen/span-marker-xlm-roberta-large-verbs

Token Classification

tomaarsen HF Staff committed on Aug 7, 2023

Commit 1bdadff · 1 Parent(s): 0b4a9ff

Add limitation due to RoBERTa

Files changed (1)

README.md +14 -0

@@ -74,6 +74,20 @@ The following hyperparameters were used during training:
 | 0.0115        | 2.45  | 4000 | 0.0172          | 0.9821            | 0.9871         | 0.9845     | 0.9962           |
+### Limitations
+**Warning**: This model works best when punctuation is separated from the preceding word by a space, for example:
+```python
+# ✅
+model.predict("He plays J. Robert Oppenheimer , an American theoretical physicist .")
+# ❌
+model.predict("He plays J. Robert Oppenheimer, an American theoretical physicist.")
+# You can also supply a list of words directly: ✅
+model.predict(["He", "plays", "J.", "Robert", "Oppenheimer", ",", "an", "American", "theoretical", "physicist", "."])
+```
+Similar separation may also help in some languages, for example splitting `"l'ocean Atlantique"` into `"l' ocean Atlantique"`.
 ### Framework versions
 - Transformers 4.30.2
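
The limitation added in this commit means that raw text needs some pre-processing before it reaches `model.predict`. Below is a minimal sketch of one way to do that, assuming a simple whitespace-and-regex heuristic is acceptable. The helper `separate_punctuation` and its `split_elisions` flag are hypothetical names introduced here for illustration; they are not part of SpanMarker or of this model card. The heuristic keeps single-letter initials such as `J.` intact, as in the README example, but it will not handle every abbreviation correctly.

```python
import re
from typing import List

# Hypothetical helper, not part of SpanMarker: a rough heuristic pre-tokenizer.
_TRAILING_PUNCT = re.compile(r"^(.*?)([,.;:!?]+)$")  # word followed by punctuation
_INITIAL = re.compile(r"^[A-Z]\.$")                  # e.g. "J." is left untouched
_ELISION = re.compile(r"^(\w{1,2}')(\w.*)$")         # e.g. "l'ocean"


def separate_punctuation(text: str, split_elisions: bool = False) -> List[str]:
    """Split text on whitespace and detach trailing punctuation from each word."""
    words = []
    for token in text.split():
        if split_elisions:
            # Assumption: short elided prefixes ("l'", "qu'") become their own word.
            elision = _ELISION.match(token)
            if elision:
                words.append(elision.group(1))
                token = elision.group(2)
        match = _TRAILING_PUNCT.match(token)
        if match and match.group(1) and not _INITIAL.match(token):
            words.append(match.group(1))  # the word itself
            words.append(match.group(2))  # its trailing punctuation
        else:
            words.append(token)           # already clean, or an initial
    return words


words = separate_punctuation(
    "He plays J. Robert Oppenheimer, an American theoretical physicist."
)
print(words)

# With the model loaded (requires the span_marker package), the word list
# can be passed directly, as the README notes:
# from span_marker import SpanMarkerModel
# model = SpanMarkerModel.from_pretrained("tomaarsen/span-marker-xlm-roberta-large-verbs")
# model.predict(words)
```

Passing a list of words rather than a re-joined string avoids relying on how the model splits the input itself. Elision splitting is opt-in because it would also split English contractions such as `I'm`.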