Commit 9d79541 · 1 Parent(s): bbbd7a0
Update README.md
README.md CHANGED
@@ -191,6 +191,11 @@ We refer the reader to the [SantaCoder model page](https://huggingface.co/bigcod
 - **Point of Contact:** [[email protected]](mailto:[email protected])
 - **Languages:** Python, Java, and JavaScript
 
+There are two versions (branches) of the model:
+* `main`: Uses the `gpt_bigcode` model. [Requires the bigcode fork of transformers](https://github.com/bigcode-project/transformers).
+* `main_custom`: Packaged with its modeling code. Requires `transformers>=4.27`.
+Alternatively, it can run on older versions by setting the configuration parameter `activation_function = "gelu_pytorch_tanh"`.
+
 # Use
 
 ## Intended use
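The lines added in this commit describe two branches, `main` and `main_custom`. As a rough illustration of how a reader might select one when loading the model, here is a minimal sketch. It rests on several assumptions that are not in the README: `MODEL_ID` is a hypothetical placeholder (the repository id is truncated in the hunk header), the helper names are made up for this example, and passing `trust_remote_code=True` for `main_custom` is inferred from the phrase "packaged with its modeling code". The `revision` argument of `from_pretrained` is the standard way to pick a branch.

```python
from typing import Any, Dict

# Hypothetical placeholder: the repository id is truncated in the hunk header,
# so substitute the real one before calling load_model().
MODEL_ID = "<namespace>/<model-name>"


def from_pretrained_kwargs(branch: str, override_activation: bool = False) -> Dict[str, Any]:
    """Build keyword arguments for `from_pretrained` for one of the two branches.

    `override_activation` only applies to `main_custom` and mirrors the README
    note about older transformers versions.
    """
    if branch == "main":
        # Per the README, this branch relies on `gpt_bigcode` support in the
        # installed transformers package.
        return {"revision": "main"}
    if branch == "main_custom":
        # Assumption: a branch that ships its own modeling code has to be
        # loaded with trust_remote_code=True.
        kwargs: Dict[str, Any] = {"revision": "main_custom", "trust_remote_code": True}
        if override_activation:
            # Keyword arguments that match configuration attributes override
            # the values loaded from the checkpoint's config.
            kwargs["activation_function"] = "gelu_pytorch_tanh"
        return kwargs
    raise ValueError(f"unknown branch: {branch!r}")


def load_model(model_id: str, branch: str, override_activation: bool = False):
    """Load the checkpoint from the chosen branch (downloads the weights)."""
    # Imported lazily so the helper above can be used without transformers installed.
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        model_id, **from_pretrained_kwargs(branch, override_activation)
    )
```

With the placeholder replaced, `load_model(MODEL_ID, "main_custom")` would fetch the self-contained branch, and `load_model(MODEL_ID, "main")` the one that depends on `gpt_bigcode` support.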