Update app.py
Browse files
app.py
CHANGED
|
@@ -189,7 +189,7 @@ with GraInter:
|
|
| 189 |
|
| 190 |
**W/10:** Willingness/10. A more narrow, 10-point score, measuring how far the model can be pushed before going against its instructions, refusing to answer, or adding an ethical disclaimer to its response.
|
| 191 |
<br><br>
|
| 192 |
-
A high UGI but low W/10 could mean for example that the model can provide a lot of accurate sensitive information, but will refuse to form the information into something it sees as dangerous.
|
| 193 |
<br><br>
|
| 194 |
**Unruly:** Knowledge of activities that are generally frowned upon.
|
| 195 |
<br>
|
|
|
|
| 189 |
|
| 190 |
**W/10:** Willingness/10. A more narrow, 10-point score, measuring how far the model can be pushed before going against its instructions, refusing to answer, or adding an ethical disclaimer to its response.
|
| 191 |
<br><br>
|
| 192 |
+
A high UGI but low W/10 could mean for example that the model can provide a lot of accurate sensitive information, but will refuse to form the information into something it sees as dangerous. Or that it answers questions correctly, but appends a paragraph to its answer explaining why the question is immoral to ask.
|
| 193 |
<br><br>
|
| 194 |
**Unruly:** Knowledge of activities that are generally frowned upon.
|
| 195 |
<br>
|