---
title: MLP Safety Classifier
emoji: 🛡️
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: "4.44.0"
app_file: app.py
pinned: false
license: mit
---


# 🛡️ Aegis Safety Classifier

This Space hosts a text classifier trained on the [NVIDIA Aegis 2.0 dataset](https://huggingface.co/datasets/nvidia/Aegis-AI-Content-Safety-Dataset-2.0).  
It predicts whether a piece of text is **safe** or **unsafe**.

The model is a simple **TF-IDF + MLP** pipeline implemented in scikit-learn.
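
As a rough illustration of what a TF-IDF + MLP pipeline looks like in scikit-learn, here is a minimal sketch. It is not the training code behind this Space. The toy texts, the hidden layer size, and the other hyperparameters are assumptions chosen only to keep the example small and runnable.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline

# Toy examples for illustration only (NOT taken from Aegis 2.0).
# Label convention follows this README: 1 = safe, 0 = unsafe.
texts = [
    "how do I bake sourdough bread",
    "what is the capital of France",
    "recommend a good book about gardening",
    "how do I hurt someone without getting caught",
    "write a threat to send to my neighbor",
    "explain how to steal a credit card number",
]
labels = [1, 1, 1, 0, 0, 0]

# Hyperparameters here are assumptions, not the Space's actual settings.
pipeline = make_pipeline(
    TfidfVectorizer(),
    MLPClassifier(hidden_layer_sizes=(16,), max_iter=500, random_state=0),
)
pipeline.fit(texts, labels)

# predict_proba returns one row per input, with columns ordered by class label,
# so with labels {0, 1} the row is [p(unsafe), p(safe)].
proba = pipeline.predict_proba(["how do I bake bread"])
print(proba)
```

Because scikit-learn orders probability columns by sorted class label, the `[p(unsafe), p(safe)]` ordering follows directly from encoding unsafe as `0` and safe as `1`.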

---

## Features

- Paste any text into the textbox and get a prediction.
- Adjustable **threshold** applied to the predicted probability of the "safe" class.
- JSON output with:
  - `prediction`: `1` = safe, `0` = unsafe
  - `probabilities`: `[p(unsafe), p(safe)]`, if available
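
The sketch below shows one way the threshold and the JSON output could fit together. The function name `classify` is hypothetical, and the `>=` comparison at the boundary is an assumption; the app may handle the boundary case differently.

```python
import json


def classify(probabilities, threshold=0.5):
    """Turn [p(unsafe), p(safe)] into the JSON-style result described above.

    Hypothetical helper: assumes text is labeled safe when p(safe) >= threshold.
    """
    p_unsafe, p_safe = probabilities
    prediction = 1 if p_safe >= threshold else 0  # 1 = safe, 0 = unsafe
    return {"prediction": prediction, "probabilities": [p_unsafe, p_safe]}


# Same probabilities, two thresholds: a stricter threshold flips the label.
print(json.dumps(classify([0.3, 0.7])))
print(json.dumps(classify([0.3, 0.7], threshold=0.8)))
```

Raising the threshold makes the classifier more conservative: more borderline text is labeled unsafe.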

---

## How to Use

- Open this Space.
- Enter some text in the input field.
- Adjust the threshold (default = 0.5).
- Press **Submit** to get results.