Update README.md
Browse files
README.md
CHANGED
|
@@ -6,22 +6,22 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str
|
|
| 6 |
<img src="src/web/demo.png" alt="Demo Screenshot" width="600">
|
| 7 |
</p>
|
| 8 |
|
| 9 |
-
###
|
| 10 |
|
| 11 |
#### π **Web & API**
|
| 12 |
-
- **Built-in Web UI** β No frontend setup
|
| 13 |
-
- **FastAPI WebSocket Server** β Real-time
|
| 14 |
-
- **JavaScript Client** β
|
| 15 |
|
| 16 |
#### βοΈ **Core Improvements**
|
| 17 |
-
- **Buffering Preview** β Displays unvalidated transcription segments for
|
| 18 |
-
- **Multi-User Support** β
|
| 19 |
- **MLX Whisper Backend** β Optimized for Apple Silicon for faster local processing.
|
| 20 |
-
- **Enhanced Sentence Segmentation** β
|
| 21 |
- **Extended Logging** β More detailed logs to improve debugging and monitoring.
|
| 22 |
|
| 23 |
#### ποΈ **Advanced Features**
|
| 24 |
-
- **Real-Time Diarization** β
|
| 25 |
|
| 26 |
|
| 27 |
## Installation
|
|
|
|
| 6 |
<img src="src/web/demo.png" alt="Demo Screenshot" width="600">
|
| 7 |
</p>
|
| 8 |
|
| 9 |
+
### Differences from [Whisper Streaming](https://github.com/ufal/whisper_streaming)
|
| 10 |
|
| 11 |
#### π **Web & API**
|
| 12 |
+
- **Built-in Web UI** β No frontend setup required, just open your browser and start transcribing.
|
| 13 |
+
- **FastAPI WebSocket Server** β Real-time speech-to-text processing with async FFmpeg streaming.
|
| 14 |
+
- **JavaScript Client** β Ready-to-use MediaRecorder implementation for seamless client-side integration.
|
| 15 |
|
| 16 |
#### βοΈ **Core Improvements**
|
| 17 |
+
- **Buffering Preview** β Displays unvalidated transcription segments for immediate feedback.
|
| 18 |
+
- **Multi-User Support** β Handles multiple users simultaneously without conflicts.
|
| 19 |
- **MLX Whisper Backend** β Optimized for Apple Silicon for faster local processing.
|
| 20 |
+
- **Enhanced Sentence Segmentation** β Improved buffer trimming for better accuracy across languages.
|
| 21 |
- **Extended Logging** β More detailed logs to improve debugging and monitoring.
|
| 22 |
|
| 23 |
#### ποΈ **Advanced Features**
|
| 24 |
+
- **Real-Time Diarization** β Identify different speakers in real time using [Diart](https://github.com/juanmc2005/diart).
|
| 25 |
|
| 26 |
|
| 27 |
## Installation
|