Spaces: langtech-innovation / WhisperLiveKit

Quentin Fuxa committed on Mar 1

Commit

519ccaa

1 Parent(s): 38f752b

huggingface steps added. Solve https://github.com/QuentinFuxa/whisper_streaming_web/issues/59 & https://github.com/QuentinFuxa/whisper_streaming_web/issues/60

Files changed (1)

README.md

@@ -25,7 +25,7 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str
 #### 🚀 **Coming Soon**
-- **Faster Word Validation** – Accelerate real-time transcription by validating high-confidence words immediately upon first appearance for whisper backends that returns words & segment probabilities
+- **Faster Word Validation** – Accelerate real-time transcription by validating high-confidence words immediately upon first appearance for whisper backends that return word & segment probabilities
 - **Enhanced Diarization Performance** – Optimize speaker identification by implementing longer steps for Diart processing and leveraging language-specific segmentation patterns to improve speaker boundary detection
@@ -78,6 +78,8 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str
     diart
     ```
+    Diart uses by default [pyannote.audio](https://github.com/pyannote/pyannote-audio) models from the _huggingface hub_. To use them, please follow the steps described [here](https://github.com/juanmc2005/diart?tab=readme-ov-file#get-access-to--pyannote-models).
 3. **Run the FastAPI Server**: