Update README.md
README.md
CHANGED
|
@@ -1,19 +1,50 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
-
emoji:
|
| 4 |
-
colorFrom:
|
| 5 |
-
colorTo:
|
| 6 |
sdk: gradio
|
| 7 |
sdk_version: 4.36.0
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
-
hf_oauth: true
|
| 11 |
-
hf_oauth_expiration_minutes: 60
|
| 12 |
-
hf_oauth_scopes:
|
| 13 |
-
- read-repos
|
| 14 |
-
- write-repos
|
| 15 |
-
- manage-repos
|
| 16 |
---
|
| 17 |
-
This transcribes audio using Faster Whisper v3 turbo int8.
|
| 18 |
|
| 19 |
-
|
| 1 |
---
|
| 2 |
+
title: Audio Transcription
|
| 3 |
+
emoji: 🎙️
|
| 4 |
+
colorFrom: blue
|
| 5 |
+
colorTo: green
|
| 6 |
sdk: gradio
|
| 7 |
sdk_version: 4.36.0
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
---
|
| 11 |
|
| 12 |
+
# Multi-Source Audio Transcription with Faster Whisper
|
| 13 |
+
|
| 14 |
+
This application transcribes audio from uploaded files, MP3 URLs, and YouTube videos using Faster Whisper v3 turbo int8.
|
| 15 |
+
|
| 16 |
+
## Features
|
| 17 |
+
|
| 18 |
+
- Transcribe audio from various sources:
|
| 19 |
+
  - Uploaded audio files
|
| 20 |
+
  - Direct URLs to MP3 files
|
| 21 |
+
  - YouTube video URLs
|
| 22 |
+
- Uses the latest GitHub version of Faster Whisper
|
| 23 |
+
- Adjustable batch size for performance tuning
|
| 24 |
+
- Provides detailed metrics including transcription time and real-time factor
|
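The three source types listed above have to be told apart before any audio is fetched. The README does not say how the app does this, so the following is only a minimal sketch under assumptions: `classify_source`, `YOUTUBE_HOSTS`, and the returned labels are hypothetical names, and the rule "ends in `.mp3`" is a guess at what "direct URL to an MP3 file" means.

```python
from urllib.parse import urlparse

# Hypothetical host list; the app's real YouTube detection is not documented.
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def classify_source(source: str) -> str:
    """Return 'youtube', 'mp3_url', or 'local_file' for an input string."""
    parsed = urlparse(source.strip())
    if parsed.scheme in ("http", "https"):
        host = (parsed.hostname or "").lower()
        if host in YOUTUBE_HOSTS:
            return "youtube"
        # Assumption: a direct MP3 link has a path ending in ".mp3".
        if parsed.path.lower().endswith(".mp3"):
            return "mp3_url"
        raise ValueError(f"Unsupported URL: {source}")
    # Anything that is not an http(s) URL is treated as a local path.
    return "local_file"
```

Checking the host name rather than searching the whole string for "youtube" avoids misclassifying an MP3 URL that merely mentions YouTube in its path.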
| 25 |
+
|
| 26 |
+
## How to Use
|
| 27 |
+
|
| 28 |
+
1. Enter the source of your audio:
|
| 29 |
+
   - Path to a local audio file
|
| 30 |
+
   - URL of an MP3 file
|
| 31 |
+
   - URL of a YouTube video
|
| 32 |
+
2. Adjust the batch size if desired (default is 16)
|
| 33 |
+
3. Click 'Submit' to start the transcription process
|
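For readers who want to reproduce the steps above outside the web UI, here is a rough sketch of a batched transcription call with the default batch size of 16. It assumes the `faster-whisper` package with its `WhisperModel` and `BatchedInferencePipeline` classes; the model name `"large-v3-turbo"`, the CPU device, and the helper names `build_pipeline` and `transcribe` are assumptions and not taken from this app's `app.py`.

```python
import time


def build_pipeline():
    """Build a batched faster-whisper pipeline (assumed model name and settings)."""
    # Imported lazily so the rest of this sketch runs without the library installed.
    from faster_whisper import BatchedInferencePipeline, WhisperModel

    model = WhisperModel("large-v3-turbo", device="cpu", compute_type="int8")
    return BatchedInferencePipeline(model=model)


def transcribe(audio_path, batch_size=16, pipeline=None):
    """Transcribe one file and return (text, info, elapsed_seconds)."""
    if pipeline is None:
        pipeline = build_pipeline()
    start = time.perf_counter()
    segments, info = pipeline.transcribe(audio_path, batch_size=batch_size)
    # Segments are produced lazily, so consume them before stopping the timer.
    text = " ".join(segment.text.strip() for segment in segments)
    elapsed = time.perf_counter() - start
    return text, info, elapsed
```

Passing the pipeline in as a parameter lets the model be loaded once and reused across requests, and makes the timing logic easy to exercise with a stand-in object.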
| 34 |
+
|
| 35 |
+
## Output
|
| 36 |
+
|
| 37 |
+
The application will provide:
|
| 38 |
+
- A full transcription of the audio
|
| 39 |
+
- Detected language and confidence
|
| 40 |
+
- Duration of the audio
|
| 41 |
+
- Transcription time and real-time factor
|
| 42 |
+
- File size of the processed audio
|
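The reported values can be gathered into one small record. The sketch below is illustrative only: `TranscriptionMetrics` and `format_metrics` are hypothetical names, and the real-time factor is assumed to mean transcription time divided by audio duration (so values below 1 are faster than real time), which the README does not define.

```python
from dataclasses import dataclass


@dataclass
class TranscriptionMetrics:
    language: str
    language_probability: float  # detection confidence in [0, 1]
    audio_duration_s: float
    transcription_time_s: float
    file_size_bytes: int

    @property
    def real_time_factor(self) -> float:
        # Assumed definition: processing time / audio duration.
        if self.audio_duration_s <= 0:
            raise ValueError("audio duration must be positive")
        return self.transcription_time_s / self.audio_duration_s


def format_metrics(m: TranscriptionMetrics) -> str:
    """Render the metrics as a short plain-text report."""
    size_mb = m.file_size_bytes / (1024 * 1024)
    return "\n".join([
        f"Language: {m.language} ({m.language_probability:.2%})",
        f"Audio duration: {m.audio_duration_s:.2f} s",
        f"Transcription time: {m.transcription_time_s:.2f} s",
        f"Real-time factor: {m.real_time_factor:.3f}",
        f"File size: {size_mb:.2f} MB",
    ])
```

For example, a 120-second clip transcribed in 6 seconds has a real-time factor of 0.05 under this definition.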
| 43 |
+
|
| 44 |
+
## Note
|
| 45 |
+
|
| 46 |
+
This application is a prototype and may change as it is improved and optimized. Performance varies with the input source and the processing capabilities of the hosting environment.
|
| 47 |
+
|
| 48 |
+
## Feedback and Contributions
|
| 49 |
+
|
| 50 |
+
I welcome feedback and contributions to improve this transcription tool.
|