Spaces:
Running
Running
| title: Audio Transcription | |
| emoji: 🎙️ | |
| colorFrom: blue | |
| colorTo: green | |
| sdk: gradio | |
| sdk_version: 4.36.0 | |
| app_file: app.py | |
| pinned: false | |
| # Multi-Source Audio Transcription with Faster Whisper | |
| This application transcribes audio from multiple sources using Faster Whisper v3 turbo int8, providing a flexible and powerful transcription solution. | |
| ## Features | |
| - Transcribe audio from various sources: | |
| - Uploaded audio files | |
| - Direct URLs to MP3 files | |
| - YouTube video URLs | |
| - Utilizes the latest GitHub version of Faster Whisper for optimal performance | |
| - Adjustable batch size for performance tuning | |
| - Provides detailed metrics including transcription time and real-time factor | |
| ## How to Use | |
| 1. Enter the source of your audio: | |
| - Path to a local audio file | |
| - URL of an MP3 file | |
| - URL of a YouTube video | |
| 2. Adjust the batch size if desired (default is 16) | |
| 3. Click 'Submit' to start the transcription process | |
| ## Output | |
| The application will provide: | |
| - A full transcription of the audio | |
| - Detected language and confidence | |
| - Duration of the audio | |
| - Transcription time and real-time factor | |
| - File size of the processed audio | |
| ## Note | |
| This application is a prototype and may be subject to further improvements and optimizations. Performance may vary based on the input source and the processing capabilities of the hosting environment. | |
| ## Feedback and Contributions | |
| I welcome feedback and contributions to improve this transcription tool. |