RASPIAUDIO
committed on
Commit
·
45d7137
1
Parent(s):
274613e
french version 1.01
app.py
CHANGED
|
@@ -753,11 +753,38 @@ with gr.Blocks() as app:
|
|
| 753 |
This is a local web UI for F5 TTS with advanced batch processing support. This app supports the following TTS models:
|
| 754 |
|
| 755 |
* [F5-TTS](https://arxiv.org/abs/2410.06885) (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching)
|
| 756 |
-
* [E2 TTS](https://arxiv.org/abs/2406.18009) (Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS)
|
| 757 |
|
| 758 |
-
The checkpoint support French
|
| 759 |
|
| 760 |
-
| 761 |
|
| 762 |
If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 15s, and shortening your prompt.
|
| 763 |
|
|
|
|
| 753 |
This is a local web UI for F5 TTS with advanced batch processing support. This app supports the following TTS models:
|
| 754 |
|
| 755 |
* [F5-TTS](https://arxiv.org/abs/2410.06885) (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching)
|
|
|
|
| 756 |
|
|
|
|
| 757 |
|
| 758 |
+
# F5 TTS French Model
|
| 759 |
+
|
| 760 |
+
This model was trained exclusively on French public domain material from [LibriVox](https://librivox.org/) and voice recordings. It is designed for text-to-speech synthesis in French and has gone through two rounds of training:
|
| 761 |
+
|
| 762 |
+
1. **Round 1**:
|
| 763 |
+
- **Samples**: 80k
|
| 764 |
+
- **Epochs**: 100
|
| 765 |
+
- **Speakers**: Single speaker
|
| 766 |
+
|
| 767 |
+
2. **Round 2**:
|
| 768 |
+
- **Samples**: 120k
|
| 769 |
+
- **Epochs**: 150
|
| 770 |
+
- **Speakers**: Multiple speakers from different corpora
|
| 771 |
+
|
| 772 |
+
### Model Checkpoint
|
| 773 |
+
The checkpoint supports French and can be [downloaded here](https://huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced).
|
| 774 |
+
|
| 775 |
+
### Online Inference
|
| 776 |
+
You can test the model online using our [Hugging Face Space](https://huggingface.co/spaces/RASPIAUDIO/f5-tts_french).
|
| 777 |
+
|
| 778 |
+
### Resources
|
| 779 |
+
- [Tutorial video to train a new language model](https://www.youtube.com/watch?v=UO4usaOojys)
|
| 780 |
+
- [Discussion about this training](https://github.com/SWivid/F5-TTS/issues/434)
|
| 781 |
+
|
| 782 |
+
### About Us
|
| 783 |
+
Check out our website: [https://raspiaudio.com](https://raspiaudio.com) – We create cool, hackable audio devices that let you experiment with audio, voice assistants, and AI using ESP32 and Raspberry Pi!
|
| 784 |
+
|
| 785 |
+
|
| 786 |
+
|
| 787 |
+
Generation takes a very long time on the free Hugging Face CPU machine, so I recommend upgrading to a GPU machine.
|
| 788 |
|
| 789 |
If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 15s, and shortening your prompt.
|
| 790 |
|
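The troubleshooting tip above suggests clipping the reference audio to 15s. As a rough illustration only, the sketch below trims an uncompressed PCM WAV file using Python's standard `wave` module. The helper name `clip_wav` and the constant `MAX_SECONDS` are hypothetical and are not part of `app.py`; MP3 or other compressed inputs would need an external converter, which is not shown here.

```python
import wave

MAX_SECONDS = 15  # clip length suggested by the troubleshooting tip (assumed default)


def clip_wav(src_path: str, dst_path: str, max_seconds: float = MAX_SECONDS) -> float:
    """Copy at most `max_seconds` of audio from src_path to dst_path.

    Hypothetical helper: handles uncompressed PCM WAV only.
    Returns the duration (in seconds) of the file that was written.
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frame_rate = src.getframerate()
        # Keep the whole file if it is already shorter than the limit.
        n_keep = min(src.getnframes(), int(max_seconds * frame_rate))
        frames = src.readframes(n_keep)

    with wave.open(dst_path, "wb") as dst:
        # Reuse channel count, sample width, and rate from the source;
        # the frame count in the header is corrected when the file is closed.
        dst.setparams(params)
        dst.writeframes(frames)

    return n_keep / frame_rate
```

A call such as `clip_wav("reference.wav", "reference_15s.wav")` would write a trimmed copy next to the original, leaving the source file untouched; the file names here are placeholders.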