Fongbe ASR model with diacritics

How to use for inference

from speechbrain.inference.ASR import EncoderASR

asr_model = EncoderASR.from_hparams(
    source="whettenr/asr-fon-with-diacritics",
    savedir="pretrained_models/asr-fongbe-with-diacritics"
)

# Transcribe the example file hosted in the model repository
print(asr_model.transcribe_file("whettenr/asr-fon-with-diacritics/example.wav"))

# expected output:
# huzuhuzu gɔngɔn ɖé ɖò dandan
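If the transcript needs to be compared against text written without tone marks, the combining accents can be removed after transcription. The sketch below is only an illustration using Python's standard `unicodedata` module; the helper name `strip_diacritics` is hypothetical and not part of SpeechBrain or this model, and it assumes that "diacritics" here means combining marks such as the acute and grave accents. Base letters such as ɖ and ɔ are kept.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Hypothetical helper: drop combining marks, keep base letters like ɖ and ɔ."""
    # NFD splits precomposed letters (e.g. é) into base letter + combining accent
    decomposed = unicodedata.normalize("NFD", text)
    # combining() returns a non-zero class for combining marks such as U+0301
    kept = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Recompose whatever is left into NFC form
    return unicodedata.normalize("NFC", kept)


transcript = "huzuhuzu gɔngɔn ɖé ɖò dandan"  # expected output shown above
print(strip_diacritics(transcript))
# huzuhuzu gɔngɔn ɖe ɖo dandan
```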

Details of model

~100M parameters, 12-layer Conformer encoder, feed-forward (FFNN) decoder

Details of training

Pretrained with BEST-RQ on roughly 140 hours of audio:
- FFSTC 2 + beethogedeon/fongbe-speech (~40 hours)
- cappfm (~100 hours)

Fine-tuned with CTC loss on the training sets of:
- FFSTC 2
- beethogedeon/fongbe-speech
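
For readers unfamiliar with CTC, the following is a minimal sketch of greedy CTC decoding: consecutive repeated frame predictions are collapsed, then blank tokens are removed. It is a generic illustration, not this model's actual decoding code (`EncoderASR` handles decoding internally); the function name `ctc_greedy_decode` and the blank index `0` are assumptions made for the example.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, blank_id=0):
    """Illustrative greedy CTC decoding over per-frame argmax token ids.

    blank_id=0 is an assumption for this sketch, not taken from the model.
    """
    # Step 1: collapse runs of identical consecutive predictions
    collapsed = [token for token, _ in groupby(frame_ids)]
    # Step 2: drop the blank symbol
    return [token for token in collapsed if token != blank_id]


# A blank between two identical tokens keeps them as separate outputs
frames = [0, 3, 3, 0, 3, 5, 5, 0, 0]
print(ctc_greedy_decode(frames))
# [3, 3, 5]
```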

@inproceedings{kponou25_interspeech,
  title     = {{Extending the Fongbe to French Speech Translation Corpus: resources, models and benchmark}},
  author    = {D. Fortuné Kponou and Salima Mdhaffar and Fréjus A. A. Laleye and Eugène C. Ezin and Yannick Estève},
  year      = {2025},
  booktitle = {{Interspeech 2025}},
  pages     = {4533--4537},
  doi       = {10.21437/Interspeech.2025-1801},
  issn      = {2958-1796},
}
