Skip to main content
SearchLoginLogin or Signup

Lanfrica Talks #2 | Building African Voices

Published onMay 10, 2023
Lanfrica Talks #2 | Building African Voices
Lanfrica Talks #2 | Building African Voices by Perez Ogayo

Learn more about Lanfrica Talks at https://lanfrica.com/blog/lanfrica-talks

🎧 Listen to this talk as a podcast.

Abstract

Modern speech synthesis techniques can produce natural-sounding speech given sufficient high-quality data and compute resources. However, such data is not readily available for many languages. This paper focuses on speech synthesis for low-resourced African languages, from corpus creation to sharing and deploying the Text-to-Speech (TTS) systems. We first create a set of general-purpose instructions on building speech synthesis systems with minimum technological resources and subject-matter expertise. Next, we create new datasets and curate datasets from "found" data (existing recordings) through a participatory approach while considering accessibility, quality, and breadth. We demonstrate that we can develop synthesizers that generate intelligible speech with 25 minutes of created speech, even when recorded in suboptimal environments. Finally, we release the speech data, code, and trained voices for 12 African languages to support researchers and developers.

Bio

Perez Ogayo is a masters student at Carnegie Mellon University in the Language Technologies Institute(LTI) where she is focusing on low resource natural language processing. Her interests in NLP are multilingual machine translation, speech synthesis and recognition and NLP for endangered languages. She is a researcher at Masakhane working on Luo and Kiswahili.

Links

  1. Lanfrica | Building African Voices

  2. African voices website: https://www.africanvoices.tech/

  3. Connect with Perez Ogayo on Linkedln:https://www.linkedin.com/in/peresogayo/

  4. Connect on Twitter: https://twitter.com/a_ogayo

Comments
0
comment
No comments here
Why not start the discussion?