A large-scale open resource for African language speech technology

March 6, 2026

Anchoring the African AI ecosystem

A core commitment of the WAXAL project was to work with, and contribute directly to, the African AI ecosystem. The data collection effort was led entirely by African academic and community organizations, guided by Google experts on world-class data collection practices. This collaborative approach ensures that the resource is built by and for the communities it serves. Each partner focused on a specific subset of languages, using a shared methodology. Project partners include Makerere University, which collected ASR and/or TTS data for nine languages, and the University of Ghana, which focused on eight languages using an image-prompted data collection method. Additional key collaborators were Digital Umuganda, in partnership with Addis Ababa University, who played an important role in leading ASR collection for many regional languages. For high-quality, studio-recorded voices, Media Trust, Loud and Clear, and the African Institute for Mathematical Sciences Senegal led TTS recording in different regional languages.

This framework is fundamentally based on the principle that the partners retain ownership of the data they collect in exchange for a shared commitment to make all datasets openly available to the broader community. This deep collaboration and open access philosophy has already enabled notable derivative research and publications.

Through this framework, the partners have already enabled new research, such as the development of a cookbook for community-driven collection of impaired speech. This research produced the first open-source dataset for Akan speakers with conditions such as cerebral palsy and stuttering, and showed in particular that image-cued prompting is more effective than text-based prompts for these populations. This work provides an important roadmap for developing inclusive speech technologies in low-resource environments.
In addition, the initiative supported a major study that presented a 5,000-hour speech corpus for five Ghanaian languages: Akan, Ewe, Dagbani, Dagaare, and Ikposo. By using a controlled crowdsourcing approach to capture natural, spontaneous speech, this work established the infrastructure for building robust ASR and TTS systems tailored to the linguistic diversity of West Africa.
Other related research focused on benchmarking four state-of-the-art models (Whisper, XLS-R, MMS, and W2v-BERT) on 13 African languages. This study analyzed how performance improves as training data increases, providing important insights into data efficiency and highlighting that the benefits of scaling depend strongly on linguistic complexity and domain alignment.
Finally, a systematic literature review was published that catalogues 74 datasets covering 111 African languages, mapping the current state of speech technology resources. The review emphasized the urgent need for multi-domain conversational corpora and for the adoption of linguistically informed metrics, such as character error rate (CER), to better evaluate performance in morphologically rich and tonal language contexts.
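To make the point about character error rate concrete, here is a minimal sketch in Python of how CER and word error rate (WER) can be computed from an edit distance. It is an illustration rather than the evaluation code used in any of the studies above: the function names are hypothetical, the Unicode NFC normalization step is an assumption about how tone-marked text might reasonably be handled, and the Swahili example sentence is chosen for illustration and is not taken from the WAXAL data.

```python
import unicodedata
from typing import Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance between two sequences (characters or words)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character edits divided by reference length."""
    # Assumption: normalize to NFC so a letter plus a combining tone mark
    # is compared consistently with its precomposed form where one exists.
    ref = unicodedata.normalize("NFC", reference)
    hyp = unicodedata.normalize("NFC", hypothesis)
    if not ref:
        raise ValueError("reference must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word edits divided by reference word count."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hyp_words) / len(ref_words)


# Illustrative example: one wrong character inside a long, inflected word.
reference = "ninakupenda sana"
hypothesis = "ninakupende sana"
print(f"WER = {wer(reference, hypothesis):.4f}")  # WER = 0.5000
print(f"CER = {cer(reference, hypothesis):.4f}")  # CER = 0.0625
```

In this example a single character error makes half of the words count as wrong under WER, while CER registers one error in sixteen characters. That gap is one reason character-level metrics are often considered more informative for languages where a single word carries a lot of morphology.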

Limitations and what to watch

Open speech corpora like WAXAL are an important step, but they do not by themselves close the gap in African-language technology. Reported totals for languages and hours vary across summaries of the project, and coverage remains uneven across the continent’s roughly 2,000 languages, with many tonal and morphologically rich languages still underrepresented. Dataset quality, dialectal variation, and domain coverage all affect how well downstream models perform, and a published literature review accompanying the work stressed the continuing need for multi-domain conversational corpora and linguistically informed evaluation metrics such as character error rate (CER). Sustained funding, local ownership, and ongoing data collection will determine whether these resources translate into widely usable voice technology.

Full details and access are available via Google Research and the WAXAL dataset on Hugging Face.
