WAXAL is an open-source speech database designed to support the development of voice-based artificial intelligence for African languages.WAXAL is an open-source speech database designed to support the development of voice-based artificial intelligence for African languages.

Google joins push to localise AI for African languages with speech database

3 min read

Google has collaborated with African universities and research institutions to launch WAXAL, an open-source speech database designed to support the development of voice-based artificial intelligence for African languages. 

African institutions, including Makerere University in Uganda, the University of Ghana, Digital Umuganda in Rwanda, and the African Institute for Mathematical Sciences (AIMS), participated in the data collection for this initiative. The dataset provides foundational data for 21 Sub-Saharan African languages, including Hausa, Luganda, Yoruba, and Acholi.

WAXAL is designed to support the development of speech recognition systems, voice assistants, text-to-speech tools, and other voice-enabled applications across sectors such as education, healthcare, agriculture, and public services.

“This dataset provides the critical foundation for students, researchers, and entrepreneurs to build technology on their own terms, in their own languages,” said Aisha Walcott-Bryantt, Head of Google Research Africa

WAXAL’s launch comes amid growing efforts across Africa to develop language technologies that reflect local cultures and realities. 

In September 2025, the Nigerian government unveiled N-ATLAS, an open-source language model capable of recognising and transcribing spoken words and generating text, in Yoruba, Hausa, Igbo, and Nigerian-accented English. 

Similar initiatives are emerging in the private sector, where startups such as  South Africa’s Lelapa AI are building tools like Vulavula, which offers speech recognition, translation, and sentiment analysis. 

By making this speech dataset openly accessible, WAXAL provides the fuel for a growing wave of homegrown efforts to bring African languages into the digital age.

Although Sub-Saharan Africa is home to more than 2,000 languages, reports suggest that fewer than 5% of those languages have the resources needed for Natural Language Processing (NLP), which allows computers to understand and comprehend human language. This lack of representation in training datasets limits the effectiveness of speech recognition and text-to-speech systems for African users.  

Developed over three years with funding and technical support from Google, WAXAL addresses a major gap in global AI development.

WAXAL provides speech data for 21 Sub-Saharan African languages, including Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Swahili, and Yoruba. The dataset contains more than 11,000 hours of speech drawn from nearly two million individual recordings. 

Under the project’s partnership model, contributing institutions retain ownership of the data they collected, while making it openly available to researchers and developers worldwide.

“For AI to have a real impact in Africa, it must speak our languages and understand our contexts,” Joyce Nakatumba-Nabende, Senior Lecturer at Makerere University’s School of Computing and Information Technology, said. 

“The WAXAL dataset gives our researchers the high-quality data they need to build speech technologies that reflect our unique communities.”

Get The Best African Tech Newsletters In Your Inbox

Subscribe
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

MoneyGram launches stablecoin-powered app in Colombia

MoneyGram launches stablecoin-powered app in Colombia

The post MoneyGram launches stablecoin-powered app in Colombia appeared on BitcoinEthereumNews.com. MoneyGram has launched a new mobile application in Colombia that uses USD-pegged stablecoins to modernize cross-border remittances. According to an announcement on Wednesday, the app allows customers to receive money instantly into a US dollar balance backed by Circle’s USDC stablecoin, which can be stored, spent, or cashed out through MoneyGram’s global retail network. The rollout is designed to address the volatility of local currencies, particularly the Colombian peso. Built on the Stellar blockchain and supported by wallet infrastructure provider Crossmint, the app marks MoneyGram’s most significant move yet to integrate stablecoins into consumer-facing services. Colombia was selected as the first market due to its heavy reliance on inbound remittances—families in the country receive more than 22 times the amount they send abroad, according to Statista. The announcement said future expansions will target other remittance-heavy markets. MoneyGram, which has nearly 500,000 retail locations globally, has experimented with blockchain rails since partnering with the Stellar Development Foundation in 2021. It has since built cash on and off ramps for stablecoins, developed APIs for crypto integration, and incorporated stablecoins into its internal settlement processes. “This launch is the first step toward a world where every person, everywhere, has access to dollar stablecoins,” CEO Anthony Soohoo stated. The company emphasized compliance, citing decades of regulatory experience, though stablecoin oversight remains fluid. The US Congress passed the GENIUS Act earlier this year, establishing a framework for stablecoin regulation, which MoneyGram has pointed to as providing clearer guardrails. This is a developing story. This article was generated with the assistance of AI and reviewed by editor Jeffrey Albus before publication. Get the news in your inbox. Explore Blockworks newsletters: Source: https://blockworks.co/news/moneygram-stablecoin-app-colombia
Share
BitcoinEthereumNews2025/09/18 07:04
Solana Treasury Firm Holdings Could Double as Forward Industries Unveils $4 Billion Raise

Solana Treasury Firm Holdings Could Double as Forward Industries Unveils $4 Billion Raise

The post Solana Treasury Firm Holdings Could Double as Forward Industries Unveils $4 Billion Raise appeared on BitcoinEthereumNews.com. In brief Forward Industries, the largest publicly traded Solana treasury company, filed to raise $4 billion through an at-the-market equity offering to expand its SOL holdings. The company’s stock (FORD) fell 8.2% following the announcement, while the proceeds could more than double the $3.1 billion currently held in Solana treasuries. DeFi Development Corp. also registered a preferred stock offering with the SEC, following similar funding tactics used by Bitcoin treasury companies like MicroStrategy. Forward Industries, the newest and largest publicly traded Solana treasury company, has filed to raise $4 billion through an at-the-market equity offering. For the sake of comparison, this $4 billion raise is nearly the same size as Bitcoin treasury Strategy’s Stride preferred stock raise in July. And it’s double the size of the Strife preferred stock offering the company did in May. The proceeds would be used for working capital; pursuit of its Solana token strategy, and “the purchase of income-generating assets to grow its business,” the company said in a press release. Forward Industries declined to comment to Decrypt on what other income-generating assets it’s considering adding to its balance sheet.  As markets opened Wednesday morning, Forward saw its stock price take a dive. The shares, which trade under the FORD ticker on the Nasdaq, dipped to $31.29 before rebounding to $34.28 at the time of writing—marking a 8.2% fall for the session. If the company sells all the shares and spends the bulk of the proceeds on buying Solana, it could more than double the amount of SOL being held in treasuries. At the time of writing, there’s already $3.1 billion in Solana treasuries, according to crypto price aggregator CoinGecko. Users on Myriad, a prediction market owned by Decrypt parent company DASTAN, have been growing more confident that SOL will reach $250 sooner than…
Share
BitcoinEthereumNews2025/09/18 12:43
Microsoft plans to invest $4 billion in building a second AI data center in Wisconsin

Microsoft plans to invest $4 billion in building a second AI data center in Wisconsin

Microsoft will invest $4 billion to build a second AI data center in Wisconsin, bringing its total investment in the region to over $7 billion.
Share
Cryptopolitan2025/09/19 03:05