Bantu Gazette

Bantu Gazette
  • Energy & Trade
  • Finance
  • Health
  • Politics & Economy
  • Technology
  • Environment
  • Feature
  • Opinion
  • Changemakers
  • Tourism & Culture
  • Sports
  • Magazine
Menu
  • Black Frame Studio
  • Magazine

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026
Reading Time: 2 mins read

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 12, 2026
Reading Time: 2 mins read

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get the inside Story

Stay informed on the stories shaping Africa’s future. Get breaking news, in-depth analysis, opinions and exclusive insights from across the continent delivered to your inbox, free and unfiltered.


Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
WhatsApp
LinkedIn
X (Twitter)
Instagram

Related Posts

Nigeria Launches AI-Powered Government Information Platform in Four Languages
Technology

Nigeria Launches AI-Powered Government Information Platform in Four Languages

May 23, 2026
AI Expansion Targets Health and Education Systems in Rwanda
Technology

West Africa Bloc Turns to Artificial Intelligence for Digital Skills, Innovation

May 18, 2026
Safaricom Ethiopia Records 130.9% Revenue Growth
Technology

Safaricom Ethiopia Records 130.9% Revenue Growth

May 12, 2026
Ghana Launches National AI Strategy to Drive Digital Transformation, Economic Growth
Technology

Ghana Launches National AI Strategy to Drive Digital Transformation, Economic Growth

April 25, 2026
Young Ethiopian Founders Turn Ideas Into Real Solutions
Feature

Young Ethiopian Founders Turn Ideas Into Real Solutions

April 10, 2026
Digital Technologies Are Africa’s Greatest Leapfrog Opportunity
Technology

Digital Technologies Are Africa’s Greatest Leapfrog Opportunity

April 8, 2026

Most Recent

New Botswana City Project Launched to Support Economic Diversification
Politics & Economy

New Botswana City Project Launched to Support Economic Diversification

by Naledi Kgosi
June 8, 2026
0

GABORONE Botswana has launched the New Botswana City project in Gaborone, a development expected to attract investment, create jobs and...

Read moreDetails
Kenya’s Ebola Preparedness Highlights the Need for a Clear Framework

Kenya’s Ebola Preparedness Highlights the Need for a Clear Framework

June 8, 2026
Côte d’Ivoire Secures €103 Million to Expand Electricity Access to 100,000 Households

Côte d’Ivoire Secures €103 Million to Expand Electricity Access to 100,000 Households

June 6, 2026
Ghana Battles to Save Cocoa Industry as Production Falls to 20-Year Low

Ghana Launches AgriConnect Compact to Boost Food Security, Jobs, Agricultural Investment

June 6, 2026
Benin Announces Free Public Secondary Education for All Girls

Benin Announces Free Public Secondary Education for All Girls

June 8, 2026
Zimbabwe Secures Non-Permanent Seat on U.N. Security Council

Zimbabwe Secures Non-Permanent Seat on U.N. Security Council

June 3, 2026
Dangote Retains Africa’s Most Admired Brand Title for 8th Consecutive Year

Dangote Retains Africa’s Most Admired Brand Title for 8th Consecutive Year

June 4, 2026
New Botswana City Project Launched to Support Economic Diversification
Politics & Economy

New Botswana City Project Launched to Support Economic Diversification

by Naledi Kgosi
Reading Time: 2 mins read
June 8, 2026
0

GABORONE Botswana has launched the New Botswana City project in Gaborone, a development expected to attract investment, create jobs and...

Read moreDetails
Kenya’s Ebola Preparedness Highlights the Need for a Clear Framework
Health

Kenya’s Ebola Preparedness Highlights the Need for a Clear Framework

by Joyce Waceke
Reading Time: 4 mins read
June 8, 2026
0

An active Ebola outbreak in Uganda and the Democratic Republic of Congo has tested Kenya's public health preparedness and exposed...

Read moreDetails
Côte d’Ivoire Secures €103 Million to Expand Electricity Access to 100,000 Households
Energy & Trade

Côte d’Ivoire Secures €103 Million to Expand Electricity Access to 100,000 Households

by Seraphine Biyogo
Reading Time: 2 mins read
June 6, 2026
0

African Development Bank approves financing for the second phase of a national electrification project aimed at expanding grid connections, upgrading...

Read moreDetails

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get the inside Story

Stay informed on the stories shaping Africa’s future. Get breaking news, in-depth analysis, opinions and exclusive insights from across the continent delivered to your inbox, free and unfiltered.


Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
WhatsApp
LinkedIn
X (Twitter)
Instagram

Related Posts

Nigeria Launches AI-Powered Government Information Platform in Four Languages

Nigeria Launches AI-Powered Government Information Platform in Four Languages

by Marina Bisse
May 21, 2026
0

...

AI Expansion Targets Health and Education Systems in Rwanda

West Africa Bloc Turns to Artificial Intelligence for Digital Skills, Innovation

by Aissatou Fall
May 18, 2026
0

...

Safaricom Ethiopia Records 130.9% Revenue Growth

Safaricom Ethiopia Records 130.9% Revenue Growth

by Kalkidan Negash
May 11, 2026
0

...

Ghana Launches National AI Strategy to Drive Digital Transformation, Economic Growth

Ghana Launches National AI Strategy to Drive Digital Transformation, Economic Growth

by Elise Ntebah
April 25, 2026
0

...

Young Ethiopian Founders Turn Ideas Into Real Solutions

Young Ethiopian Founders Turn Ideas Into Real Solutions

by Abel Gorfu Asefa
April 10, 2026
0

...

Digital Technologies Are Africa’s Greatest Leapfrog Opportunity

Digital Technologies Are Africa’s Greatest Leapfrog Opportunity

by Samira Benhadda
April 8, 2026
0

...

New Botswana City Project Launched to Support Economic Diversification
Politics & Economy

New Botswana City Project Launched to Support Economic Diversification

by Naledi Kgosi
Reading Time: 2 mins read
June 8, 2026
0

GABORONE Botswana has launched the New Botswana City project in Gaborone, a development expected to attract investment, create jobs and...

Read moreDetails
Kenya’s Ebola Preparedness Highlights the Need for a Clear Framework

Kenya’s Ebola Preparedness Highlights the Need for a Clear Framework

by Joyce Waceke
June 8, 2026
0

An active Ebola outbreak in Uganda and the Democratic Republic of Congo has tested Kenya's public health preparedness and exposed...

Côte d’Ivoire Secures €103 Million to Expand Electricity Access to 100,000 Households

Côte d’Ivoire Secures €103 Million to Expand Electricity Access to 100,000 Households

by Seraphine Biyogo
June 6, 2026
0

African Development Bank approves financing for the second phase of a national electrification project aimed at expanding grid connections, upgrading...

Ghana Battles to Save Cocoa Industry as Production Falls to 20-Year Low

Ghana Launches AgriConnect Compact to Boost Food Security, Jobs, Agricultural Investment

by Marina Bisse
June 6, 2026
0

A $3.5 billion agricultural initiative backed by the World Bank Group and development partners aims to strengthen food security, create...

Benin Announces Free Public Secondary Education for All Girls

Benin Announces Free Public Secondary Education for All Girls

by Aissatou Fall
June 4, 2026
0

President Romuald Wadagni says the policy will remove financial barriers to education and help thousands of girls stay in school...

Next Post
Nigeria Issues $347 Million in Bonds to Clear Power Sector Debt

Nigeria Issues $347 Million in Bonds to Clear Power Sector Debt

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Our Platforms

  • Bantu Magazine
  • Bantu Brief
  • Black Frame Studio

Our Services

  • Bantu Agency
  • Advertise
  • Partnerships

Our Services

  • Editorial Director
  • Opportunities
  • Contact

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Our Platforms

  • Bantu Magazine
  • Bantu Brief
  • Black Frame Studio

Our Services

  • Bantu Agency
  • Advertise
  • Partnerships

Our Services

  • Editorial Director
  • Opportunities
  • Contact
Bantu Gazette
  • Energy & Trade
  • Finance
  • Health
  • Politics & Economy
  • Technology
  • Environment
  • Feature
  • Opinion
  • Changemakers
  • Tourism & Culture
  • Magazine