Bantu Gazette

Bantu Gazette
  • Energy & Trade
  • Finance
  • Health
  • Politics & Economy
  • Technology
  • Environment
  • Feature
  • Opinion
  • Changemakers
  • Tourism & Culture
  • Sports
  • Magazine
Menu
  • Black Frame Studio
  • Magazine

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026
Reading Time: 2 mins read

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 12, 2026
Reading Time: 2 mins read

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get the inside Story

Stay informed on the stories shaping Africa’s future. Get breaking news, in-depth analysis, opinions and exclusive insights from across the continent delivered to your inbox, free and unfiltered.


Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
WhatsApp
LinkedIn
X (Twitter)
Instagram

Related Posts

Senegal Launches Flagship Digital Projects Under its National Technology Program
Technology

Senegal Launches Flagship Digital Projects Under its National Technology Program

March 25, 2026
AI Expansion Targets Health and Education Systems in Rwanda
Technology

AI Expansion Targets Health and Education Systems in Rwanda

March 4, 2026
Rwanda Puts Technology at Core of Development, Minister Says
Technology

Rwanda Puts Technology at Core of Development, Minister Says

January 31, 2026
Liberia’s Infrastructure Push Spotlights Digital Connectivity
Technology

Liberia’s Infrastructure Push Spotlights Digital Connectivity

January 20, 2026
Niger Expands Digital Access with 1,000km of Fiber-Optic Cable
Technology

Niger Expands Digital Access with 1,000km of Fiber-Optic Cable

November 24, 2025
Benin Unveils AI Project to Preserve, Support Local Languages
Technology

Benin Unveils AI Project to Preserve, Support Local Languages

December 26, 2025

Most Recent

At WTO Talks in Yaoundé, African Nations Push to Expand Cotton Value Chain
Politics & Economy

At WTO Talks in Yaoundé, African Nations Push to Expand Cotton Value Chain

by Samira Benhadda
March 27, 2026
0

African ministers and development partners are pushing to expand cotton processing and textile production as a pathway to jobs and...

Read moreDetails
Okonjo-Iweala Urges Stronger Partnerships as WTO Launches Trade Support Program

Okonjo-Iweala Urges Stronger Partnerships as WTO Launches Trade Support Program

March 26, 2026
Women Entrepreneurs Receive First Grants as Global Trade Initiative Marks a Decade

Women Entrepreneurs Receive First Grants as Global Trade Initiative Marks a Decade

March 26, 2026
U.N. Adopts Ghana Resolution Calling Slave Trade “Gravest Crime Against Humanity”

U.N. Adopts Ghana Resolution Calling Slave Trade “Gravest Crime Against Humanity”

March 26, 2026
Senegal Launches Flagship Digital Projects Under its National Technology Program

Senegal Launches Flagship Digital Projects Under its National Technology Program

March 25, 2026
Nigeria’s State Oil Company Shifts Focus From Reserves to Sustained Revenue

Nigeria’s State Oil Company Shifts Focus From Reserves to Sustained Revenue

March 24, 2026
Tanzania Accelerates Progress Toward Universal Health Coverage

Tanzania Accelerates Progress Toward Universal Health Coverage

March 24, 2026
At WTO Talks in Yaoundé, African Nations Push to Expand Cotton Value Chain
Politics & Economy

At WTO Talks in Yaoundé, African Nations Push to Expand Cotton Value Chain

by Samira Benhadda
Reading Time: 2 mins read
March 27, 2026
0

African ministers and development partners are pushing to expand cotton processing and textile production as a pathway to jobs and...

Read moreDetails
Okonjo-Iweala Urges Stronger Partnerships as WTO Launches Trade Support Program
Finance

Okonjo-Iweala Urges Stronger Partnerships as WTO Launches Trade Support Program

by Felix Tih
Reading Time: 2 mins read
March 26, 2026
0

World Trade Organization Director-General Ngozi Okonjo-Iweala on Thursday called for stronger global partnerships as a new phase of a trade...

Read moreDetails
Women Entrepreneurs Receive First Grants as Global Trade Initiative Marks a Decade
Changemakers

Women Entrepreneurs Receive First Grants as Global Trade Initiative Marks a Decade

by Felix Tih
Reading Time: 2 mins read
March 26, 2026
0

A global fund supporting women exporters began releasing its first round of grants this week as trade officials and business...

Read moreDetails

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get the inside Story

Stay informed on the stories shaping Africa’s future. Get breaking news, in-depth analysis, opinions and exclusive insights from across the continent delivered to your inbox, free and unfiltered.


Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
WhatsApp
LinkedIn
X (Twitter)
Instagram

Related Posts

Senegal Launches Flagship Digital Projects Under its National Technology Program

Senegal Launches Flagship Digital Projects Under its National Technology Program

by Aissatou Fall
March 25, 2026
0

...

AI Expansion Targets Health and Education Systems in Rwanda

AI Expansion Targets Health and Education Systems in Rwanda

by Jane Mukami
February 23, 2026
0

...

Rwanda Puts Technology at Core of Development, Minister Says

Rwanda Puts Technology at Core of Development, Minister Says

by Amani Mwakalebela
January 23, 2026
0

...

Liberia’s Infrastructure Push Spotlights Digital Connectivity

Liberia’s Infrastructure Push Spotlights Digital Connectivity

by Seraphine Biyogo
January 20, 2026
0

...

Niger Expands Digital Access with 1,000km of Fiber-Optic Cable

Niger Expands Digital Access with 1,000km of Fiber-Optic Cable

by Samira Benhadda
November 21, 2025
0

...

Benin Unveils AI Project to Preserve, Support Local Languages

Benin Unveils AI Project to Preserve, Support Local Languages

by Cynthia N. Ganchok
November 11, 2025
0

...

At WTO Talks in Yaoundé, African Nations Push to Expand Cotton Value Chain
Politics & Economy

At WTO Talks in Yaoundé, African Nations Push to Expand Cotton Value Chain

by Samira Benhadda
Reading Time: 2 mins read
March 27, 2026
0

African ministers and development partners are pushing to expand cotton processing and textile production as a pathway to jobs and...

Read moreDetails
Okonjo-Iweala Urges Stronger Partnerships as WTO Launches Trade Support Program

Okonjo-Iweala Urges Stronger Partnerships as WTO Launches Trade Support Program

by Felix Tih
March 26, 2026
0

World Trade Organization Director-General Ngozi Okonjo-Iweala on Thursday called for stronger global partnerships as a new phase of a trade...

Women Entrepreneurs Receive First Grants as Global Trade Initiative Marks a Decade

Women Entrepreneurs Receive First Grants as Global Trade Initiative Marks a Decade

by Felix Tih
March 26, 2026
0

A global fund supporting women exporters began releasing its first round of grants this week as trade officials and business...

U.N. Adopts Ghana Resolution Calling Slave Trade “Gravest Crime Against Humanity”

U.N. Adopts Ghana Resolution Calling Slave Trade “Gravest Crime Against Humanity”

by Jane Mukami
March 25, 2026
0

The United Nations on Wednesday adopted a resolution spearheaded by Ghana that labels the transatlantic slave trade the “gravest crime...

Senegal Launches Flagship Digital Projects Under its National Technology Program

Senegal Launches Flagship Digital Projects Under its National Technology Program

by Aissatou Fall
March 25, 2026
0

Senegal officially launched a set of structural digital projects Tuesday, with Prime Minister Ousmane Sonko presiding over a ceremony in...

Next Post
Nigeria Issues $347 Million in Bonds to Clear Power Sector Debt

Nigeria Issues $347 Million in Bonds to Clear Power Sector Debt

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Our Platforms

  • Bantu Magazine
  • Bantu Brief
  • Black Frame Studio

Our Services

  • Bantu Agency
  • Advertise
  • Partnerships

Our Services

  • Editorial Director
  • Opportunities
  • Contact

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Our Platforms

  • Bantu Magazine
  • Bantu Brief
  • Black Frame Studio

Our Services

  • Bantu Agency
  • Advertise
  • Partnerships

Our Services

  • Editorial Director
  • Opportunities
  • Contact
Bantu Gazette
  • Energy & Trade
  • Finance
  • Health
  • Politics & Economy
  • Technology
  • Environment
  • Feature
  • Opinion
  • Changemakers
  • Tourism & Culture
  • Magazine