Bantu Gazette


Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

By Felix Tih
February 2, 2026


Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named after the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.
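For a rough sense of scale, the reported figures can be turned into two back-of-the-envelope numbers: the average length of a recording and the share of the corpus that is transcribed. The sketch below is illustrative only. It treats "more than 11,000 hours" and "nearly 2 million recordings" as round values, and the function names are hypothetical, not part of any Google or Hugging Face tooling.

```python
# Back-of-the-envelope arithmetic on the figures reported in the article.
# The inputs are rounded ("more than 11,000 hours", "nearly 2 million
# recordings"), so the results are approximations, not official statistics.

def average_clip_seconds(total_hours: float, num_recordings: int) -> float:
    """Average recording length in seconds, given total hours and clip count."""
    return total_hours * 3600 / num_recordings


def share_of_total(part_hours: float, total_hours: float) -> float:
    """Fraction of the corpus (0..1) represented by part_hours."""
    return part_hours / total_hours


if __name__ == "__main__":
    total_hours = 11_000        # reported as "more than 11,000 hours"
    recordings = 2_000_000      # reported as "nearly 2 million"
    transcribed_hours = 1_250   # reported as "about 1,250 hours"

    avg = average_clip_seconds(total_hours, recordings)
    share = share_of_total(transcribed_hours, total_hours)
    print(f"Average recording length: about {avg:.1f} seconds")
    print(f"Transcribed share of corpus: about {share:.1%}")
```

On these rounded inputs, a typical recording runs roughly 20 seconds, and a little over a tenth of the audio comes with transcriptions.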

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Felix Tih
Editorial Director, Bantu Gazette





Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.
