Bantu Gazette

Bantu Gazette
  • Energy & Trade
  • Finance
  • Health
  • Politics & Economy
  • Technology
  • Environment
  • Feature
  • Opinion
  • Changemakers
  • Tourism & Culture
  • Sports
  • Magazine
Menu
  • Black Frame Studio
  • Magazine

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026
Reading Time: 2 mins read

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026
Reading Time: 2 mins read

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get the inside Story

Stay informed on the stories shaping Africa’s future. Get breaking news, in-depth analysis, opinions and exclusive insights from across the continent delivered to your inbox, free and unfiltered.


Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
WhatsApp
LinkedIn
X (Twitter)
Instagram

Related Posts

Rwanda Puts Technology at Core of Development, Minister Says
Technology

Rwanda Puts Technology at Core of Development, Minister Says

January 31, 2026
Liberia’s Infrastructure Push Spotlights Digital Connectivity
Technology

Liberia’s Infrastructure Push Spotlights Digital Connectivity

January 20, 2026
Niger Expands Digital Access with 1,000km of Fiber-Optic Cable
Technology

Niger Expands Digital Access with 1,000km of Fiber-Optic Cable

November 24, 2025
Benin Unveils AI Project to Preserve, Support Local Languages
Technology

Benin Unveils AI Project to Preserve, Support Local Languages

December 26, 2025
Zimbabwe Approves National Artificial Intelligence Strategy
Technology

Zimbabwe Approves National Artificial Intelligence Strategy

October 18, 2025
AI Reshaping Africa’s Fiscal Systems Through Innovation
Technology

AI Reshaping Africa’s Fiscal Systems Through Innovation

October 10, 2025

Most Recent

African Leaders Push Unified Strategy on Natural Diamonds
Politics & Economy

African Leaders Push Unified Strategy on Natural Diamonds

by Naledi Kgosi
February 10, 2026
0

African diamond-producing nations must speak with a single voice to secure the future of the natural diamond industry, Namibia’s mines...

Read moreDetails
Ethiopia Launches First Smart Police Service in Africa

Ethiopia Launches First Smart Police Service in Africa

February 9, 2026
Africa Marks Largest-Ever Presence at 2026 Winter Olympics

Africa Marks Largest-Ever Presence at 2026 Winter Olympics

February 7, 2026
‘Intra-African Trade Gains Depend on Private Sector Uptake’

‘Intra-African Trade Gains Depend on Private Sector Uptake’

February 6, 2026
Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

February 6, 2026
South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

February 4, 2026
Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

February 3, 2026
African Leaders Push Unified Strategy on Natural Diamonds
Politics & Economy

African Leaders Push Unified Strategy on Natural Diamonds

by Naledi Kgosi
Reading Time: 1 min read
February 10, 2026
0

African diamond-producing nations must speak with a single voice to secure the future of the natural diamond industry, Namibia’s mines...

Read moreDetails
Ethiopia Launches First Smart Police Service in Africa
Politics & Economy

Ethiopia Launches First Smart Police Service in Africa

by Maraki Desta
Reading Time: 1 min read
February 9, 2026
0

Prime Minister Abiy Ahmed said Monday that Ethiopia has launched its first unmanned smart police service, a technology-based initiative aimed...

Read moreDetails
Africa Marks Largest-Ever Presence at 2026 Winter Olympics
Sports

Africa Marks Largest-Ever Presence at 2026 Winter Olympics

by Felix Tih
Reading Time: 3 mins read
February 7, 2026
0

Africa is represented by about 15 athletes from eight countries at the Milan-Cortina 2026 Winter Olympics, marking the continent’s largest...

Read moreDetails

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get the inside Story

Stay informed on the stories shaping Africa’s future. Get breaking news, in-depth analysis, opinions and exclusive insights from across the continent delivered to your inbox, free and unfiltered.


Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
WhatsApp
LinkedIn
X (Twitter)
Instagram

Related Posts

Rwanda Puts Technology at Core of Development, Minister Says

Rwanda Puts Technology at Core of Development, Minister Says

by Amani Mwakalebela
January 23, 2026
0

...

Liberia’s Infrastructure Push Spotlights Digital Connectivity

Liberia’s Infrastructure Push Spotlights Digital Connectivity

by Seraphine Biyogo
January 20, 2026
0

...

Niger Expands Digital Access with 1,000km of Fiber-Optic Cable

Niger Expands Digital Access with 1,000km of Fiber-Optic Cable

by Samira Benhadda
November 21, 2025
0

...

Benin Unveils AI Project to Preserve, Support Local Languages

Benin Unveils AI Project to Preserve, Support Local Languages

by Cynthia N. Ganchok
November 11, 2025
0

...

Zimbabwe Approves National Artificial Intelligence Strategy

Zimbabwe Approves National Artificial Intelligence Strategy

by Naledi Kgosi
October 16, 2025
0

...

AI Reshaping Africa’s Fiscal Systems Through Innovation

AI Reshaping Africa’s Fiscal Systems Through Innovation

by Felix Tih
October 9, 2025
0

...

African Leaders Push Unified Strategy on Natural Diamonds
Politics & Economy

African Leaders Push Unified Strategy on Natural Diamonds

by Naledi Kgosi
Reading Time: 1 min read
February 10, 2026
0

African diamond-producing nations must speak with a single voice to secure the future of the natural diamond industry, Namibia’s mines...

Read moreDetails
Ethiopia Launches First Smart Police Service in Africa

Ethiopia Launches First Smart Police Service in Africa

by Maraki Desta
February 9, 2026
0

Prime Minister Abiy Ahmed said Monday that Ethiopia has launched its first unmanned smart police service, a technology-based initiative aimed...

Africa Marks Largest-Ever Presence at 2026 Winter Olympics

Africa Marks Largest-Ever Presence at 2026 Winter Olympics

by Felix Tih
February 7, 2026
0

Africa is represented by about 15 athletes from eight countries at the Milan-Cortina 2026 Winter Olympics, marking the continent’s largest...

‘Intra-African Trade Gains Depend on Private Sector Uptake’

‘Intra-African Trade Gains Depend on Private Sector Uptake’

by Seraphine Biyogo
February 6, 2026
0

The African Continental Free Trade Area will fall short of delivering meaningful economic gains unless the private sector actively uses...

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

by Maraki Desta
February 6, 2026
0

Ghana and Zambia signed 10 agreements to expand cooperation across key sectors and introduce visa-free travel in support of African...

Next Post
Nigeria Issues $347 Million in Bonds to Clear Power Sector Debt

Nigeria Issues $347 Million in Bonds to Clear Power Sector Debt

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Our Platforms

  • Bantu Magazine
  • Bantu Brief
  • Black Frame Studio

Our Services

  • Bantu Agency
  • Advertise
  • Partnerships

Our Services

  • Editorial Director
  • Opportunities
  • Contact

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Our Platforms

  • Bantu Magazine
  • Bantu Brief
  • Black Frame Studio

Our Services

  • Bantu Agency
  • Advertise
  • Partnerships

Our Services

  • Editorial Director
  • Opportunities
  • Contact
Bantu Gazette
  • Energy & Trade
  • Finance
  • Health
  • Politics & Economy
  • Technology
  • Environment
  • Feature
  • Opinion
  • Changemakers
  • Tourism & Culture
  • Magazine