Bantu Gazette

Bantu Gazette
  • Energy & Trade
  • Finance
  • Health
  • Politics & Economy
  • Technology
  • Environment
  • Feature
  • Opinion
  • Changemakers
  • Tourism & Culture
  • Sports
  • Magazine
Menu
  • Black Frame Studio
  • Magazine

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026
Reading Time: 2 mins read

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 12, 2026
Reading Time: 2 mins read

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get the inside Story

Stay informed on the stories shaping Africa’s future. Get breaking news, in-depth analysis, opinions and exclusive insights from across the continent delivered to your inbox, free and unfiltered.


Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
WhatsApp
LinkedIn
X (Twitter)
Instagram

Related Posts

Safaricom Ethiopia Records 130.9% Revenue Growth
Technology

Safaricom Ethiopia Records 130.9% Revenue Growth

May 12, 2026
Ghana Launches National AI Strategy to Drive Digital Transformation, Economic Growth
Technology

Ghana Launches National AI Strategy to Drive Digital Transformation, Economic Growth

April 25, 2026
Young Ethiopian Founders Turn Ideas Into Real Solutions
Feature

Young Ethiopian Founders Turn Ideas Into Real Solutions

April 10, 2026
Digital Technologies Are Africa’s Greatest Leapfrog Opportunity
Technology

Digital Technologies Are Africa’s Greatest Leapfrog Opportunity

April 8, 2026
Nigeria Awards ₦2.5 Billion in Grants to 45 Student Ventures
Technology

Nigeria Awards ₦2.5 Billion in Grants to 45 Student Ventures

April 1, 2026
African Leaders Urged to Accelerate Use of Technology for Growth
Technology

African Leaders Urged to Accelerate Use of Technology for Growth

March 29, 2026

Most Recent

Rwanda Highlights Skills Development in Creative Economy Strategy
Tourism & Culture

Rwanda Highlights Skills Development in Creative Economy Strategy

by Jane Mukami
May 15, 2026
0

KIGALI Rwanda is investing in skills development as part of its strategy to expand the creative economy, with officials calling...

Read moreDetails
Ethiopia Export Earnings Surge to $8.7 Billion in Ten Months

Ethiopia Export Earnings Surge to $8.7 Billion in Ten Months

May 15, 2026
Liberia to Build its 1st Electrical Manufacturing Plant in $26M Deal with Kenyan Firm

Liberia to Build its 1st Electrical Manufacturing Plant in $26M Deal with Kenyan Firm

May 14, 2026
South Africa Allocates $1.7 Billion to Road Agency for Network Expansion

South Africa Allocates $1.7 Billion to Road Agency for Network Expansion

May 13, 2026
Ethiopia Receives Emperor Tewodros II Relics as African Heritage Repatriations Gather Pace

Ethiopia Receives Emperor Tewodros II Relics as African Heritage Repatriations Gather Pace

May 13, 2026
Rwanda Secures €45M to Expand Climate-Resilient Irrigation in Drought-Prone East

Rwanda Secures €45M to Expand Climate-Resilient Irrigation in Drought-Prone East

May 12, 2026
Ethiopian Airlines Named Fastest Growing Airline as Award Streak Continues

Ethiopian Airlines Named Fastest Growing Airline as Award Streak Continues

May 12, 2026
Rwanda Highlights Skills Development in Creative Economy Strategy
Tourism & Culture

Rwanda Highlights Skills Development in Creative Economy Strategy

by Jane Mukami
Reading Time: 2 mins read
May 15, 2026
0

KIGALI Rwanda is investing in skills development as part of its strategy to expand the creative economy, with officials calling...

Read moreDetails
Ethiopia Export Earnings Surge to $8.7 Billion in Ten Months
Politics & Economy

Ethiopia Export Earnings Surge to $8.7 Billion in Ten Months

by Kalkidan Negash
Reading Time: 1 min read
May 15, 2026
0

Ethiopia’s export revenues rose 43% to $8.71 billion in the first ten months of the current fiscal year, beating the...

Read moreDetails
Liberia to Build its 1st Electrical Manufacturing Plant in $26M Deal with Kenyan Firm
Energy & Trade

Liberia to Build its 1st Electrical Manufacturing Plant in $26M Deal with Kenyan Firm

by Aissatou Fall
Reading Time: 2 mins read
May 14, 2026
0

The Liberia Electricity Corporation (LEC) signed a $26 million agreement with Kenyan firm Thames Electricals Limited on Tuesday to establish...

Read moreDetails

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

Google Launches WAXAL Open Dataset for 21 African Languages

WAXAL

Felix Tihby Felix Tih
February 2, 2026

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

Named from the Wolof word for “speak,” WAXAL contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get the inside Story

Stay informed on the stories shaping Africa’s future. Get breaking news, in-depth analysis, opinions and exclusive insights from across the continent delivered to your inbox, free and unfiltered.


Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
WhatsApp
LinkedIn
X (Twitter)
Instagram

Related Posts

Safaricom Ethiopia Records 130.9% Revenue Growth

Safaricom Ethiopia Records 130.9% Revenue Growth

by Kalkidan Negash
May 11, 2026
0

...

Ghana Launches National AI Strategy to Drive Digital Transformation, Economic Growth

Ghana Launches National AI Strategy to Drive Digital Transformation, Economic Growth

by Elise Ntebah
April 25, 2026
0

...

Young Ethiopian Founders Turn Ideas Into Real Solutions

Young Ethiopian Founders Turn Ideas Into Real Solutions

by Abel Gorfu Asefa
April 10, 2026
0

...

Digital Technologies Are Africa’s Greatest Leapfrog Opportunity

Digital Technologies Are Africa’s Greatest Leapfrog Opportunity

by Samira Benhadda
April 8, 2026
0

...

Nigeria Awards ₦2.5 Billion in Grants to 45 Student Ventures

Nigeria Awards ₦2.5 Billion in Grants to 45 Student Ventures

by Cynthia N. Ganchok
April 1, 2026
0

...

African Leaders Urged to Accelerate Use of Technology for Growth

African Leaders Urged to Accelerate Use of Technology for Growth

by Samira Benhadda
March 29, 2026
0

...

Rwanda Highlights Skills Development in Creative Economy Strategy
Tourism & Culture

Rwanda Highlights Skills Development in Creative Economy Strategy

by Jane Mukami
Reading Time: 2 mins read
May 15, 2026
0

KIGALI Rwanda is investing in skills development as part of its strategy to expand the creative economy, with officials calling...

Read moreDetails
Ethiopia Export Earnings Surge to $8.7 Billion in Ten Months

Ethiopia Export Earnings Surge to $8.7 Billion in Ten Months

by Kalkidan Negash
May 15, 2026
0

Ethiopia’s export revenues rose 43% to $8.71 billion in the first ten months of the current fiscal year, beating the...

Liberia to Build its 1st Electrical Manufacturing Plant in $26M Deal with Kenyan Firm

Liberia to Build its 1st Electrical Manufacturing Plant in $26M Deal with Kenyan Firm

by Aissatou Fall
May 14, 2026
0

The Liberia Electricity Corporation (LEC) signed a $26 million agreement with Kenyan firm Thames Electricals Limited on Tuesday to establish...

South Africa Allocates $1.7 Billion to Road Agency for Network Expansion

South Africa Allocates $1.7 Billion to Road Agency for Network Expansion

by Naledi Kgosi
May 13, 2026
0

South Africa's government has allocated nearly R31 billion (about $1.7 billion) to the country's national roads agency this financial year...

Ethiopia Receives Emperor Tewodros II Relics as African Heritage Repatriations Gather Pace

Ethiopia Receives Emperor Tewodros II Relics as African Heritage Repatriations Gather Pace

by Kalkidan Negash
May 13, 2026
0

ADDIS ABABA Ethiopia formally received relics belonging to Emperor Tewodros II on Tuesday during a ceremony at St Martin's Chapel...

Next Post
Nigeria Issues $347 Million in Bonds to Clear Power Sector Debt

Nigeria Issues $347 Million in Bonds to Clear Power Sector Debt

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

Ghana Launches Shea Hub to Boost Rural Economy, Women’s Empowerment

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

South Africa Joins Afreximbank, Unlocks $8 Billion Country Program

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Ghana, Zambia Strike 10 Deals, Approve Visa-Free Travel

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Our Platforms

  • Bantu Magazine
  • Bantu Brief
  • Black Frame Studio

Our Services

  • Bantu Agency
  • Advertise
  • Partnerships

Our Services

  • Editorial Director
  • Opportunities
  • Contact

Bantu Gazette is a pioneering news platform that champions Africa's development, culture, and heritage. We spotlight the continent's successes, address its challenges, and provide insightful coverage of events that shape its future.

Our Platforms

  • Bantu Magazine
  • Bantu Brief
  • Black Frame Studio

Our Services

  • Bantu Agency
  • Advertise
  • Partnerships

Our Services

  • Editorial Director
  • Opportunities
  • Contact
Bantu Gazette
  • Energy & Trade
  • Finance
  • Health
  • Politics & Economy
  • Technology
  • Environment
  • Feature
  • Opinion
  • Changemakers
  • Tourism & Culture
  • Magazine