Bantu Gazette


Google Launches WAXAL Open Dataset for 21 African Languages

New speech resource aims to address data gaps limiting voice technology development across sub-Saharan Africa

By Felix Tih
February 2, 2026

Google has launched a speech dataset covering 21 African languages to improve voice technology across the continent, the tech giant said in a statement on Monday.

WAXAL, named after the Wolof word for “speak,” contains more than 11,000 hours of speech data drawn from nearly 2 million individual recordings.

The dataset includes about 1,250 hours of transcribed speech for automatic speech recognition and more than 20 hours of studio-quality recordings designed for text-to-speech voice synthesis, Google said.

The project was developed over three years to support research and product development in regions where voice-enabled technologies remain limited due to a lack of accessible, high-quality local-language data.

Sub-Saharan Africa is home to more than 2,000 distinct languages, many of which are underrepresented in global technology systems.

Data collection was led by African institutions. Makerere University in Uganda and the University of Ghana coordinated work on a combined 13 languages, while Digital Umuganda in Rwanda oversaw data gathering for five major languages.

Studio recordings were produced with Media Trust and Loud n Clear, and the African Institute for Mathematical Sciences contributed multilingual data intended for future releases.

According to Google, the project was designed to ensure that partner institutions retain ownership of the data they collected while making the dataset available to the global research community under an open license.

To capture natural speech patterns, contributors were asked to describe images in their native languages.

Professional voice actors were also recorded in studio settings to support speech synthesis research.

The WAXAL dataset is available on the Hugging Face platform, alongside a technical paper detailing the methodology.

Languages included in the dataset are: Acholi, Akan, Dagaare, Dagbani, Dholuo, Ewe, Fante, Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Kikuyu, Lingala, Luganda, Malagasy, Masaaba, Nyankole, Rukiga, Shona, Soga (Lusoga), Swahili and Yoruba.

Google said the initiative is also intended to support the digital preservation of African languages alongside technological development.

Get in touch for more:
Felix Tih
Editorial Director, Bantu Gazette
