The Mozilla Common Voice initiative has released a new, expanded data set featuring 16 new languages — like Basaa and Kazakh — and 4,622 new hours

Mozilla Common Voice Adds 16 New Languages and 4,600 New Hours of Speech

submited by
Style Pass
2021-08-05 13:00:02

The Mozilla Common Voice initiative has released a new, expanded data set featuring 16 new languages — like Basaa and Kazakh — and 4,622 new hours of speech.

Mozilla Common Voice is an open-source initiative to make voice technology more inclusive. Contributors donate speech data to a public dataset, which anyone can then use to train voice-enabled technology.

Says Hillary Juma, Common Voice Community Manager: “Internet access is increasingly mediated through speech: Voice assistants and smart speakers give us directions, search for information, connect us to friends, used in assistive technology and much more. Yet this technology doesn’t work for millions of people. For example, neither Amazon’s Alexa, Apple’s Siri, nor Google Home support a single native African language.”

Hillary continues: “By giving individuals the ability to share their speech, we can help ensure all communities have access to voice technology and the opportunity it unlocks."

Leave a Comment