Overcoming bias in speech recognition with Speechmatics

Speechmatics’ Data Science Engineer, Benedetta Cevoli on the importance of speech recognition software and how to eliminate bias within the technology

Increasingly embedded into people’s professional and personal lives, speech recognition software can be found in voice assistants, driverless cars and in contact centres.

Speechmatics, global experts in deep learning and speech recognition, provide autonomous speech recognition technology that has the ability to understand all voices. The company looks to develop technology that is as unbiased as possible.

Last year, a study revealed that Speechmatics’ software outperformed tech giants including Google and Amazon when it comes to addressing bias in speech recognition.

Commenting on the importance of this software, particularly in business, Speechmatics’ Data Science Engineer, Benedetta Cevoli spoke to AI Magazine, she said: “The core value of speech recognition, especially in a business environment, is the rich understanding it provides. Businesses of all kinds are looking for ways to understand what their customers want: using speech recognition can play a major role in this, giving breadth and depth to understanding what clients are saying, whether it be by accurately recording language or deriving insights from the text produced.”

Issues of bias in speech recognition

With many artificial intelligence (AI) and machine learning (ML) technologies, speech recognition technology relies on the data it is trained on for its quality. As the data for speech recognition has been limited as it comes from a small section of society, the technology can contain significant bias if not tackled.

This bias can massively reduce the context in which speech recognition works well. Having in-built bias can mean there are many real-life scenarios where minority voices are not being understood effectively, with real-life consequences.

“Take, for instance, an emergency call and the need to transcribe key information - being misunderstood at that moment could be life-affecting,” explained Cevoli

The Data Science Engineer also outlined a way to overcome such issues, she said: “By using self-supervised learning techniques we can mitigate against this: they take vast amounts of unlabelled data and use some property of the data itself to construct a supervised task, without the need for human intervention. Using this method, we can massively increase the amount of data machines learn from and thus give them a significantly more accurate representation of all voices.”

Benedetta Cevoli

Tackling bias in speech recognition

Monitoring is also a crucial step in the fight against bias in speech recognition. Cevoli explained how this can be achieved: “We need to build systems that can zero in on problems as soon as they arise and then formulate strategies to assess how best to tackle them.” 

She concluded by outlining how to tackle these biases as soon as it has been recognised: “From the earliest stages of development, right through to the late application of these technologies, regularly questioning whether biases are being addressed is crucial. Questions like: are we thinking about the problem in a uniform way? Is the approach we're taking inclusive? Are we subconsciously marginalising a particular group of people?”

“As soon as we start designing any new technology, we need to build in ways to evaluate the role of diversity and inclusion at regular intervals and marry this with continuous research into ways in which we can enhance this evaluation process. It's also of critical importance that the teams working on these new technologies are diverse within themselves,” Cevoli added.


Featured Articles

Data & the IoT key to enabling smart cities of the future

Data is the lifeblood of smart cities like Barcelona, transforming everything from shopping and transportation to autonomous driving and augmented reality

ServiceNow & Nvidia to build enterprise-grade generative AI

ServiceNow and Nvidia announced a partnership to develop powerful, enterprise-grade generative AI capabilities that can transform business processes

IBM and SAP accelerating the rate of AI innovation

IBM aims to provide SAP customers with a better user experience, faster decision-making and greater insights to help transform their business processes

Google I/O: Google shares details of Duet AI collaborator


45% of executives state ChatGPT has increased AI investment

AI Strategy

Top 10 quantum computing companies globally in 2023