Racial disparities in automated speech recognition
Stanford University · Georgetown University
Abstract
Automated speech recognition (ASR) systems, which use sophisticated machine-learning algorithms to convert spoken language to text, have become increasingly widespread, powering popular virtual assistants, facilitating automated closed captioning, and enabling digital dictation platforms for health care. Over the last several years, the quality of these systems has dramatically improved, due both to advances in deep learning and to the collection of large-scale datasets used to train the systems. There is concern, however, that these tools do not work equally well for all subgroups of the population. Here, we examine the ability of five state-of-the-art ASR systems-developed by Amazon, Apple, Google, IBM, and…
Citation impact
- FWCI
- 106.13
- Percentile
- 100%
- References
- 39
Authors
10Topics & keywords
- Dictation
- Computer science
- Speech recognition
- Population
- Closed captioning
- Natural language processing
- IBM
- Artificial intelligence
- Quality Education