Racial disparities in automated speech recognition

Koenecke, Allison; Nam, Andrew; Lake, Emily; Nudell, Joe; Quartey, Minnie; Mengesha, Zion; Toups, Connor; Rickford, John R.; Jurafsky, Dan; Goel, Sharad

doi:10.1073/pnas.1915768117

articleProceedings of the National Academy of SciencesMar 23, 2020HYBRID OA

Racial disparities in automated speech recognition

AKAllison Koenecke ANAndrew Nam ELEmily Lake JNJoe Nudell MQMinnie Quartey

Stanford University · Georgetown University

PubMed

Indexed incrossrefpubmed

Abstract

Automated speech recognition (ASR) systems, which use sophisticated machine-learning algorithms to convert spoken language to text, have become increasingly widespread, powering popular virtual assistants, facilitating automated closed captioning, and enabling digital dictation platforms for health care. Over the last several years, the quality of these systems has dramatically improved, due both to advances in deep learning and to the collection of large-scale datasets used to train the systems. There is concern, however, that these tools do not work equally well for all subgroups of the population. Here, we examine the ability of five state-of-the-art ASR systems-developed by Amazon, Apple, Google, IBM, and…

Citation impact

657

total citations

FWCI: 106.13
Percentile: 100%
References: 39

Citations per year

Authors

10

Topics & keywords

Topics

Keywords

Dictation
Computer science
Speech recognition
Population
Closed captioning
Natural language processing
IBM
Artificial intelligence

UN Sustainable Development Goals

Quality Education

No related works found for this paper.