articleProceedings of the National Academy of SciencesMar 23, 2020HYBRID OA

Racial disparities in automated speech recognition

Stanford University · Georgetown University

PubMed
Indexed incrossrefpubmed

Abstract

Automated speech recognition (ASR) systems, which use sophisticated machine-learning algorithms to convert spoken language to text, have become increasingly widespread, powering popular virtual assistants, facilitating automated closed captioning, and enabling digital dictation platforms for health care. Over the last several years, the quality of these systems has dramatically improved, due both to advances in deep learning and to the collection of large-scale datasets used to train the systems. There is concern, however, that these tools do not work equally well for all subgroups of the population. Here, we examine the ability of five state-of-the-art ASR systems-developed by Amazon, Apple, Google, IBM, and…

Citation impact

657
total citations
FWCI
106.13
Percentile
100%
References
39
Citations per year

Authors

10

Topics & keywords

Keywords
  • Dictation
  • Computer science
  • Speech recognition
  • Population
  • Closed captioning
  • Natural language processing
  • IBM
  • Artificial intelligence
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.