Librispeech: An ASR corpus based on public domain audio books

Panayotov, Vassil; Chen, Guoguo; Povey, Daniel; Khudanpur, Sanjeev

doi:10.1109/icassp.2015.7178964

Article · Apr 1, 2015 · Closed access

Johns Hopkins University

Indexed in Crossref

Abstract

This paper introduces a new corpus of read English speech, suitable for training and evaluating speech recognition systems. The LibriSpeech corpus is derived from audiobooks that are part of the LibriVox project, and contains 1000 hours of speech sampled at 16 kHz. We have made the corpus freely available for download, along with separately prepared language-model training data and pre-built language models. We show that acoustic models trained on LibriSpeech give lower error rate on the Wall Street Journal (WSJ) test sets than models trained on WSJ itself. We are also releasing Kaldi scripts that make it easy to build these systems.

Citation impact

Total citations: 5,958
FWCI: 137.12
Percentile: 100%
References: 29

Authors: 4

Topics & keywords

Keywords

Scripting language
Computer science
Speech recognition
Language model
Word error rate
Download
Acoustic model
Natural language processing

UN Sustainable Development Goals

Quality Education
