Connectionist temporal classification
Dalle Molle Institute for Artificial Intelligence Research · Technical University of Munich
Abstract
Many real-world sequence learning tasks require the prediction of sequences of labels from noisy, unsegmented input data. In speech recognition, for example, an acoustic signal is transcribed into words or sub-word units. Recurrent neural networks (RNNs) are powerful sequence learners that would seem well suited to such tasks. However, because they require pre-segmented training data, and post-processing to transform their outputs into label sequences, their applicability has so far been limited. This paper presents a novel method for training RNNs to label unsegmented sequences directly, thereby solving both problems. An experiment on the TIMIT speech corpus demonstrates its advantages over both a baseline…
Citation impact
- FWCI
- 9.79
- Percentile
- 100%
- References
- 15
Authors
4Topics & keywords
- TIMIT
- Computer science
- Speech recognition
- Recurrent neural network
- Connectionism
- Hidden Markov model
- Sequence (biology)
- Artificial intelligence
- Quality Education