Tandem connectionist feature extraction for conventional HMM systems
International Computer Science Institute · Oregon Museum of Science and Industry
Abstract
Hidden Markov model speech recognition systems typically use Gaussian mixture models to estimate the distributions of decorrelated acoustic feature vectors that correspond to individual subword units. By contrast, hybrid connectionist-HMM systems use discriminatively-trained neural networks to estimate the probability distribution among subword units given the acoustic observations. In this work we show a large improvement in word recognition performance by combining neural-net discriminative feature processing with Gaussian-mixture distribution modeling. By training the network to generate the subword probability posteriors, then using transformations of these estimates as the base features for a…
Citation impact
- FWCI
- 19.18
- Percentile
- 100%
- References
- 14
Authors
3Topics & keywords
- Hidden Markov model
- Discriminative model
- Computer science
- Connectionism
- Pattern recognition (psychology)
- Mixture model
- Speech recognition
- Word error rate
- Reduced inequalities