A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER)
National Institute of Standards and Technology
Abstract
Describes a system developed at NIST to produce a composite automatic speech recognition (ASR) system output when the outputs of multiple ASR systems are available, and for which, in many cases, the composite ASR output has a lower error rate than any of the individual systems. The system implements a "voting" or rescoring process to reconcile differences in ASR system outputs. We refer to this system as the NIST Recognizer Output Voting Error Reduction (ROVER) system. As additional knowledge sources are added to an ASR system (e.g. acoustic and language models), error rates are typically decreased. This paper describes a post-recognition process which models the output generated by multiple ASR systems as…
Citation impact
- FWCI
- 50.30
- Percentile
- 100%
- References
- 1
Authors
1Topics & keywords
- Computer science
- Word error rate
- NIST
- Voting
- Word (group theory)
- Speech recognition
- Reduction (mathematics)
- Process (computing)