Deep neural networks for small footprint text-dependent speaker verification
Johns Hopkins University · Google (United States)
Abstract
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-dependent speaker verification task. At development stage, a DNN is trained to classify speakers at the framelevel. During speaker enrollment, the trained DNN is used to extract speaker specific features from the last hidden layer. The average of these speaker features, or d-vector, is taken as the speaker model. At evaluation stage, a d-vector is extracted for each utterance and compared to the enrolled speaker model to make a verification decision. Experimental results show the DNN based speaker verification system achieves good performance compared to a popular i-vector system on a small footprint text-dependent…
Citation impact
- FWCI
- 49.93
- Percentile
- 100%
- References
- 29
Authors
5Topics & keywords
- Computer science
- Speaker verification
- Speech recognition
- Speaker recognition
- Word error rate
- Artificial neural network
- Footprint
- Task (project management)
- Quality Education