End-to-end text-dependent speaker verification

Heigold, Georg; Moreno, Ignacio López; Bengio, Samy; Shazeer, Noam

doi:10.1109/icassp.2016.7472652

articleMar 1, 2016Closed access

End-to-end text-dependent speaker verification

GHGeorg Heigold ILIgnacio López Moreno SBSamy Bengio NSNoam Shazeer

Saarland University · Google (United States)

Indexed incrossref

Abstract

In this paper we present a data-driven, integrated approach to speaker verification, which maps a test utterance and a few reference utterances directly to a single score for verification and jointly optimizes the system's components using the same evaluation protocol and metric as at test time. Such an approach will result in simple and efficient systems, requiring little domain-specific knowledge and making few model assumptions. We implement the idea by formulating the problem as a single neural network architecture, including the estimation of a speaker model on only a few utterances, and evaluate it on our internal "Ok Google" benchmark for text-dependent speaker verification. The proposed approach…

Citation impact

531

total citations

FWCI: 85.28
Percentile: 100%
References: 39

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Speaker verification
Utterance
Benchmark (surveying)
End-to-end principle
Metric (unit)
Domain (mathematical analysis)
Speech recognition

UN Sustainable Development Goals

Quality Education

No related works found for this paper.