Recent Advances in End-to-End Automatic Speech Recognition

Li, Jinyu

doi:10.1561/116.00000050

articleAPSIPA Transactions on Signal and Information ProcessingApr 20, 2022GOLD OA

Recent Advances in End-to-End Automatic Speech Recognition

JLJinyu Li

Microsoft (United States) · Microsoft Research (United Kingdom)

Indexed incrossrefdoaj

Abstract

Recently, the speech community is seeing a significant trend of moving from deep neural network based hybrid modeling to end-to-end (E2E) modeling for automatic speech recognition (ASR). While E2E models achieve the state-of-the-art results in most benchmarks in terms of ASR accuracy, hybrid models are still used in a large proportion of commercial ASR systems at the current time. There are lots of practical factors that affect the production model deployment decision. Traditional hybrid models, being optimized for production for decades, are usually good at these factors. Without providing excellent solutions to all these factors, it is hard for E2E models to be widely commercialized. In this paper, we will…

Citation impact

365

total citations

FWCI: 48.97
Percentile: 100%
References: 302

Citations per year

Authors

1

JL
Jinyu LiCorresponding
Microsoft (United States), Microsoft Research (United Kingdom)

Topics & keywords

Topics

Keywords

End-to-end principle
Computer science
Speech recognition
Artificial intelligence

No related works found for this paper.