articleOct 12, 2020GREEN OA

A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild

International Institute of Information Technology, Hyderabad · University of Bath

Indexed inarxivcrossref

Abstract

In this work, we investigate the problem of lip-syncing a talking face video of an arbitrary identity to match a target speech segment. Current works excel at producing accurate lip movements on a static image or videos of specific people seen during the training phase. However, they fail to accurately morph the lip movements of arbitrary identities in dynamic, unconstrained talking face videos, resulting in significant parts of the video being out-of-sync with the new audio. We identify key reasons pertaining to this and hence resolve them by learning from a powerful lip-sync discriminator. Next, we propose new, rigorous evaluation benchmarks and metrics to accurately measure lip synchronization in…

No related works found for this paper.