Article | ACM Transactions on Graphics | Jul 20, 2017 | Closed access

Synthesizing Obama

University of Washington

Indexed in Crossref

Abstract

Given audio of President Barack Obama, we synthesize a high quality video of him speaking with accurate lip sync, composited into a target video clip. Trained on many hours of his weekly address footage, a recurrent neural network learns the mapping from raw audio features to mouth shapes. Given the mouth shape at each time instant, we synthesize high quality mouth texture, and composite it with proper 3D pose matching to change what he appears to be saying in a target video to match the input audio track. Our approach produces photorealistic results.
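The audio-to-mouth-shape step described in the abstract can be illustrated with a minimal sketch. The abstract says only that a recurrent neural network maps audio features to mouth shapes; everything else here is an assumption made for illustration: the plain tanh recurrent cell, the name `TinyRecurrentMapper`, the dimensions (13 audio features, 8 hidden units, 4 mouth-shape coefficients), and the random, untrained weights. It shows the shape of the computation, not the authors' network.

```python
import math
import random


def make_matrix(rows, cols, rng, scale=0.1):
    """Return a rows x cols matrix of small random weights (untrained placeholders)."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (given as a list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class TinyRecurrentMapper:
    """Hypothetical tanh recurrent cell: audio feature frames in, mouth-shape vectors out."""

    def __init__(self, n_audio, n_hidden, n_mouth, seed=0):
        rng = random.Random(seed)
        self.n_hidden = n_hidden
        self.w_in = make_matrix(n_hidden, n_audio, rng)    # audio frame -> hidden
        self.w_rec = make_matrix(n_hidden, n_hidden, rng)  # previous hidden -> hidden
        self.w_out = make_matrix(n_mouth, n_hidden, rng)   # hidden -> mouth shape

    def run(self, audio_frames):
        """Return one mouth-shape vector per input audio frame."""
        hidden = [0.0] * self.n_hidden
        mouth_shapes = []
        for frame in audio_frames:
            # The hidden state carries context from earlier frames into this one.
            drive = [a + b for a, b in zip(matvec(self.w_in, frame),
                                           matvec(self.w_rec, hidden))]
            hidden = [math.tanh(v) for v in drive]
            mouth_shapes.append(matvec(self.w_out, hidden))
        return mouth_shapes


if __name__ == "__main__":
    rng = random.Random(1)
    # Five made-up audio frames with 13 features each (assumed dimension).
    frames = [[rng.uniform(-1, 1) for _ in range(13)] for _ in range(5)]
    mapper = TinyRecurrentMapper(n_audio=13, n_hidden=8, n_mouth=4)
    for shape in mapper.run(frames):
        print([round(v, 4) for v in shape])
```

Because the hidden state is fed back at every step, the same audio frame can yield a different mouth shape depending on what preceded it, which is the property that makes a recurrent model a natural fit for speech. In the pipeline the abstract describes, each per-frame mouth shape would then drive mouth texture synthesis and compositing into the target video; those stages are not sketched here.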

Citation impact

Total citations: 1,069
FWCI: 30.96
Percentile: 100%
References: 51

Authors

3 authors

Topics & keywords

Keywords
  • Computer science
  • sync
  • Artificial intelligence
  • Texture (cosmology)
  • Computer vision
  • Computer graphics (images)
  • Track (disk drive)
  • Image (mathematics)

Funding