GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash\n Equilibrium
Indexed inarxiv
Abstract
Generative Adversarial Networks (GANs) excel at creating realistic images\nwith complex models for which maximum likelihood is infeasible. However, the\nconvergence of GAN training has still not been proved. We propose a two\ntime-scale update rule (TTUR) for training GANs with stochastic gradient\ndescent on arbitrary GAN loss functions. TTUR has an individual learning rate\nfor both the discriminator and the generator. Using the theory of stochastic\napproximation, we prove that the TTUR converges under mild assumptions to a\nstationary local Nash equilibrium. The convergence carries over to the popular\nAdam optimization, for which we prove that it follows the dynamics of a heavy\nball with friction and…
Citation impact
3,814
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Citations per year
Authors
5Topics & keywords
Topics
Keywords
- Discriminator
- Computer science
- Maxima and minima
- Stochastic gradient descent
- Nash equilibrium
- Convergence (economics)
- Benchmark (surveying)
- Generator (circuit theory)
UN Sustainable Development Goals
- Reduced inequalities
No related works found for this paper.