SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Shanghai Jiao Tong University · University College London
Abstract
As a new way of training generative models, Generative Adversarial Net (GAN) that uses a discriminative model to guide the training of the generative model has enjoyed considerable success in generating real-valued data. However, it has limitations when the goal is for generating sequences of discrete tokens. A major reason lies in that the discrete outputs from the generative model make it difficult to pass the gradient update from the discriminative model to the generative model. Also, the discriminative model can only assess a complete sequence, while for a partially generated sequence, it is non-trivial to balance its current score and the future one once the entire sequence has been generated. In this…
Citation impact
- FWCI
- 111.13
- Percentile
- 100%
- References
- 43
Authors
4Topics & keywords
- Discriminative model
- Sequence (biology)
- Generator (circuit theory)
- Computer science
- Discriminator
- Generative model
- Reinforcement learning
- Generative grammar
- Reduced inequalities