SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

Shanghai Jiao Tong University · University College London

Indexed incrossref

Abstract

As a new way of training generative models, Generative Adversarial Net (GAN) that uses a discriminative model to guide the training of the generative model has enjoyed considerable success in generating real-valued data. However, it has limitations when the goal is for generating sequences of discrete tokens. A major reason lies in that the discrete outputs from the generative model make it difficult to pass the gradient update from the discriminative model to the generative model. Also, the discriminative model can only assess a complete sequence, while for a partially generated sequence, it is non-trivial to balance its current score and the future one once the entire sequence has been generated. In this…

Citation impact

2,292
total citations
FWCI
111.13
Percentile
100%
References
43
Citations per year

Authors

4

Topics & keywords

Keywords
  • Discriminative model
  • Sequence (biology)
  • Generator (circuit theory)
  • Computer science
  • Discriminator
  • Generative model
  • Reinforcement learning
  • Generative grammar
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.