SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

Yu, Lantao; Zhang, Weinan; Wang, Jun; Yu, Yong

doi:10.1609/aaai.v31i1.10804

Article · Proceedings of the AAAI Conference on Artificial Intelligence · Feb 13, 2017 · Diamond OA

Shanghai Jiao Tong University · University College London

Indexed in Crossref

Abstract

As a new way of training generative models, the Generative Adversarial Net (GAN), which uses a discriminative model to guide the training of the generative model, has enjoyed considerable success in generating real-valued data. However, it has limitations when the goal is to generate sequences of discrete tokens. A major reason is that the discrete outputs of the generative model make it difficult to pass the gradient update from the discriminative model to the generative model. Also, the discriminative model can only assess a complete sequence, while for a partially generated sequence it is non-trivial to balance its current score against the future score once the entire sequence has been generated. In this…
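
The gradient problem described in the abstract can be illustrated with a toy sketch. Sampling a discrete token is not differentiable, so a score from the discriminative model cannot be backpropagated into the generative model directly. The title points to policy gradient as the remedy; the block below is a generic score-function (REINFORCE) estimator under simplifying assumptions, not the authors' implementation. The names `VOCAB`, `SEQ_LEN`, `toy_discriminator`, `reinforce_step`, and `train` are hypothetical, the "generator" is a single categorical distribution shared by all positions, and the "discriminator" is a fixed function that scores only complete sequences.

```python
import math
import random

VOCAB = ["a", "b", "c"]   # hypothetical toy vocabulary
SEQ_LEN = 4               # hypothetical fixed sequence length


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_discriminator(seq):
    """Hypothetical stand-in for a discriminator.

    It scores only a complete sequence: here, the fraction of "a" tokens.
    """
    return sum(tok == "a" for tok in seq) / len(seq)


def reinforce_step(logits, rng, batch=64, lr=0.5):
    """One score-function (REINFORCE) update of the generator logits."""
    probs = softmax(logits)
    # Sample discrete token indices; this step has no usable gradient.
    samples = [
        rng.choices(range(len(VOCAB)), weights=probs, k=SEQ_LEN)
        for _ in range(batch)
    ]
    rewards = [toy_discriminator([VOCAB[i] for i in s]) for s in samples]
    baseline = sum(rewards) / batch  # batch-mean baseline to reduce variance
    grad = [0.0] * len(logits)
    for s, r in zip(samples, rewards):
        for tok in s:
            for k in range(len(logits)):
                # d log p(tok) / d logit_k = 1[k == tok] - p_k
                indicator = 1.0 if k == tok else 0.0
                grad[k] += (r - baseline) * (indicator - probs[k])
    # Gradient ascent on the expected reward.
    return [l + lr * g / batch for l, g in zip(logits, grad)]


def train(steps=200, seed=0):
    """Run repeated REINFORCE updates and return the final token probabilities."""
    rng = random.Random(seed)
    logits = [0.0] * len(VOCAB)
    for _ in range(steps):
        logits = reinforce_step(logits, rng)
    return softmax(logits)


if __name__ == "__main__":
    print(train())
```

In this sketch every token in a sampled sequence receives the same end-of-sequence reward, so probability mass shifts toward the token the toy discriminator favors. It makes no attempt to score partially generated sequences, which is the second difficulty the abstract raises.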

Citation impact

Total citations: 2,292

FWCI: 111.13
Percentile: 100%
References: 43

Authors: 4

Topics & keywords

Keywords

Discriminative model
Sequence (biology)
Generator (circuit theory)
Computer science
Discriminator
Generative model
Reinforcement learning
Generative grammar

UN Sustainable Development Goals

Reduced inequalities
