Adversarial Learning for Neural Dialogue Generation
Stanford University · New York University · +1 more institution
Abstract
In this paper, drawing intuition from the Turing test, we propose using adversarial training for open-domain dialogue generation: the system is trained to produce sequences that are indistinguishable from human-generated dialogue utterances. We cast the task as a reinforcement learning (RL) problem where we jointly train two systems, a generative model to produce response sequences, and a discriminator-analagous to the human evaluator in the Turing test-to distinguish between the human-generated dialogues and the machine-generated ones. The outputs from the discriminator are then used as rewards for the generative model, pushing the system to generate dialogues that mostly resemble human dialogues.
Citation impact
- FWCI
- 100.82
- Percentile
- 100%
- References
- 61
Authors
6Topics & keywords
- Adversarial system
- Turing test
- Computer science
- Discriminator
- Artificial intelligence
- Adversary
- Generative grammar
- Metric (unit)
- Reduced inequalities