Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks
Indexed inarxivdatacite
Abstract
Recurrent Neural Networks can be trained to produce sequences of tokens given some input, as exemplified by recent results in machine translation and image captioning. The current approach to training them consists of maximizing the likelihood of each token in the sequence given the current (recurrent) state and the previous token. At inference, the unknown previous token is then replaced by a token generated by the model itself. This discrepancy between training and inference can yield errors that can accumulate quickly along the generated sequence. We propose a curriculum learning strategy to gently change the training process from a fully guided scheme using the true previous token, towards a less guided…
Citation impact
1,264
total citations
- FWCI
- —
- Percentile
- —
- References
- 25
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Security token
- Closed captioning
- Computer science
- Inference
- Sequence (biology)
- Scheme (mathematics)
- Artificial neural network
- Artificial intelligence
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.