Sequence to Sequence Learning with Neural Networks
Indexed inarxivdatacite
Abstract
Deep Neural Networks (DNNs) are powerful models that have achieved excellent performance on difficult learning tasks. Although DNNs work well whenever large labeled training sets are available, they cannot be used to map sequences to sequences. In this paper, we present a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure. Our method uses a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector. Our main result is that on an English to French translation task from the WMT'14 dataset, the translations produced by the LSTM achieve a…
Citation impact
13,340
total citations
- FWCI
- —
- Percentile
- —
- References
- 29
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Computer science
- Artificial intelligence
- Sentence
- Phrase
- Sequence (biology)
- Natural language processing
- Word (group theory)
- Task (project management)
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.