Abstract
We introduce a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. Such problems cannot be trivially addressed by existent approaches such as sequence-to-sequence [1] and Neural Turing Machines [2], because the number of target classes in each step of the output depends on the length of the input, which is variable. Problems such as sorting variable sized sequences, and various combinatorial optimization problems belong to this class. Our model solves the problem of variable size output dictionaries using a recently proposed mechanism of neural attention. It differs from the previous attention…
Citation impact
1,337
total citations
- FWCI
- 77.21
- Percentile
- 100%
- References
- 16
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Pointer (user interface)
- Computer science
- Sequence (biology)
- Artificial neural network
- Variable (mathematics)
- Theoretical computer science
- Encoder
- Convolutional neural network
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.