A Structured Self-attentive Sentence Embedding
Indexed in: arXiv, DataCite
Abstract
This paper proposes a new model for extracting an interpretable sentence embedding by introducing self-attention. Instead of using a vector, we use a 2-D matrix to represent the embedding, with each row of the matrix attending on a different part of the sentence. We also propose a self-attention mechanism and a special regularization term for the model. As a side effect, the embedding comes with an easy way of visualizing what specific parts of the sentence are encoded into the embedding. We evaluate our model on 3 different tasks: author profiling, sentiment classification, and textual entailment. Results show that our model yields a significant performance gain compared to other sentence embedding methods in…
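The abstract names a matrix-valued embedding, a self-attention mechanism, and a regularization term without giving their equations. The sketch below shows one common way to realize such a mechanism, under stated assumptions: attention weights come from a two-layer projection with a tanh nonlinearity and a row-wise softmax, the embedding is the attention-weighted sum of token states, and the penalty is the squared Frobenius norm of A·Aᵀ − I. All function names, weight names (`W1`, `W2`), and sizes are hypothetical, and the token states are random placeholders rather than the output of a trained encoder.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    """Swap rows and columns of a list-of-rows matrix."""
    return [list(col) for col in zip(*m)]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def self_attentive_embedding(H, W1, W2):
    """H: n x d token states; W1: d_a x d; W2: r x d_a (hypothetical names).

    Returns (A, M): A is the r x n attention matrix, M = A H is the r x d embedding.
    """
    hidden = [[math.tanh(v) for v in row] for row in matmul(W1, transpose(H))]  # d_a x n
    scores = matmul(W2, hidden)               # r x n, one score row per attention hop
    A = [softmax(row) for row in scores]      # each row sums to 1 over the n tokens
    M = matmul(A, H)                          # each row of M summarizes part of the sentence
    return A, M

def redundancy_penalty(A):
    """Squared Frobenius norm of (A A^T - I); it shrinks as rows focus on different tokens."""
    gram = matmul(A, transpose(A))
    size = len(gram)
    return sum((gram[i][j] - (1.0 if i == j else 0.0)) ** 2
               for i in range(size) for j in range(size))

def random_matrix(rows, cols, rng):
    """Placeholder weights or states drawn uniformly from [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
n, d, d_a, r = 5, 4, 3, 2   # tokens, state size, attention hidden size, attention rows
H = random_matrix(n, d, rng)
W1 = random_matrix(d_a, d, rng)
W2 = random_matrix(r, d_a, rng)
A, M = self_attentive_embedding(H, W1, W2)
penalty = redundancy_penalty(A)
```

Because each row of `A` is a distribution over tokens, printing or plotting `A` gives the kind of visualization the abstract mentions: it shows which tokens each row of the embedding draws on.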
Citation impact
- Total citations: 1,471
- FWCI: —
- Percentile: —
- References: 0
Authors: 7
Topics & keywords
Topics
Keywords
- Embedding
- Sentence
- Computer science
- Natural language processing
- Logical consequence
- Artificial intelligence
- Regularization (linguistics)