articlearXiv (Cornell University)Jan 1, 2024GREEN OA

Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks

Bar-Ilan University

Indexed indatacite

Abstract

There is a lot of research interest in encoding variable length sentences into fixed length vectors, in a way that preserves the sentence meanings. Two common methods include representations based on averaging word vectors, and representations based on the hidden states of recurrent neural networks such as LSTMs. The sentence vectors are used as features for subsequent machine learning tasks or for pre-training in the context of deep learning. However, not much is known about the properties that are encoded in these sentence representations and about the language information they capture. We propose a framework that facilitates better understanding of the encoded representations. We define prediction tasks…

Citation impact

243
total citations
FWCI
Percentile
References
0
Citations per year

Authors

1

Topics & keywords

Keywords
  • Sentence
  • Computer science
  • Artificial intelligence
  • Natural language processing
  • Classifier (UML)
  • Word (group theory)
  • Representation (politics)
  • Embedding
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.