Siamese Recurrent Architectures for Learning Sentence Similarity

Massachusetts Institute of Technology · M S Ramaiah University of Applied Sciences

Indexed incrossref

Abstract

We present a siamese adaptation of the Long Short-Term Memory (LSTM) network for labeled data comprised of pairs of variable-length sequences. Our model is applied to assess semantic similarity between sentences, where we exceed state of the art, outperforming carefully handcrafted features and recently proposed neural network systems of greater complexity. For these applications, we provide word-embedding vectors supplemented with synonymic information to the LSTMs, which use a fixed size vector to encode the underlying meaning expressed in a sentence (irrespective of the particular wording/syntax). By restricting subsequent operations to rely on a simple Manhattan metric, we compel the sentence…

Citation impact

927
total citations
FWCI
88.86
Percentile
100%
References
42
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Sentence
  • Artificial intelligence
  • Natural language processing
  • ENCODE
  • Word (group theory)
  • Syntax
  • Metric (unit)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.