Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

HPHamid PalangiLDLi DengYSYelong ShenJGJianfeng GaoXHXiaodong He

University of British Columbia · Microsoft (United States)

Indexed inarxivcrossref

Abstract

This paper develops a model that addresses sentence embedding, a hot topic in current natural language processing research, using recurrent neural networks (RNN) with Long Short-Term Memory (LSTM) cells. The proposed LSTM-RNN model sequentially takes each word in a sentence, extracts its information, and embeds it into a semantic vector. Due to its ability to capture long term memory, the LSTM-RNN accumulates increasingly richer information as it goes through the sentence, and when it reaches the last word, the hidden layer of the network provides a semantic representation of the whole sentence. In this paper, the LSTM-RNN is trained in a weakly supervised manner on user click-through data logged by a…

Citation impact

603
total citations
FWCI
91.45
Percentile
100%
References
45
Citations per year

Authors

8
  • HP
    Hamid PalangiCorresponding

    University of British Columbia

  • LD
    Li Deng

    Microsoft (United States)

  • YS
    Yelong Shen

    Microsoft (United States)

  • JG
    Jianfeng Gao

    Microsoft (United States)

  • XH
    Xiaodong He

    Microsoft (United States)

Topics & keywords

Keywords
  • Embedding
  • Sentence
  • Word embedding
  • Representation (politics)
  • Word (group theory)
  • Similarity (geometry)
  • Process (computing)
  • Semantic similarity
No related works found for this paper.