Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval
University of British Columbia · Microsoft (United States)
Abstract
This paper develops a model that addresses sentence embedding, a hot topic in current natural language processing research, using recurrent neural networks (RNN) with Long Short-Term Memory (LSTM) cells. The proposed LSTM-RNN model sequentially takes each word in a sentence, extracts its information, and embeds it into a semantic vector. Due to its ability to capture long term memory, the LSTM-RNN accumulates increasingly richer information as it goes through the sentence, and when it reaches the last word, the hidden layer of the network provides a semantic representation of the whole sentence. In this paper, the LSTM-RNN is trained in a weakly supervised manner on user click-through data logged by a…
Citation impact
- FWCI
- 91.45
- Percentile
- 100%
- References
- 45
Authors
8- HPHamid PalangiCorresponding
University of British Columbia
- LDLi Deng
Microsoft (United States)
- YSYelong Shen
Microsoft (United States)
- JGJianfeng Gao
Microsoft (United States)
- XHXiaodong He
Microsoft (United States)
Topics & keywords
- Embedding
- Sentence
- Word embedding
- Representation (politics)
- Word (group theory)
- Similarity (geometry)
- Process (computing)
- Semantic similarity