Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies

Université Paris Sciences et Lettres · École des hautes études en sciences sociales · +2 more institutions

Indexed incrossrefdoaj

Abstract

The success of long short-term memory (LSTM) neural networks in language processing is typically attributed to their ability to capture long-distance statistical regularities. Linguistic regularities are often sensitive to syntactic structure; can such dependencies be captured by LSTMs, which do not have explicit structural representations? We begin addressing this question using number agreement in English subject-verb dependencies. We probe the architecture’s grammatical competence both using training objectives with an explicit grammatical target (number prediction, grammaticality judgments) and using language models. In the strongly supervised settings, the LSTM achieved very high overall accuracy (less…

Citation impact

781
total citations
FWCI
102.48
Percentile
100%
References
49
Citations per year

Authors

3

Topics & keywords

Keywords
  • Grammaticality
  • Computer science
  • Natural language processing
  • Syntax
  • Artificial intelligence
  • Language model
  • Parsing
  • Grammar
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.

Funding