Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies
Université Paris Sciences et Lettres · École des hautes études en sciences sociales · +2 more institutions
Abstract
The success of long short-term memory (LSTM) neural networks in language processing is typically attributed to their ability to capture long-distance statistical regularities. Linguistic regularities are often sensitive to syntactic structure; can such dependencies be captured by LSTMs, which do not have explicit structural representations? We begin addressing this question using number agreement in English subject-verb dependencies. We probe the architecture’s grammatical competence both using training objectives with an explicit grammatical target (number prediction, grammaticality judgments) and using language models. In the strongly supervised settings, the LSTM achieved very high overall accuracy (less…
Citation impact
- FWCI
- 102.48
- Percentile
- 100%
- References
- 49
Authors
3- TLTal LinzenCorresponding
Université Paris Sciences et Lettres, École des hautes études en sciences sociales
- EDEmmanuel DupouxCorresponding
Université Paris Sciences et Lettres, École des hautes études en sciences sociales
- YGYoav GoldbergCorresponding
Bar-Ilan University, Laboratoire d'Informatique de Paris-Nord
Topics & keywords
- Grammaticality
- Computer science
- Natural language processing
- Syntax
- Artificial intelligence
- Language model
- Parsing
- Grammar
- Quality Education