preprintJan 1, 2017GOLD OA

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Salesforce (United States) · The University of Tokyo

Indexed incrossref

Abstract

Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a single model. We introduce a joint many-task model together with a strategy for successively growing its depth to solve increasingly complex tasks. Higher layers include shortcut connections to lower-level task predictions to reflect linguistic hierarchies. We use a simple regularization term to allow for optimizing all model weights to improve one task's loss without exhibiting catastrophic interference of the other tasks. Our single end-to-end model obtains…

Citation impact

532
total citations
FWCI
71.45
Percentile
100%
References
50
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Regularization (linguistics)
  • Artificial intelligence
  • Parsing
  • Task (project management)
  • Natural language processing
  • Syntax
  • Multi-task learning
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.