A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Hashimoto, Kazuma; Xiong, Caiming; Tsuruoka, Yoshimasa; Socher, Richard

doi:10.18653/v1/d17-1206

preprintJan 1, 2017GOLD OA

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

KHKazuma Hashimoto CXCaiming Xiong YTYoshimasa Tsuruoka RSRichard Socher

Salesforce (United States) · The University of Tokyo

Indexed incrossref

Abstract

Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a single model. We introduce a joint many-task model together with a strategy for successively growing its depth to solve increasingly complex tasks. Higher layers include shortcut connections to lower-level task predictions to reflect linguistic hierarchies. We use a simple regularization term to allow for optimizing all model weights to improve one task's loss without exhibiting catastrophic interference of the other tasks. Our single end-to-end model obtains…

Citation impact

532

total citations

FWCI: 71.45
Percentile: 100%
References: 50

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Regularization (linguistics)
Artificial intelligence
Parsing
Task (project management)
Natural language processing
Syntax
Multi-task learning

UN Sustainable Development Goals

Quality Education

No related works found for this paper.