A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks
Salesforce (United States) · The University of Tokyo
Abstract
Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a single model. We introduce a joint many-task model together with a strategy for successively growing its depth to solve increasingly complex tasks. Higher layers include shortcut connections to lower-level task predictions to reflect linguistic hierarchies. We use a simple regularization term to allow for optimizing all model weights to improve one task's loss without exhibiting catastrophic interference of the other tasks. Our single end-to-end model obtains…
Citation impact
- FWCI
- 71.45
- Percentile
- 100%
- References
- 50
Authors
4Topics & keywords
- Computer science
- Regularization (linguistics)
- Artificial intelligence
- Parsing
- Task (project management)
- Natural language processing
- Syntax
- Multi-task learning
- Quality Education