A unified architecture for natural language processing

Collobert, Ronan; Weston, Jason

doi:10.1145/1390156.1390177

articleJan 1, 2008Closed access

A unified architecture for natural language processing

RCRonan Collobert JWJason Weston

Princeton University

Indexed incrossref

Abstract

We describe a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, semantic roles, semantically similar words and the likelihood that the sentence makes sense (grammatically and semantically) using a language model. The entire network is trained jointly on all these tasks using weight-sharing, an instance of multitask learning. All the tasks use labeled data except the language model which is learnt from unlabeled text and represents a novel form of semi-supervised learning for the shared tasks. We show how both multitask learning and semi-supervised learning improve the generalization of the…

Citation impact

5,200

total citations

FWCI: 66.38
Percentile: 100%
References: 24

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Natural language processing
Artificial intelligence
Sentence
Multi-task learning
Generalization
Convolutional neural network
Natural language understanding

UN Sustainable Development Goals

Quality Education

No related works found for this paper.