articleJan 1, 2008Closed access

A unified architecture for natural language processing

Princeton University

Indexed incrossref

Abstract

We describe a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity tags, semantic roles, semantically similar words and the likelihood that the sentence makes sense (grammatically and semantically) using a language model. The entire network is trained jointly on all these tasks using weight-sharing, an instance of multitask learning. All the tasks use labeled data except the language model which is learnt from unlabeled text and represents a novel form of semi-supervised learning for the shared tasks. We show how both multitask learning and semi-supervised learning improve the generalization of the…

Citation impact

5,200
total citations
FWCI
66.38
Percentile
100%
References
24
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Natural language processing
  • Artificial intelligence
  • Sentence
  • Multi-task learning
  • Generalization
  • Convolutional neural network
  • Natural language understanding
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.