articleJul 28, 2017Closed access

Deep Learning for Extreme Multi-label Text Classification

Carnegie Mellon University

Indexed incrossref

Abstract

Extreme multi-label text classification (XMTC) refers to the problem of assigning to each document its most relevant subset of class labels from an extremely large label collection, where the number of labels could reach hundreds of thousands or millions. The huge label space raises research challenges such as data sparsity and scalability. Significant progress has been made in recent years by the development of new machine learning methods, such as tree induction with large-margin partitions of the instance spaces and label-vector embedding in the target space. However, deep learning has not been explored for XMTC, despite its big successes in other related areas. This paper presents the first attempt at…

Citation impact

657
total citations
FWCI
45.16
Percentile
100%
References
56
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Margin (machine learning)
  • Artificial intelligence
  • Multi-label classification
  • Convolutional neural network
  • Deep learning
  • Scalability
  • Benchmark (surveying)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.

Funding