reviewACM Computing SurveysMar 1, 2002GREEN OA

Machine learning in automated text categorization

Consorzio Pisa Ricerche

Indexed inarxivcrossref

Abstract

The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last 10 years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this problem is based on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of preclassified documents, the characteristics of the categories. The advantages of this approach over the knowledge engineering approach (consisting in the manual definition of a classifier by domain experts) are a very good effectiveness, considerable savings in terms of expert…

Citation impact

7,883
total citations
FWCI
291.63
Percentile
100%
References
204
Citations per year

Authors

1

Topics & keywords

Keywords
  • Computer science
  • Categorization
  • Software portability
  • Artificial intelligence
  • Classifier (UML)
  • Machine learning
  • Text categorization
  • Natural language processing
UN Sustainable Development Goals
  • Decent work and economic growth
No related works found for this paper.