articleJul 19, 2010GREEN OA

Short text classification in twitter to improve information filtering

The Ohio State University · University at Buffalo, State University of New York

Indexed incrossref

Abstract

In microblogging services such as Twitter, the users may become overwhelmed by the raw data. One solution to this problem is the classification of short text messages. As short texts do not provide sufficient word occurrences, traditional classification methods such as "Bag-Of-Words" have limitations. To address this problem, we propose to use a small set of domain-specific features extracted from the author's profile and text. The proposed approach effectively classifies the text to a predefined set of generic classes such as News, Events, Opinions, Deals, and Private Messages.

Citation impact

729
total citations
FWCI
88.56
Percentile
100%
References
17
Citations per year

Authors

5

Topics & keywords

Keywords
  • Microblogging
  • Computer science
  • Social media
  • Set (abstract data type)
  • Information retrieval
  • Word (group theory)
  • Domain (mathematical analysis)
  • Artificial intelligence
No related works found for this paper.