articleJul 19, 2010GREEN OA
Short text classification in twitter to improve information filtering
The Ohio State University · University at Buffalo, State University of New York
Indexed incrossref
Abstract
In microblogging services such as Twitter, the users may become overwhelmed by the raw data. One solution to this problem is the classification of short text messages. As short texts do not provide sufficient word occurrences, traditional classification methods such as "Bag-Of-Words" have limitations. To address this problem, we propose to use a small set of domain-specific features extracted from the author's profile and text. The proposed approach effectively classifies the text to a predefined set of generic classes such as News, Events, Opinions, Deals, and Private Messages.
Citation impact
729
total citations
- FWCI
- 88.56
- Percentile
- 100%
- References
- 17
Citations per year
Authors
5Topics & keywords
Topics
Keywords
- Microblogging
- Computer science
- Social media
- Set (abstract data type)
- Information retrieval
- Word (group theory)
- Domain (mathematical analysis)
- Artificial intelligence
No related works found for this paper.