Building text classifiers using positive and unlabeled examples
University of Illinois Chicago · Bioengineering Center · +4 more institutions
Abstract
We study the problem of building text classifiers using positive and unlabeled examples. The key feature of this problem is that there is no negative example for learning. Recently, a few techniques for solving this problem were proposed in the literature. These techniques are based on the same idea, which builds a classifier in two steps. Each existing technique uses a different method for each step. We first introduce some new methods for the two steps, and perform a comprehensive evaluation of all possible combinations of methods of the two steps. We then propose a more principled approach to solving the problem based on a biased formulation of SVM, and show experimentally that it is more accurate than the…
Citation impact
- FWCI
- 26.36
- Percentile
- 100%
- References
- 55
Authors
5- BLB. LiuCorresponding
University of Illinois Chicago
- YDY. Dai
Bioengineering Center, University of Illinois Chicago
- XLXiaoli Li
National University of Singapore, Singapore-MIT Alliance for Research and Technology
- WLW.S. Lee
Singapore-MIT Alliance for Research and Technology, National University of Singapore
- PSPhilip S. Yu
IBM Research - Thomas J. Watson Research Center, IBM (United States)
Topics & keywords
- Computer science
- Artificial intelligence
- Classifier (UML)
- Machine learning
- Support vector machine
- Key (lock)
- Pattern recognition (psychology)
- Quality Education