Article · Technometrics · Jul 19, 2007 · Closed access

Large-Scale Bayesian Logistic Regression for Text Categorization

Rutgers, The State University of New Jersey · Center for Discrete Mathematics and Theoretical Computer Science · +1 more institution

Indexed in Crossref

Abstract

Logistic regression analysis of high-dimensional data, such as natural language text, poses computational and statistical challenges. Maximum likelihood estimation often fails in these applications. We present a simple Bayesian logistic regression approach that uses a Laplace prior to avoid overfitting and produces sparse predictive models for text data. We apply this approach to a range of document classification problems and show that it produces compact predictive models at least as effective as those produced by support vector machine classifiers or ridge logistic regression combined with feature selection. We describe our model fitting algorithm, our open source implementations (BBR and BMR), and…
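The sparsity claim in the abstract can be read this way: with a zero-mean Laplace prior on each coefficient, the posterior mode coincides with L1-penalized maximum likelihood, and the L1 penalty sets many coefficients to exactly zero. The block below is a minimal sketch under that reading, fitted by proximal gradient descent on synthetic data. It is not the paper's model fitting algorithm and is unrelated to the BBR and BMR code. The function names, the parameters `lam`, `step`, and `iters`, and the toy data are all assumptions made for illustration, and the sketch omits an intercept.

```python
import math
import random


def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold; values inside the band become 0."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def fit_laplace_map(X, y, lam=0.1, step=0.5, iters=300):
    """Posterior-mode estimate under a Laplace prior (hypothetical helper).

    Minimizes mean log-loss + lam * sum(|w_j|) by proximal gradient descent.
    X is a list of feature rows, y a list of 0/1 labels. No intercept term.
    """
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(iters):
        # Gradient of the mean log-loss at the current weights.
        grad = [0.0] * d
        for xi, yi in zip(X, y):
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)))
            for j in range(d):
                grad[j] += (p - yi) * xi[j] / n
        # Gradient step, then the proximal step for the L1 penalty.
        w = [soft_threshold(wj - step * gj, step * lam)
             for wj, gj in zip(w, grad)]
    return w


# Synthetic data (assumption): only feature 0 carries signal.
rng = random.Random(0)
X = [[rng.gauss(0, 1) for _ in range(5)] for _ in range(200)]
y = [1 if row[0] > 0 else 0 for row in X]

w = fit_laplace_map(X, y)
nonzero = [j for j, wj in enumerate(w) if wj != 0.0]
print("weights:", [round(wj, 3) for wj in w])
print("nonzero features:", nonzero)
```

The soft-thresholding step is where exact zeros come from: any coefficient whose gradient update lands within `step * lam` of zero is set to zero rather than merely shrunk. A larger `lam` corresponds to a tighter prior and a sparser model.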

Citation impact

  • Total citations: 818
  • FWCI: 74.01
  • Percentile: 100%
  • References: 67

Authors

3

Topics & keywords

Keywords
  • Overfitting
  • Logistic model tree
  • Logistic regression
  • Computer science
  • Artificial intelligence
  • Feature selection
  • Bayesian probability
  • Machine learning

Funding