bookThe MIT Press eBooksDec 12, 2008Closed access

Dataset Shift in Machine Learning

Indexed incrossref

Abstract

An overview of recent efforts in the machine learning community to deal with dataset and covariate shift, which occurs when test and training inputs and outputs have different distributions. Dataset shift is a common problem in predictive modeling that occurs when the joint distribution of inputs and outputs differs between training and test stages. Covariate shift, a particular case of dataset shift, occurs when only the input distribution changes. Dataset shift is present in most practical applications, for reasons ranging from the bias introduced by experimental design to the irreproducibility of the testing conditions at training time. (An example is -email spam filtering, which may fail to recognize spam…

Citation impact

1,506
total citations
FWCI
9.36
Percentile
100%
References
154
Citations per year

Topics & keywords

Keywords
  • Artificial intelligence
  • Machine learning
  • Covariate
  • Computer science
  • Bayesian probability
  • Paradigm shift
  • Filter (signal processing)
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.

Funding