Domain Adaptation for Statistical Classifiers

Daumé, Hal; Marcu, Daniel

doi:10.1613/jair.1872

articleJournal of Artificial Intelligence ResearchJun 21, 2006DIAMOND OA

Domain Adaptation for Statistical Classifiers

HDHal Daumé DMDaniel Marcu

University of Southern California

Indexed inarxivcrossrefdoaj

Abstract

The most basic assumption used in statistical learning theory is that training data and test data are drawn from the same underlying distribution. Unfortunately, in many applications, the "in-domain" test data is drawn from a distribution that is related, but not identical, to the "out-of-domain" distribution of the training data. We consider the common case in which labeled out-of-domain data is plentiful, but labeled in-domain data is scarce. We introduce a statistical formulation of this problem in terms of a simple mixture model and present an instantiation of this framework to maximum entropy classifiers and their linear chain counterparts. We present efficient inference algorithms for this special case…

Citation impact

830

total citations

FWCI: 43.81
Percentile: 100%
References: 44

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Conditional probability distribution
Artificial intelligence
Domain (mathematical analysis)
Machine learning
Domain adaptation
Statistical inference
Test data

UN Sustainable Development Goals

Quality Education

No related works found for this paper.