Self-taught learning

Raina, Rajat; Battle, Alexis; Lee, Honglak; Packer, Benjamin; Ng, Andrew Y.

doi:10.1145/1273496.1273592

articleJun 20, 2007Closed access

Self-taught learning

RRRajat Raina ABAlexis Battle HLHonglak Lee BPBenjamin Packer AYAndrew Y. Ng

Stanford University

Indexed incrossref

Abstract

We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabeled data follows the same class labels or generative distribution as the labeled data. Thus, we would like to use a large number of unlabeled images (or audio samples, or text documents) randomly downloaded from the Internet to improve performance on a given image (or audio, or text) classification task. Such unlabeled data is significantly easier to obtain than in typical semi-supervised or transfer learning settings, making self-taught learning widely applicable to many practical learning problems. We describe an approach to self-taught…

Citation impact

1,532

total citations

FWCI: 51.08
Percentile: 100%
References: 27

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Semi-supervised learning
Labeled data
Kernel (algebra)
Machine learning
Support vector machine
Construct (python library)

UN Sustainable Development Goals

Quality Education

No related works found for this paper.