Discovering objects and their location in images

Šivic, Josef; Russell, Bryan; Efros, Alexei A.; Zisserman, Andrew; Freeman, William T.

doi:10.1109/iccv.2005.77

articleJan 1, 2005GREEN OA

Discovering objects and their location in images

JŠJosef Šivic BRBryan Russell AAAlexei A. Efros AZAndrew Zisserman WTWilliam T. Freeman

University of Oxford · Massachusetts Institute of Technology · +1 more institution

Indexed incrossrefdatacite

Abstract

We seek to discover the object categories depicted in a set of unlabelled images. We achieve this using a model developed in the statistical text literature: probabilistic latent semantic analysis (pLSA). In text analysis, this is used to discover topics in a corpus using the bag-of-words document representation. Here we treat object categories as topics, so that an image containing instances of several categories is modeled as a mixture of topics. The model is applied to images by using a visual analogue of a word, formed by vector quantizing SIFT-like region descriptors. The topic discovery approach successfully translates to the visual domain: for a small set of objects, we show that both the object…

Citation impact

984

total citations

FWCI: 58.22
Percentile: 100%
References: 37

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Artificial intelligence
Computer science
Probabilistic latent semantic analysis
Pattern recognition (psychology)
Vocabulary
Object (grammar)
Set (abstract data type)
Bag-of-words model in computer vision

UN Sustainable Development Goals

Quality Education

No related works found for this paper.