Discovering objects and their location in images
University of Oxford · Massachusetts Institute of Technology · +1 more institution
Abstract
We seek to discover the object categories depicted in a set of unlabelled images. We achieve this using a model developed in the statistical text literature: probabilistic latent semantic analysis (pLSA). In text analysis, this is used to discover topics in a corpus using the bag-of-words document representation. Here we treat object categories as topics, so that an image containing instances of several categories is modeled as a mixture of topics. The model is applied to images by using a visual analogue of a word, formed by vector quantizing SIFT-like region descriptors. The topic discovery approach successfully translates to the visual domain: for a small set of objects, we show that both the object…
Citation impact
- FWCI
- 58.22
- Percentile
- 100%
- References
- 37
Authors
5Topics & keywords
- Artificial intelligence
- Computer science
- Probabilistic latent semantic analysis
- Pattern recognition (psychology)
- Vocabulary
- Object (grammar)
- Set (abstract data type)
- Bag-of-words model in computer vision
- Quality Education