Multiple Bernoulli relevance models for image and video annotation
University of Massachusetts Amherst
Abstract
Retrieving images in response to textual queries requires some knowledge of the semantics of the picture. Here, we show how we can do both automatic image annotation and retrieval (using one word queries) from images and videos using a multiple Bernoulli relevance model. The model assumes that a training set of images or videos along with keyword annotations is provided. Multiple keywords are provided for an image and the specific correspondence between a keyword and an image is not provided. Each image is partitioned into a set of rectangular regions and a real-valued feature vector is computed over these regions. The relevance model is a joint probability distribution of the word annotations and the image…
Citation impact
- FWCI
- 29.52
- Percentile
- 100%
- References
- 21
Authors
3Topics & keywords
- Computer science
- Artificial intelligence
- Feature (linguistics)
- Automatic image annotation
- Pattern recognition (psychology)
- Image retrieval
- Relevance (law)
- Set (abstract data type)