Image retrieval using scene graphs
Stanford University · Max Planck Institute for Informatics · +2 more institutions
Abstract
This paper develops a novel framework for semantic image retrieval based on the notion of a scene graph. Our scene graphs represent objects (“man”, “boat”), attributes of objects (“boat is white”) and relationships between objects (“man standing on boat”). We use these scene graphs as queries to retrieve semantically related images. To this end, we design a conditional random field model that reasons about possible groundings of scene graphs to test images. The likelihoods of these groundings are used as ranking scores for retrieval. We introduce a novel dataset of 5,000 human-generated scene graphs grounded to images and use this dataset to evaluate our method for image retrieval. In particular, we evaluate…
Citation impact
- FWCI
- 34.07
- Percentile
- 100%
- References
- 87
Authors
7Topics & keywords
- Computer science
- Artificial intelligence
- Computer vision
- Image retrieval
- Image (mathematics)
- Information retrieval