Multi-Label Image Recognition With Graph Convolutional Networks
Nanjing University · Megvii (China) · +1 more institution
Abstract
The task of multi-label image recognition is to predict a set of object labels that present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve the recognition performance. To capture and explore such important dependencies, we propose a multi-label classification model based on Graph Convolutional Network (GCN). The model builds a directed graph over the object labels, where each node (label) is represented by word embeddings of a label, and GCN is learned to map this label graph into a set of inter-dependent object classifiers. These classifiers are applied to the image descriptors extracted by another sub-net, enabling the whole network to be…
Citation impact
- FWCI
- 85.60
- Percentile
- 100%
- References
- 52
Authors
4Topics & keywords
- Computer science
- Pattern recognition (psychology)
- Artificial intelligence
- Graph
- Multi-label classification
- Visualization
- Cognitive neuroscience of visual object recognition
- Convolutional neural network