Is object localization for free? - Weakly-supervised learning with convolutional neural networks
Institut national de recherche en sciences et technologies du numérique · École Normale Supérieure - PSL · +2 more institutions
Abstract
Successful methods for visual object recognition typically rely on training datasets containing lots of richly annotated images. Detailed image annotation, e.g. by object bounding boxes, however, is both expensive and often subjective. We describe a weakly supervised convolutional neural network (CNN) for object classification that relies only on image-level labels, yet can learn from cluttered scenes containing multiple objects. We quantify its object classification and object location prediction performance on the Pascal VOC 2012 (20 object classes) and the much larger Microsoft COCO (80 object classes) datasets. We find that the network (i) outputs accurate image-level labels, (ii) predicts approximate…
Citation impact
- FWCI
- 65.13
- Percentile
- 100%
- References
- 94
Authors
4- MOMaxime OquabCorresponding
Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure - PSL, Département d'Informatique
- LBLéon Bottou
Meta (United States)
- ILIvan Laptev
Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure - PSL, Département d'Informatique
- JŠJosef Šivic
Institut national de recherche en sciences et technologies du numérique, École Normale Supérieure - PSL, Département d'Informatique
Topics & keywords
- Pascal (unit)
- Artificial intelligence
- Convolutional neural network
- Computer science
- Bounding overwatch
- Minimum bounding box
- Object (grammar)
- Annotation