Rethinking Classification and Localization for Object Detection
Universidad del Noreste · Microsoft Research (United Kingdom)
Abstract
Two head structures (i.e. fully connected head and convolution head) have been widely used in R-CNN based detectors for classification and localization tasks. However, there is a lack of understanding of how does these two head structures work for these two tasks. To address this issue, we perform a thorough analysis and find an interesting fact that the two head structures have opposite preferences towards the two tasks. Specifically, the fully connected head (fc-head) is more suitable for the classification task, while the convolution head (conv-head) is more suitable for the localization task. Furthermore, we examine the output feature maps of both heads and find that fc-head has more spatial sensitivity…
Citation impact
- FWCI
- 41.10
- Percentile
- 100%
- References
- 74
Authors
7Topics & keywords
- Head (geology)
- Computer science
- Convolution (computer science)
- Artificial intelligence
- Object detection
- Object (grammar)
- Feature (linguistics)
- Pattern recognition (psychology)