The application of two-level attention models in deep convolutional neural network for fine-grained image classification
Peking University · University of Computer Studies Yangon · +4 more institutions
Abstract
Fine-grained classification is challenging because categories can only be discriminated by subtle and local differences. Variances in the pose, scale or rotation usually make the problem more difficult. Most fine-grained classification systems follow the pipeline of finding foreground object or object parts (where) to extract discriminative features (what). In this paper, we propose to apply visual attention to fine-grained classification task using deep neural network. Our pipeline integrates three types of attention: the bottom-up attention that propose candidate patches, the object-level top-down attention that selects relevant patches to a certain object, and the part-level top-down attention that…
Citation impact
- FWCI
- 41.18
- Percentile
- 100%
- References
- 40
Authors
6- TXTianjun XiaoCorresponding
Peking University, University of Computer Studies Yangon
- YXYichong Xu
Microsoft (United States), Microsoft Research Asia (China)
- KYKuiyuan Yang
Microsoft Research Asia (China), Microsoft (United States)
- JZJiaxing Zhang
Microsoft Research Asia (China), Microsoft (United States)
- YPYuxin Peng
University of Computer Studies Yangon, Peking University
Topics & keywords
- Computer science
- Discriminative model
- Pipeline (software)
- Artificial intelligence
- Minimum bounding box
- Convolutional neural network
- Object (grammar)
- Bounding overwatch
- Reduced inequalities