CCNet: Criss-Cross Attention for Semantic Segmentation
Huazhong University of Science and Technology · Horizon Robotics (China) · +2 more institutions
Abstract
Full-image dependencies provide useful contextual information to benefit visual understanding problems. In this work, we propose a Criss-Cross Network (CCNet) for obtaining such contextual information in a more effective and efficient way. Concretely, for each pixel, a novel criss-cross attention module in CCNet harvests the contextual information of all the pixels on its criss-cross path. By taking a further recurrent operation, each pixel can finally capture the full-image dependencies from all pixels. Overall, CCNet is with the following merits: 1) GPU memory friendly. Compared with the non-local block, the proposed recurrent criss-cross attention module requires 11x less GPU memory usage. 2) High…
Citation impact
- FWCI
- 135.54
- Percentile
- 100%
- References
- 69
Authors
6Topics & keywords
- Computer science
- Benchmark (surveying)
- Pixel
- Block (permutation group theory)
- Code (set theory)
- Artificial intelligence
- Segmentation
- Set (abstract data type)
- Sustainable cities and communities