Deep High-Resolution Representation Learning for Visual Recognition
Microsoft Research Asia (China) · University of Science and Technology of China · +5 more institutions
Abstract
High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork that is formed by connecting high-to-low resolution convolutions in series (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named as High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. There are two key characteristics: (i) Connect the high-to-low resolution convolution streams…
Citation impact
- FWCI
- 212.84
- Percentile
- 100%
- References
- 210
Authors
12Topics & keywords
- Subnetwork
- Computer science
- Artificial intelligence
- Representation (politics)
- Segmentation
- Computer vision
- ENCODE
- Pattern recognition (psychology)