Deep High-Resolution Representation Learning for Visual Recognition

Wang, Jingdong; Sun, Ke; Cheng, Tianheng; Jiang, Borui; Deng, Chaorui; Zhao, Yang; Liu, Dong; Mu, Yadong; Tan, Mingkui; Wang, Xinggang; Liu, Wenyu; Xiao, Bin

doi:10.1109/tpami.2020.2983686

articleIEEE Transactions on Pattern Analysis and Machine IntelligenceApr 1, 2020Closed access

Deep High-Resolution Representation Learning for Visual Recognition

JWJingdong Wang KSKe Sun TCTianheng Cheng BJBorui Jiang CDChaorui Deng

Microsoft Research Asia (China) · University of Science and Technology of China · +5 more institutions

PubMed

Indexed incrossrefpubmed

Abstract

High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork that is formed by connecting high-to-low resolution convolutions in series (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named as High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. There are two key characteristics: (i) Connect the high-to-low resolution convolution streams…

Citation impact

4,490

total citations

FWCI: 212.84
Percentile: 100%
References: 210

Citations per year

Authors

12

Topics & keywords

Topics

Keywords

Subnetwork
Computer science
Artificial intelligence
Representation (politics)
Segmentation
Computer vision
ENCODE
Pattern recognition (psychology)

No related works found for this paper.