Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

Xiong, Yuwen; Li, Zhiqi; Chen, Yuntao; Wang, Feng; Zhu, Xizhou; Luo, Jiapeng; Wang, Wenhai; Lü, Tong; Li, Hongsheng; Qiao, Yu; Lu, Lewei; Zhou, Jie; Dai, Yifeng

doi:10.1109/cvpr52733.2024.00540

articleJun 16, 2024Closed access

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

YXYuwen Xiong ZLZhiqi Li YCYuntao Chen FWFeng Wang XZXizhou Zhu

University of Toronto · Nanjing University · +4 more institutions

Indexed incrossref

Abstract

We introduce Deformable Convolution v4 (DCNv4), a highly efficient and effective operator designed for a broad spectrum of vision applications. DCNv4 addresses the limitations of its predecessor, DCNv3, with two key enhancements: 1. removing softmax normalization in spatial aggregation to enhance its dynamic property and expressive power and 2. optimizing memory access to minimize redundant operations for speedup. These improvements result in a significantly faster convergence compared to DCNv3 and a substantial increase in processing speed, with DCNv4 achieving more than three times the forward speed. DCNv4 demonstrates exceptional performance across various tasks, including image classification, instance and…

Citation impact

212

total citations

FWCI: 47.39
Percentile: 100%
References: 50

Citations per year

Authors

13

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Computer vision
Operator (biology)
Chemistry

No related works found for this paper.