Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications
University of Toronto · Nanjing University · +4 more institutions
Abstract
We introduce Deformable Convolution v4 (DCNv4), a highly efficient and effective operator designed for a broad spectrum of vision applications. DCNv4 addresses the limitations of its predecessor, DCNv3, with two key enhancements: 1. removing softmax normalization in spatial aggregation to enhance its dynamic property and expressive power and 2. optimizing memory access to minimize redundant operations for speedup. These improvements result in a significantly faster convergence compared to DCNv3 and a substantial increase in processing speed, with DCNv4 achieving more than three times the forward speed. DCNv4 demonstrates exceptional performance across various tasks, including image classification, instance and…
Citation impact
- FWCI
- 47.39
- Percentile
- 100%
- References
- 50
Authors
13Topics & keywords
- Computer science
- Artificial intelligence
- Computer vision
- Operator (biology)
- Chemistry