articleJun 16, 2024Closed access

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

University of Toronto · Nanjing University · +4 more institutions

Indexed incrossref

Abstract

We introduce Deformable Convolution v4 (DCNv4), a highly efficient and effective operator designed for a broad spectrum of vision applications. DCNv4 addresses the limitations of its predecessor, DCNv3, with two key enhancements: 1. removing softmax normalization in spatial aggregation to enhance its dynamic property and expressive power and 2. optimizing memory access to minimize redundant operations for speedup. These improvements result in a significantly faster convergence compared to DCNv3 and a substantial increase in processing speed, with DCNv4 achieving more than three times the forward speed. DCNv4 demonstrates exceptional performance across various tasks, including image classification, instance and…

Citation impact

212
total citations
FWCI
47.39
Percentile
100%
References
50
Citations per year

Authors

13

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Computer vision
  • Operator (biology)
  • Chemistry
No related works found for this paper.