articleJun 1, 2020Closed access

Dynamic Convolution: Attention Over Convolution Kernels

Microsoft Research (United Kingdom)

Indexed incrossref

Abstract

Light-weight convolutional neural networks (CNNs) suffer performance degradation as their low computational budgets constrain both the depth (number of convolution layers) and the width (number of channels) of CNNs, resulting in limited representation capability. To address this issue, we present Dynamic Convolution, a new design that increases model complexity without increasing the network depth or width. Instead of using a single convolution kernel per layer, dynamic convolution aggregates multiple parallel convolution kernels dynamically based upon their attentions, which are input dependent. Assembling multiple kernels is not only computationally efficient due to the small kernel size, but also has more…

Citation impact

1,342
total citations
FWCI
55.91
Percentile
100%
References
72
Citations per year

Authors

6

Topics & keywords

Keywords
  • Convolution (computer science)
  • FLOPS
  • Kernel (algebra)
  • Computer science
  • Convolutional neural network
  • Representation (politics)
  • Algorithm
  • Computational complexity theory
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.