Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution
National University of Singapore · Meta (Israel)
Abstract
In natural images, information is conveyed at different frequencies where higher frequencies are usually encoded with fine details and lower frequencies are usually encoded with global structures. Similarly, the output feature maps of a convolution layer can also be seen as a mixture of information at different frequencies. In this work, we propose to factorize the mixed feature maps by their frequencies, and design a novel Octave Convolution (OctConv) operation to store and process feature maps that vary spatially “slower” at a lower spatial resolution reducing both memory and computation cost. Unlike existing multi-scale methods, OctConv is formulated as a single, generic, plug-and-play convolutional unit…
Citation impact
- FWCI
- 35.82
- Percentile
- 100%
- References
- 79
Authors
8Topics & keywords
- Octave (electronics)
- Convolutional neural network
- Convolution (computer science)
- Redundancy (engineering)
- Computer science
- Artificial neural network
- Speech recognition
- Acoustics