Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Hong Kong University of Science and Technology · Rutgers Sexual and Reproductive Health and Rights
Abstract
To design fast neural networks, many works have been focusing on reducing the number of floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does not necessarily lead to a similar level of re-duction in latency. This mainly stems from inefficiently low floating-point operations per second (FLOPS). To achieve faster networks, we revisit popular operators and demonstrate that such low FLOPS is mainly due to frequent memory access of the operators, especially the depthwise con-volution. We hence propose a novel partial convolution (PConv) that extracts spatial features more efficiently, by cutting down redundant computation and memory access simultaneously. Building upon our PConv,…
Citation impact
- FWCI
- 231.04
- Percentile
- 100%
- References
- 118
Authors
7- JCJierun ChenCorresponding
Hong Kong University of Science and Technology
- SKShiu-hong Kao
Hong Kong University of Science and Technology
- HHHao He
Hong Kong University of Science and Technology
- WZWeipeng Zhuo
Hong Kong University of Science and Technology
- WSWen Song
Rutgers Sexual and Reproductive Health and Rights
Topics & keywords
- FLOPS
- Computer science
- Latency (audio)
- Parallel computing
- Reduction (mathematics)
- Throughput
- Computation
- Code (set theory)
- Affordable and clean energy