articleJun 1, 2023Closed access

Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

Hong Kong University of Science and Technology · Rutgers Sexual and Reproductive Health and Rights

Indexed incrossref

Abstract

To design fast neural networks, many works have been focusing on reducing the number of floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does not necessarily lead to a similar level of re-duction in latency. This mainly stems from inefficiently low floating-point operations per second (FLOPS). To achieve faster networks, we revisit popular operators and demonstrate that such low FLOPS is mainly due to frequent memory access of the operators, especially the depthwise con-volution. We hence propose a novel partial convolution (PConv) that extracts spatial features more efficiently, by cutting down redundant computation and memory access simultaneously. Building upon our PConv,…

Citation impact

2,033
total citations
FWCI
231.04
Percentile
100%
References
118
Citations per year

Authors

7

Topics & keywords

Keywords
  • FLOPS
  • Computer science
  • Latency (audio)
  • Parallel computing
  • Reduction (mathematics)
  • Throughput
  • Computation
  • Code (set theory)
UN Sustainable Development Goals
  • Affordable and clean energy
No related works found for this paper.

Funding