Efficient Processing of Deep Neural Networks: A Tutorial and Survey

Sze, Vivienne; Chen, Yu‐Hsin; Yang, Tien-Ju; Emer, Joel

doi:10.1109/jproc.2017.2761740

Article · Proceedings of the IEEE · Nov 20, 2017 · Closed access

Massachusetts Institute of Technology · Nvidia (United States)

Indexed in Crossref

Abstract

Deep neural networks (DNNs) are widely used for many artificial intelligence (AI) applications, including computer vision, speech recognition, and robotics. While DNNs deliver state-of-the-art accuracy on many AI tasks, this accuracy comes at the cost of high computational complexity. Accordingly, techniques that enable efficient processing of DNNs to improve energy efficiency and throughput without sacrificing application accuracy or increasing hardware cost are critical to the wide deployment of DNNs in AI systems. This article aims to provide a comprehensive tutorial and survey of recent advances toward the goal of enabling efficient processing of DNNs. Specifically, it will provide an overview of…

Citation impact

Total citations: 3,960

FWCI: 113.12
Percentile: 100%
References: 200

Authors: 4

Topics & keywords

Keywords

Computer science
Benchmarking
Key (lock)
Field (mathematics)
Computer architecture
Artificial intelligence
Software deployment
Computer engineering

UN Sustainable Development Goals

Affordable and clean energy
