articleOct 1, 2016Closed access

Fused-layer CNN accelerators

Stony Brook University

Indexed incrossref

Abstract

Deep convolutional neural networks (CNNs) are rapidly becoming the dominant approach to computer vision and a major component of many other pervasive machine learning tasks, such as speech recognition, natural language processing, and fraud detection. As a result, accelerators for efficiently evaluating CNNs are rapidly growing in popularity. The conventional approaches to designing such CNN accelerators is to focus on creating accelerators to iteratively process the CNN layers. However, by processing each layer to completion, the accelerator designs must use off-chip memory to store intermediate data between layers, because the intermediate data are too large to fit on chip. In this work, we observe that a…

Citation impact

526
total citations
FWCI
22.47
Percentile
100%
References
25
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Convolutional neural network
  • Field-programmable gate array
  • Deep learning
  • Dataflow
  • Process (computing)
  • Virtex
  • Chip
No related works found for this paper.