articleOct 1, 2016Closed access
Fused-layer CNN accelerators
Indexed incrossref
Abstract
Deep convolutional neural networks (CNNs) are rapidly becoming the dominant approach to computer vision and a major component of many other pervasive machine learning tasks, such as speech recognition, natural language processing, and fraud detection. As a result, accelerators for efficiently evaluating CNNs are rapidly growing in popularity. The conventional approaches to designing such CNN accelerators is to focus on creating accelerators to iteratively process the CNN layers. However, by processing each layer to completion, the accelerator designs must use off-chip memory to store intermediate data between layers, because the intermediate data are too large to fit on chip. In this work, we observe that a…
Citation impact
526
total citations
- FWCI
- 22.47
- Percentile
- 100%
- References
- 25
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Computer science
- Convolutional neural network
- Field-programmable gate array
- Deep learning
- Dataflow
- Process (computing)
- Virtex
- Chip
No related works found for this paper.