articleFeb 4, 2016Closed access

Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks

Arizona State University · American Rock Mechanics Association

Indexed incrossref

Abstract

Convolutional Neural Networks (CNNs) have gained popularity in many computer vision applications such as image classification, face detection, and video analysis, because of their ability to train and classify with high accuracy. Due to multiple convolution and fully-connected layers that are compute-/memory-intensive, it is difficult to perform real-time classification with low power consumption on today?s computing systems. FPGAs have been widely explored as hardware accelerators for CNNs because of their reconfigurability and energy efficiency, as well as fast turn-around-time, especially with high-level synthesis methodologies. Previous FPGA-based CNN accelerators, however, typically implemented generic…

Citation impact

551
total citations
FWCI
44.23
Percentile
100%
References
25
Citations per year

Authors

8

Topics & keywords

Keywords
  • Computer science
  • Field-programmable gate array
  • Reconfigurability
  • Convolutional neural network
  • Stratix
  • Throughput
  • Embedded system
  • MPSoC
UN Sustainable Development Goals
  • Affordable and clean energy
No related works found for this paper.

Funding