Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks
Peking University · University of California, Los Angeles
Abstract
Convolutional neural network (CNN) has been widely employed for image recognition because it can achieve high accuracy by emulating behavior of optic nerves in living creatures. Recently, rapid growth of modern applications based on deep learning algorithms has further improved research and implementations. Especially, various accelerators for deep CNN have been proposed based on FPGA platform because it has advantages of high performance, reconfigurability, and fast development round, etc. Although current FPGA accelerators have demonstrated better performance over generic processors, the accelerator design space has not been well exploited. One critical problem is that the computation throughput may not well…
Citation impact
- FWCI
- 125.29
- Percentile
- 100%
- References
- 17
Authors
6Topics & keywords
- Reconfigurability
- Computer science
- Field-programmable gate array
- Convolutional neural network
- Memory bandwidth
- Scalability
- Deep learning
- Throughput