Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices
Massachusetts Institute of Technology · Nvidia (United States)
Abstract
A recent trend in deep neural network (DNN) development is to extend the reach of deep learning applications to platforms that are more resource and energy-constrained, e.g., mobile devices. These endeavors aim to reduce the DNN model size and improve the hardware processing efficiency and have resulted in DNNs that are much more compact in their structures and/or have high data sparsity. These compact or sparse models are different from the traditional large ones in that there is much more variation in their layer shapes and sizes and often require specialized hardware to exploit sparsity for performance improvement. Therefore, many DNN accelerators designed for large DNNs do not perform well on these models.…
Citation impact
- FWCI
- 54.09
- Percentile
- 100%
- References
- 56
Authors
4Topics & keywords
- Computer science
- Throughput
- Computation
- Artificial neural network
- Exploit
- Process (computing)
- Bandwidth (computing)
- Mobile device
- Affordable and clean energy