Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow
University of British Columbia
Abstract
Recent advances in graphics processing units (GPUs) have resulted in massively parallel hardware that is easily programmable and widely available in commodity desktop computer systems. GPUs typically use single-instruction, multiple-data (SIMD) pipelines to achieve high performance with minimal overhead incurred by control hardware. Scalar threads are grouped together into SIMD batches, sometimes referred to as warps. While SIMD is ideally suited for simple programs, recent GPUs include control flow instructions in the GPU instruction set architecture and programs using these instructions may experience reduced performance due to the way branch execution is supported by hardware. One approach is to add a stack…
Citation impact
- FWCI
- 28.65
- Percentile
- 100%
- References
- 60
Authors
4Topics & keywords
- Computer science
- SIMD
- Parallel computing
- Control flow
- Instruction set
- Overhead (engineering)
- Scheduling (production processes)
- Graphics