Article · Proceedings of the IEEE · Jan 24, 2005 · Closed access

The Design and Implementation of FFTW3

IBM Research - Austin · IBM (United States) · +1 more institution

Indexed in Crossref

Abstract

FFTW is an implementation of the discrete Fourier transform (DFT) that adapts to the hardware in order to maximize performance. This paper shows that such an approach can yield an implementation that is competitive with hand-optimized libraries, and describes the software structure that makes our current FFTW3 version flexible and adaptive. We further discuss a new algorithm for real-data DFTs of prime size, a new way of implementing DFTs by means of machine-specific single-instruction, multiple-data (SIMD) instructions, and how a special-purpose compiler can derive optimized implementations of the discrete cosine and sine transforms automatically from a DFT algorithm.
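
The idea of adapting to the hardware can be illustrated with a toy planner that times several candidate DFT routines on the current machine and keeps the fastest. The sketch below is a hedged illustration, not FFTW's actual planner or API (FFTW itself is a C library); the names `dft_naive`, `fft_radix2`, and `make_plan` are hypothetical and chosen only for this example.

```python
import cmath
import time


def dft_naive(x):
    """Direct O(n^2) evaluation of the DFT definition."""
    n = len(x)
    return [
        sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]


def fft_radix2(x):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft_radix2(x[0::2])
    odd = fft_radix2(x[1::2])
    half = n // 2
    out = [0j] * n
    for k in range(half):
        t = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + half] = even[k] - t
    return out


def make_plan(n, trials=3):
    """Time each applicable candidate for size n and return the fastest.

    Assumption: wall-clock timing on a fixed sample input stands in for
    the far more elaborate measurements a real adaptive library performs.
    """
    candidates = [dft_naive]
    if n > 0 and n & (n - 1) == 0:  # radix-2 applies only to powers of two
        candidates.append(fft_radix2)
    sample = [complex(i % 7, 0) for i in range(n)]
    best, best_time = None, float("inf")
    for candidate in candidates:
        start = time.perf_counter()
        for _ in range(trials):
            candidate(sample)
        elapsed = time.perf_counter() - start
        if elapsed < best_time:
            best, best_time = candidate, elapsed
    return best


# Plan once for a given size, then reuse the chosen routine many times.
plan = make_plan(64)
spectrum = plan([1.0] * 64)
```

Separating planning from execution means the cost of measuring is paid once per transform size, after which the selected routine can be applied to any number of inputs of that size.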

Citation impact

Total citations: 5,132
FWCI: 202.51
Percentile: 100%
References: 71

Authors

2

Topics & keywords

Keywords
  • Compiler
  • Computer science
  • SIMD
  • Implementation
  • Sine
  • Discrete cosine transform
  • Prime (order theory)
  • Trigonometric functions