This repository designs and evaluates two parallel implementations of the DFT (Discrete Fourier Transform): one for CUDA GPUs and one using MPI (Message Passing Interface).

Both the CUDA and MPI implementations showed significant performance improvements over the sequential algorithm.