Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/shadyboukhary/gpu-research-fft-openacc-cuda
Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the development of new skills and the formation of new knowledge. This research studies the behavior and performance of two interdisciplinary and widely adopted scientific kernels, a Fast Fourier Transform and Matrix Multiplication. Both routines are implemented in the two current most popular many-core programming models CUDA and OpenACC. A Fast Fourier Transform (FFT) samples a signal over a period of time and divides it into its frequency components, computing the Discrete Fourier Transform (DFT) of a sequence. Unlike the traditional approach to computing a DFT, FFT algorithms reduce the complexity of the problem from O(n2) to O(nLog2n). Matrix multiplication is a cornerstone routine in Mathematics, Artificial Intelligence and Machine Learning. This research also shows that the nature of the problem plays a crucial role in determining what many-core model will provide the highest benefit in performance.
https://github.com/shadyboukhary/gpu-research-fft-openacc-cuda
acceleration cuda fast-fourier-transform fft gpu-acceleration gpu-computing gpu-programming nvcc openacc parallel-computing pgi pgi-compiler radix-2
Last synced: about 1 month ago
JSON representation
Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the development of new skills and the formation of new knowledge. This research studies the behavior and performance of two interdisciplinary and widely adopted scientific kernels, a Fast Fourier Transform and Matrix Multiplication. Both routines are implemented in the two current most popular many-core programming models CUDA and OpenACC. A Fast Fourier Transform (FFT) samples a signal over a period of time and divides it into its frequency components, computing the Discrete Fourier Transform (DFT) of a sequence. Unlike the traditional approach to computing a DFT, FFT algorithms reduce the complexity of the problem from O(n2) to O(nLog2n). Matrix multiplication is a cornerstone routine in Mathematics, Artificial Intelligence and Machine Learning. This research also shows that the nature of the problem plays a crucial role in determining what many-core model will provide the highest benefit in performance.
- Host: GitHub
- URL: https://github.com/shadyboukhary/gpu-research-fft-openacc-cuda
- Owner: ShadyBoukhary
- License: mit
- Created: 2018-03-06T00:25:55.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-08-26T01:44:27.000Z (over 6 years ago)
- Last Synced: 2024-07-30T19:44:00.173Z (5 months ago)
- Topics: acceleration, cuda, fast-fourier-transform, fft, gpu-acceleration, gpu-computing, gpu-programming, nvcc, openacc, parallel-computing, pgi, pgi-compiler, radix-2
- Language: Cuda
- Homepage:
- Size: 9.12 MB
- Stars: 10
- Watchers: 1
- Forks: 3
- Open Issues: 0