Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/denyskryvytskyi/capgemini-cuda

CUDA implementation of vector additon, matrix multiplication, reduction and sorting
https://github.com/denyskryvytskyi/capgemini-cuda

bitonic-sort cpp cuda cuda-kernels gpgpu matrix matrix-multiplication matrix-multiplication-parallel matrix-transpose nvidia nvidia-cuda nvidia-gpu reduction-dimension sort sorting-algorithms-implemented vector vector-addition vectorization

Last synced: 6 days ago
JSON representation

CUDA implementation of vector additon, matrix multiplication, reduction and sorting

Awesome Lists containing this project

README

        

# Capgemini CUDA tasks
Tasks were implemented and tested in Windows 10 with the Visual Studio CUDA Integration tool and NVCC compiler.

Tasks list:
- vectors addition;
- matrix multiplication using tiles, GPU shared memory, and matrix transposition;
- reduction (sum) with a custom kernel and Nvidia Thrust library for performance comparison;
- sorting using the Nvidia Thrust library.

## Getting Started
- Install [CUDA Toolkit](https://developer.nvidia.com/cuda-toolkit).
- [Windows] Install Visual Studio 2022 (we need MSVC compiler to compile host code and link with device code for the final executable).
- [Linux] Install gcc compiler.

Compile code using **nvcc** compiler:

`nvcc .cu -o -O3`

Run:
`./`