https://github.com/ProjectPhysX/PTXprofiler
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
https://github.com/ProjectPhysX/PTXprofiler
cuda gpu gpu-acceleration gpu-computing gpu-programming hpc nvidia nvidia-cuda nvidia-gpu opencl profiler ptx ptx-utils roofline-model sycl
Last synced: 2 months ago
JSON representation
A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.
- Host: GitHub
- URL: https://github.com/ProjectPhysX/PTXprofiler
- Owner: ProjectPhysX
- License: other
- Created: 2023-01-11T19:24:04.000Z (over 2 years ago)
- Default Branch: master
- Last Pushed: 2025-03-20T06:14:32.000Z (3 months ago)
- Last Synced: 2025-03-27T22:05:59.762Z (3 months ago)
- Topics: cuda, gpu, gpu-acceleration, gpu-computing, gpu-programming, hpc, nvidia, nvidia-cuda, nvidia-gpu, opencl, profiler, ptx, ptx-utils, roofline-model, sycl
- Language: C++
- Homepage:
- Size: 11.7 KB
- Stars: 50
- Watchers: 4
- Forks: 6
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
- awesome-oneapi - PTXprofiler - A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. (Table of Contents / Tools and Development)