Projects in Awesome Lists tagged with gpu-kernels
A curated list of projects in awesome lists tagged with gpu-kernels .
https://github.com/rocm/rocprofiler-compute
Advanced Profiling and Analytics for AMD Hardware
gpu-kernels hardware-counters hpc linux performance-analysis profiling
Last synced: 11 Dec 2025
https://github.com/xmartlabs/cuda-calculator
Online CUDA Occupancy Calculator
cuda gpgpu gpu gpu-computing gpu-kernels gpu-programming kernel nvidia occupancy
Last synced: 10 Mar 2025
https://github.com/eyalroz/gpu-kernel-runner
Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line
cuda debugging-tool gpgpu gpu gpu-kernel-performance gpu-kernels multi-language opencl performance-analysis performance-testing profiling runner
Last synced: 22 Jan 2026
https://github.com/amd-agi/neurips2025-gpu-kernels-tutorial
Repo containing artifacts for Neurips 2025 tutorial- How to Build Agents to Generate Kernels for Faster LLMs (and Other Models!)
gpu-kernels kernel-optimization llamas
Last synced: 15 May 2026
https://github.com/manishklach/intent-attention-kernel
Intent-aware attention research prototype that treats long-context inference as structured semantic blocks instead of a flat token stream, proving CPU-first correctness and analytical KV/FLOP savings before GPU kernel implementation.
agentic-ai ai-infrastructure attention block-attention cost-model cuda gpu-kernels inference kernel-research kv-cache llm-inference long-context python pytorch research semantic-attention sparse-attention systems transformers triton
Last synced: 28 May 2026
https://github.com/fulvius31/triton-cache-tracker
A lightweight utility for monitoring and analyzing Triton kernel compilation cache behavior.
cache cuda gpu gpu-kernels triton triton-openai
Last synced: 30 Apr 2026