An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with gpu-optimization

A curated list of projects in awesome lists tagged with gpu-optimization .

https://github.com/ind4skylivey/0ptiscaler4linux

The intelligent OptiScaler installer Linux gamers needed. Automates FSR4, XeSS & DLSS configuration with GPU-optimized profiles for RDNA3/4, Arc & RTX cards.

amd-fsr dlss frame-generation fsr4 gaming-performance gpu-optimization linux-gaming linux-tools mesa optiscaler proton rdna3 rdna4 shell-scripting steam-deck upscaling vulkan xess

Last synced: 27 Dec 2025

https://github.com/amd-agi/gpu-optimization-for-llm-inference

This is a short course covering GPU optimization techniques for LLM inference

gpu-optimization llamas llm-inference

Last synced: 15 May 2026

https://github.com/ai-infra-curriculum/ai-infra-performance-learning

AI Infrastructure Performance Engineer Learning Track - GPU optimization, inference optimization, and cost reduction

advanced cost-optimization gpu-optimization inference performance profiling tensorrt

Last synced: 22 Nov 2025

https://github.com/tukue/aws-finops-container-optimization

🚀 AWS FinOps Container Optimization for AI Workloads Reference implementation of FinOps best practices for optimizing ECS/EKS-based AI workloads on AWS. Achieve cost optimization through spot instances, autoscaling, and intelligent resource management. 🎯 Key Features: • Spot instance strategies for AI training/inference and cost visibility

autoscaling aws finops gpu-optimization infrastructure-as-code

Last synced: 07 Mar 2026

https://github.com/flosmume/cpp-cuda-deepvision-rtx-starter

CUDA C++ practice project for RTX 4070 SUPER — explore GPU concurrency, pinned memory, and Nsight profiling. Includes SAXPY and 2D blur kernels to train optimization, stream overlap, and timing analysis for NVIDIA Developer Technology Engineering skillset.

cpp cuda cuda-kernels cuda-streams deep-learning-inference gpu gpu-optimization gpu-profiling high-performance-computing nsight nvidia parrallel-computing pinned-memory

Last synced: 16 May 2026

https://github.com/bjornmelin/nlp-engineering-hub

📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤

cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers

Last synced: 17 Apr 2026

https://github.com/bjornmelin/edge-ai-engineering

📱 Optimized ML for edge devices. Showcasing efficient model deployment, GPU-CPU memory transfer optimization, and real-world edge AI applications. 🤖

cuda edge-computing embedded-systems gpu-optimization iot mobile-ml model-optimization python tflite

Last synced: 02 May 2026