An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with gpu-optimization

A curated list of projects in awesome lists tagged with gpu-optimization .

https://github.com/ind4skylivey/0ptiscaler4linux

The intelligent OptiScaler installer Linux gamers needed. Automates FSR4, XeSS & DLSS configuration with GPU-optimized profiles for RDNA3/4, Arc & RTX cards.

amd-fsr dlss frame-generation fsr4 gaming-performance gpu-optimization linux-gaming linux-tools mesa optiscaler proton rdna3 rdna4 shell-scripting steam-deck upscaling vulkan xess

Last synced: 27 Dec 2025

https://github.com/ai-infra-curriculum/ai-infra-performance-learning

AI Infrastructure Performance Engineer Learning Track - GPU optimization, inference optimization, and cost reduction

advanced cost-optimization gpu-optimization inference performance profiling tensorrt

Last synced: 22 Nov 2025

https://github.com/bjornmelin/nlp-engineering-hub

📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤

cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers

Last synced: 20 Mar 2025

https://github.com/tukue/aws-finops-container-optimization

🚀 AWS FinOps Container Optimization for AI Workloads Reference implementation of FinOps best practices for optimizing ECS/EKS-based AI workloads on AWS. Achieve cost optimization through spot instances, autoscaling, and intelligent resource management. 🎯 Key Features: • Spot instance strategies for AI training/inference and cost visibility

autoscaling aws finops gpu-optimization infrastructure-as-code

Last synced: 07 Mar 2026

https://github.com/flosmume/cpp-cuda-deepvision-rtx-starter

CUDA C++ practice project for RTX 4070 SUPER — explore GPU concurrency, pinned memory, and Nsight profiling. Includes SAXPY and 2D blur kernels to train optimization, stream overlap, and timing analysis for NVIDIA Developer Technology Engineering skillset.

cpp cuda cuda-kernels cuda-streams deep-learning-inference gpu gpu-optimization gpu-profiling high-performance-computing nsight nvidia parrallel-computing pinned-memory

Last synced: 31 Oct 2025

https://github.com/bjornmelin/tensorflow-evolution

🧠 Progressive journey through TensorFlow, from basics to advanced architectures. Featuring custom training pipelines, optimized GPU implementations, and production-ready models. Includes CUDA optimizations for large-scale training. 🚀

cuda deep-learning gpu-optimization machine-learning ml-engineering neural-networks python tensorflow

Last synced: 28 Jul 2025

https://github.com/dongskie43/nlp-engineering-hub

📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤

cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers

Last synced: 04 Aug 2025

https://github.com/bjornmelin/edge-ai-engineering

📱 Optimized ML for edge devices. Showcasing efficient model deployment, GPU-CPU memory transfer optimization, and real-world edge AI applications. 🤖

cuda edge-computing embedded-systems gpu-optimization iot mobile-ml model-optimization python tflite

Last synced: 28 Mar 2025