Projects in Awesome Lists tagged with gpu-optimization
A curated list of projects in awesome lists tagged with gpu-optimization .
https://github.com/nvidia/cuopt-resources
A collection of NVIDIA cuOpt samples and other resources
cvrp cvrptw gpu gpu-acceleration gpu-optimization intralogistics last-mile-delivery logistics nvidia-gpu operations-research optimization optimization-algorithms optimization-tools pickup-and-delivery route-optimization traveling-salesman-problem tsp-solver vehicle-routing-problem vrp vrp-solver
Last synced: 04 Apr 2025
https://github.com/yui0/waifu2x-glsl
Fast waifu2x converter with GPU optimization
fast-waifu2x-converter glew glsl gpgpu gpu-optimization linux macos nyanko resolution waifu2x waifu2x-glsl
Last synced: 28 Jan 2026
https://github.com/robthepcguy/performance-mod-guide-for-valheim
Boost Valheim's FPS to forge a smoother Viking journey!
cpu-optimization game-configuration gaming-efficiency gaming-mods gpu-optimization high-priority-mode optimization-techniques performance-tweaking steam-guide tech-guide valheim-mods valheim-performance valheim-tips valheim-tricks viking-game
Last synced: 06 Oct 2025
https://github.com/yui0/waifu2x-ocl
Fast waifu2x converter with GPU optimization
fast-waifu2x-converter gpu-optimization linux macos nyanko opencl resolution waifu2x waifu2x-ocl windows
Last synced: 28 Jan 2026
https://github.com/ind4skylivey/0ptiscaler4linux
The intelligent OptiScaler installer Linux gamers needed. Automates FSR4, XeSS & DLSS configuration with GPU-optimized profiles for RDNA3/4, Arc & RTX cards.
amd-fsr dlss frame-generation fsr4 gaming-performance gpu-optimization linux-gaming linux-tools mesa optiscaler proton rdna3 rdna4 shell-scripting steam-deck upscaling vulkan xess
Last synced: 27 Dec 2025
https://github.com/md-emon-hasan/fine-tuning
End-to-end fine-tuning of Hugging Face models using LoRA, QLoRA, quantization, and PEFT techniques. Optimized for low-memory with efficient model deployment
bitsandbytes deep-learning fine-tuning fp16-training gpu-optimization gradient-checkpointing huggingface huggingface-datasets lora low-memory-training machine-learning model-training natural-language-processing nlp parameter-efficient-fine-tuning peft pytorch qlora quantization transformers
Last synced: 23 Jul 2025
https://github.com/ai-infra-curriculum/ai-infra-performance-learning
AI Infrastructure Performance Engineer Learning Track - GPU optimization, inference optimization, and cost reduction
advanced cost-optimization gpu-optimization inference performance profiling tensorrt
Last synced: 22 Nov 2025
https://github.com/bjornmelin/nlp-engineering-hub
📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤
cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers
Last synced: 20 Mar 2025
https://github.com/tukue/aws-finops-container-optimization
🚀 AWS FinOps Container Optimization for AI Workloads Reference implementation of FinOps best practices for optimizing ECS/EKS-based AI workloads on AWS. Achieve cost optimization through spot instances, autoscaling, and intelligent resource management. 🎯 Key Features: • Spot instance strategies for AI training/inference and cost visibility
autoscaling aws finops gpu-optimization infrastructure-as-code
Last synced: 07 Mar 2026
https://github.com/flosmume/cpp-cuda-deepvision-rtx-starter
CUDA C++ practice project for RTX 4070 SUPER — explore GPU concurrency, pinned memory, and Nsight profiling. Includes SAXPY and 2D blur kernels to train optimization, stream overlap, and timing analysis for NVIDIA Developer Technology Engineering skillset.
cpp cuda cuda-kernels cuda-streams deep-learning-inference gpu gpu-optimization gpu-profiling high-performance-computing nsight nvidia parrallel-computing pinned-memory
Last synced: 31 Oct 2025
https://github.com/bjornmelin/tensorflow-evolution
🧠Progressive journey through TensorFlow, from basics to advanced architectures. Featuring custom training pipelines, optimized GPU implementations, and production-ready models. Includes CUDA optimizations for large-scale training. 🚀
cuda deep-learning gpu-optimization machine-learning ml-engineering neural-networks python tensorflow
Last synced: 28 Jul 2025
https://github.com/dongskie43/nlp-engineering-hub
📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤
cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers
Last synced: 04 Aug 2025
https://github.com/bjornmelin/edge-ai-engineering
📱 Optimized ML for edge devices. Showcasing efficient model deployment, GPU-CPU memory transfer optimization, and real-world edge AI applications. 🤖
cuda edge-computing embedded-systems gpu-optimization iot mobile-ml model-optimization python tflite
Last synced: 28 Mar 2025