Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/MishaLaskin/curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
contrastive-learning contrastive-loss contrastive-predictive-coding curl deep-learning deep-learning-algorithms deep-neural-networks deep-q-learning deep-q-network deep-reinforcement-learning deep-rl deeplearning deeplearning-ai gpu model-free-rl off-policy reinforcement-agents reinforcement-learning reinforcement-learning-algorithms sac
Last synced: 03 Jul 2024
![](https://github.com/MishaLaskin.png)
https://github.com/matthewfeickert/nvidia-gpu-ml-library-test
Simple tests for JAX, PyTorch, and TensorFlow to test if the installed NVIDIA drivers are being properly picked up
cuda cudnn gpu jax nvidia pytorch setup tensorflow torch
Last synced: 03 Jul 2024
![](https://github.com/matthewfeickert.png)
https://github.com/js1010/cusim
Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA)
cuda gensim gpu lda topic-modeling w2v word-embedding
Last synced: 03 Jul 2024
![](https://github.com/js1010.png)
https://github.com/dionhaefner/pyhpc-benchmarks
A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python :rocket:
benchmarks cupy gpu high-performance-computing jax parallel-computing python pytorch tensorflow
Last synced: 03 Jul 2024
![](https://github.com/dionhaefner.png)
https://github.com/peci1/nvidia-htop
A tool for enriching the output of nvidia-smi.
command-line gpu nvidia nvidia-smi
Last synced: 03 Jul 2024
![](https://github.com/peci1.png)
https://github.com/EtienneCmb/visbrain
A multi-purpose GPU-accelerated open-source suite for brain data visualization
brain connectivity deep-sources gpu gui mni neuroscience opengl plot python sleep vispy visualization
Last synced: 03 Jul 2024
![](https://github.com/EtienneCmb.png)
https://github.com/fastaudio/fastaudio
🔊 Audio and fastai v2
audio deep-learning fastai gpu python pytorch
Last synced: 02 Jul 2024
![](https://github.com/fastaudio.png)
https://github.com/PKU-DAIR/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
artificial-intelligence autograd data-science deep-learning deep-neural-networks distributed-systems distributed-training embeddings gpu high-dimensional machine-learning python state-of-the-art
Last synced: 01 Jul 2024
![](https://github.com/PKU-DAIR.png)
https://github.com/nihui/ncnn-android-styletransfer
The style transfer android example
android arm cpu deep-learning gpu ncnn neural-network styletransfer vulkan
Last synced: 01 Jul 2024
![](https://github.com/nihui.png)
https://github.com/nihui/dain-ncnn-vulkan
DAIN, Depth-Aware Video Frame Interpolation implemented with ncnn library
dain gpu ncnn video-interpolation vulkan
Last synced: 01 Jul 2024
![](https://github.com/nihui.png)
https://github.com/nihui/cain-ncnn-vulkan
CAIN, Channel Attention Is All You Need for Video Frame Interpolation implemented with ncnn library
cain gpu ncnn video-interpolation vulkan
Last synced: 01 Jul 2024
![](https://github.com/nihui.png)
https://github.com/idealab-isu/GPView
GPU Accelerated Voxelization Framework for 3D CAD models.
Last synced: 01 Jul 2024
![](https://github.com/idealab-isu.png)
https://github.com/PaddlePaddle/Serving
A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)
dag deep-learning docker gpu micro-service microservice-toolkit online-service paddle paddle-serving pipeline prediction predictor python rpc-service serving
Last synced: 01 Jul 2024
![](https://github.com/PaddlePaddle.png)
https://github.com/CVCUDA/CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
bytedance cloud computer-vision cpp cuda cv-cuda gpu image-processing machine-learning nvidia python
Last synced: 01 Jul 2024
![](https://github.com/CVCUDA.png)
https://github.com/pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
deep-learning embedded gpu machine-learning mobile neural-network tensor
Last synced: 01 Jul 2024
![](https://github.com/pytorch.png)
https://github.com/xtknight/vdpau-va-driver-vp9
Experimental VP9 codec support for vdpau-va-driver (NVIDIA VDPAU-VAAPI wrapper) and chromium-vaapi
4k acceleration chromium chromium-vaapi gpu hardware nvdec nvidia va-api vaapi vdpau vdpau-va-driver video vp9
Last synced: 01 Jul 2024
![](https://github.com/xtknight.png)
https://github.com/DiffSharp/DiffSharp
DiffSharp: Differentiable Functional Programming
autodiff deep-learning dotnet gpu machine-learning neural-network tensor
Last synced: 01 Jul 2024
![](https://github.com/DiffSharp.png)
https://github.com/ChrisCummins/clgen
Deep learning program generator
benchmarking big-data deep-learning gpu lstm machine-learning neural-network opencl synthetic-programs
Last synced: 01 Jul 2024
![](https://github.com/ChrisCummins.png)
https://github.com/cgi-estonia-space/ALUs
GPU accelerated earth observation data processors
earth-observation eo gpu gpu-computing nvidia nvidia-gpu sentinel-1 sentinel-2
Last synced: 30 Jun 2024
![](https://github.com/cgi-estonia-space.png)
https://github.com/dstackai/dstack
dstack is an open-source orchestration engine for cost-effectively running AI workloads in the cloud as well as on-premises. Discord: https://discord.gg/u8SmfwPpMd
aws azure cloud gcp gpu llms machine-learning orchestration python
Last synced: 29 Jun 2024
![](https://github.com/dstackai.png)
https://github.com/pytorch/serve
Serve, optimize and scale PyTorch models in production
cpu deep-learning docker gpu kubernetes machine-learning metrics mlops optimization pytorch serving
Last synced: 29 Jun 2024
![](https://github.com/pytorch.png)
https://github.com/T-vK/MobilePassThrough
Make GPU passthrough on notebooks easy and accessible!
automated check compatibility dgpu gpu igpu mediated passthrough script vm
Last synced: 29 Jun 2024
![](https://github.com/T-vK.png)
![](https://github.com/zk4x.png)
https://github.com/JunweiLiang/awesome_lists
Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)
advice ai assistant-professor awesome-list gpu grants-search
Last synced: 29 Jun 2024
![](https://github.com/JunweiLiang.png)
https://github.com/flipacholas/Architecture-of-consoles
Technical articles about console architecture
architecture articles computer-architecture console cpu discussion gpu translation videogame
Last synced: 29 Jun 2024
![](https://github.com/flipacholas.png)
https://github.com/Jerc007/Open-GPGPU-FlexGrip-
FlexGripPlus: an open-source GPU model for reliability evaluation and micro architectural simulation
gpu microarchitecture simulation-model
Last synced: 29 Jun 2024
![](https://github.com/Jerc007.png)
https://github.com/haasn/libplacebo
Official mirror of libplacebo
d3d11 ffmpeg glsl gpu mirror mpv multimedia opengl shaders video video-player video-processing videolan vlc vulkan
Last synced: 29 Jun 2024
![](https://github.com/haasn.png)
https://github.com/BlazingDB/blazingsql
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
arrow artificial-intelligence blazingsql conda-environment cudf data-science gpu gpu-acceleration gpu-dataframes machine-learning machine-learning-workflow python rapids rapidsai sql sql-engine
Last synced: 29 Jun 2024
![](https://github.com/BlazingDB.png)
https://github.com/merzlab/QUICK
QUICK: A GPU-enabled ab intio quantum chemistry software package
chemistry computational-chemistry cuda density-functional-theory electronic-structure-calculations gpu gpu-acceleration hartree-fock parallel-computing quantum-chemistry
Last synced: 28 Jun 2024
![](https://github.com/merzlab.png)
https://github.com/wavefunction91/GauXC
GauXC is a modern, modular C++ library for the evaluation of quantities related to the exchange-correlation (XC) energy (e.g. potential, etc) in the Gaussian basis set discretization of Kohn-Sham density function theory (KS-DFT) on heterogenous architectures.
Last synced: 28 Jun 2024
![](https://github.com/wavefunction91.png)
https://github.com/electronic-structure/SIRIUS
Domain specific library for electronic structure calculations
cuda density-functional-theory electronic-structure-calculations full-potential gpu lapw mpi planewave pseudopotential rocm
Last synced: 28 Jun 2024
![](https://github.com/electronic-structure.png)
https://github.com/jerryji1993/DNABERT
DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
deep-learning dnabert-model genome gpu kmer kmer-format machine-learning natural-language-processing nlp sequence
Last synced: 28 Jun 2024
![](https://github.com/jerryji1993.png)
![](https://github.com/JuliaAttic.png)
https://github.com/JuliaGPU/GPUCompiler.jl
Reusable compiler infrastructure for Julia GPU backends.
compiler gpu hacktoberfest julia
Last synced: 28 Jun 2024
![](https://github.com/JuliaGPU.png)
https://github.com/ZenitH-AT/nvidia-update
Checks for a new version of the NVIDIA driver, downloads and installs it.
Last synced: 27 Jun 2024
![](https://github.com/ZenitH-AT.png)
https://github.com/triagemd/tensorflow-builds
Tensorflow binaries and Docker images compiled with GPU support and CPU optimizations.
bazel cuda cudnn docker gpu machine-learning nvidia python tensorflow tensorflow-serving
Last synced: 27 Jun 2024
![](https://github.com/triagemd.png)
https://github.com/selkies-project/docker-nvidia-glx-desktop
KDE Plasma Desktop container designed for Kubernetes supporting OpenGL GLX and Vulkan for NVIDIA GPUs with WebRTC and HTML5, providing an open-source remote cloud graphics or game streaming platform. Spawns its own fully isolated X Server instead of using the host X Server, not requiring /tmp/.X11-unix host sockets or host configuration.
cloud-gaming docker docker-image game-streaming gpu gstreamer html5 kubernetes linux-gaming nvidia nvidia-docker nvidia-gpu opengl remote-access remote-control remote-desktop ubuntu vulkan webrtc wine
Last synced: 27 Jun 2024
![](https://github.com/selkies-project.png)
https://github.com/harrism/hemi
Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
c-plus-plus cuda cuda-device cuda-kernels gpu hemi
Last synced: 27 Jun 2024
![](https://github.com/harrism.png)
https://github.com/AlexanderVeselov/RayTracing
Realtime GPU Path tracer based on OpenCL and OpenGL
3d cpp gpgpu gpu opencl opengl pathtracing pbr raytracing render
Last synced: 27 Jun 2024
![](https://github.com/AlexanderVeselov.png)
https://github.com/Yours3lf/rpi-vk-driver
VK driver for the Raspberry Pi (Broadcom Videocore IV)
broadcom broadcom-videocore-iv driver gpu raspberry-pi raspberrypi rpi-vk-driver videocore-iv
Last synced: 26 Jun 2024
![](https://github.com/Yours3lf.png)
https://github.com/taichi-dev/difftaichi
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)
differentiable-programming gpu graphics robotics simulation taichi
Last synced: 26 Jun 2024
![](https://github.com/taichi-dev.png)
https://github.com/mitmul/pynvvl
A Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
cuda cupy gpu numpy nvidia-video-loader nvvl python video video-processing
Last synced: 26 Jun 2024
![](https://github.com/mitmul.png)
https://github.com/coderobe/VBiosFinder
Extract embedded VBIOS from (almost) any BIOS Update
bios gpu hacktoberfest hardware linux pci pci-passthrough uefi vbios
Last synced: 25 Jun 2024
![](https://github.com/coderobe.png)
https://github.com/marekkaczkowski/Touch-Bar-iStats
Show CPU/GPU/MEM temperature on Touch Bar with BetterTouchTool!
bar bettertouchtool bettertouchtool-widget btt cpu gpu icons istats macbook macbookpro memory template touch touchbar utils widget widgets
Last synced: 25 Jun 2024
![](https://github.com/marekkaczkowski.png)
https://github.com/OlafenwaMoses/ImageAI
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
ai-practice-recommendations algorithm artificial-intelligence artificial-neural-networks densenet detection gpu image-prediction image-recognition imageai inceptionv3 machine-learning object-detection offline-capable prediction python python3 squeezenet video
Last synced: 24 Jun 2024
![](https://github.com/OlafenwaMoses.png)
https://github.com/codingonion/awesome-cuda-tensorrt-fpga
🔥🔥🔥 A collection of some awesome public NVIDIA CUDA, cuBLAS, cuDNN, TensorRT, AMD ROCm and FPGA projects.
awesome blas cublas cuda cudnn fpga gpu hdl large-language-models llama3 llm mojo nvidia pytorch tensorrt web3 yolo yolov10 yolov5 zkp
Last synced: 24 Jun 2024
![](https://github.com/codingonion.png)
https://github.com/cair/tmu
Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features, drop clause, Type III Feedback, focused negative sampling, multi-task classifier, autoencoder, literal budget, and one-vs-one multi-class classifier. TMU is written in Python with wrappers for C and CUDA-based clause evaluation and updating.
absorbing-states autoencoder convolution cuda gpu incremental incremental-computation multi-output pattern-recognition propositional-logic regression relational-logic sparse tsetlin-machine
Last synced: 24 Jun 2024
![](https://github.com/cair.png)
https://github.com/ivy-llc/ivy
The Unified AI Framework
abstraction autograd deep-learning gpu ivy jax machine-learning mxnet neural-network numpy python pytorch template tensorflow
Last synced: 24 Jun 2024
![](https://github.com/ivy-llc.png)
https://github.com/NVIDIA-Genomics-Research/GenomeWorks
SDK for GPU accelerated genome assembly and analysis
alignment cuda genomics gpu mapping nvidia partial-order-alignment poa python-api
Last synced: 23 Jun 2024
![](https://github.com/NVIDIA-Genomics-Research.png)
https://github.com/Tencent/TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
albert bert decoder gpt2 gpu huggingface-transformers inference machine-translation nlp pytorch roberta transformer
Last synced: 22 Jun 2024
![](https://github.com/Tencent.png)
https://github.com/DeadManWalkingTO/Windows10MiningTweaksDmW
Windows 10 Mining Tweaks by DeadManWalking (DeadManWalkingTO-GitHub)
btc cpu deadmanwalking deadmanwalkingto dmw eth ethereum gpu miner mining optimization optimizer tweaks usage win10 windows windows10 xmr
Last synced: 21 Jun 2024
![](https://github.com/DeadManWalkingTO.png)
https://intel.github.io/scikit-learn-intelex/
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
ai-inference ai-machine-learning ai-training analytics big-data data-analysis gpu intel machine-learning machine-learning-algorithms oneapi python scikit-learn swrepo
Last synced: 21 Jun 2024
![](https://github.com/intel.png)
https://tlkh.github.io/asitop/
Perf monitoring CLI tool for Apple Silicon
apple-silicon cli cpu gpu m1 macos
Last synced: 21 Jun 2024
![](https://github.com/tlkh.png)
https://github.com/vladkens/macmon
🦀⚙️ Sudoless performance monitoring for Apple Silicon processors
apple apple-silicon arm64 asitop cli cpu cpu-monitoring cpu-usage gpu gpu-monitoring gpu-usage m1 macos monitoring powermetrics ratatui ratatui-rs rust terminal tui
Last synced: 21 Jun 2024
![](https://github.com/vladkens.png)
https://github.com/binga/cloud-gpus
This repository contains information about Cloud GPU offerings for Machine Learning practitioners.
cloud-gpu colaboratory credits deep-learning gpu kaggle-scripts machine-learning
Last synced: 20 Jun 2024
![](https://github.com/binga.png)
https://github.com/pmndrs/detect-gpu
Classifies GPUs based on their 3D rendering benchmark score allowing the developer to provide sensible default settings for graphically intensive applications.
adaptive babylonjs benchmarks browser demo detection device gpu hardware pixijs progressive-enhancement threejs webgl webgl2
Last synced: 20 Jun 2024
![](https://github.com/pmndrs.png)
https://github.com/FaceONNX/FaceONNX
Face recognition and analytics library based on deep neural networks and ONNX runtime
antispoofing classification cpu deep-neural-networks detection dotnet embeddings estimation face face-analytics-library face-detection face-onnx face-recognition faceonnx gpu landmarks onnx onnx-runtime recognition
Last synced: 20 Jun 2024
![](https://github.com/FaceONNX.png)
https://github.com/microsoft/Microsoft-Rocket-Video-Analytics-Platform
A highly extensible software stack to empower everyone to build practical real-world live video analytics applications for object detection and counting with cutting edge machine learning algorithms.
azure counting docker dotnet-core edge-computing gpu object-detection tensorflow video-analytics yolov3
Last synced: 19 Jun 2024
![](https://github.com/microsoft.png)
https://github.com/jdermody/brightwire
Bright Wire is an open source machine learning library for .NET with GPU support (via CUDA)
convolutional-neural-networks csharp cuda cuda-support gpu gpu-support machine-learning machine-learning-library machinelearning neural-network recurrent-neural-networks
Last synced: 19 Jun 2024
![](https://github.com/jdermody.png)
https://github.com/harujoh/KelpNet
Pure C# machine learning framework
csharp deep-learning dotnet gpu machine-learning neural-network onnx opencl
Last synced: 19 Jun 2024
![](https://github.com/harujoh.png)
https://github.com/przemyslawzaworski/Unity3D-CG-programming
Various shaders.
Last synced: 19 Jun 2024
![](https://github.com/przemyslawzaworski.png)
https://github.com/discosultan/VulkanCore
Vulkan 1.0 graphics and compute API bindings for .NET Standard
c-sharp dotnet-standard gpgpu gpu graphics netstandard vulkan
Last synced: 19 Jun 2024
![](https://github.com/discosultan.png)
https://github.com/keijiro/StableFluids
A straightforward GPU implementation of Jos Stam's "Stable Fluids" on Unity.
compute fluid-dynamics gpu shader unity unity3d
Last synced: 19 Jun 2024
![](https://github.com/keijiro.png)
https://github.com/hpcaitech/FastFold
Optimizing AlphaFold Training and Inference on GPU Clusters
alphafold2 cuda evoformer gpu habana-gaudi parallelism protein-folding protein-structure pytorch
Last synced: 18 Jun 2024
![](https://github.com/hpcaitech.png)
https://github.com/rapidsai/cugraph
cuGraph - RAPIDS Graph Analytics Library
complex-networks cuda gpu graph graph-algorithms graph-analysis graph-framework graphml nvidia rapids
Last synced: 17 Jun 2024
![](https://github.com/rapidsai.png)
https://github.com/bh107/bohrium
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
cuda gpu gpu-acceleration multi-core numpy opencl parallel-computing
Last synced: 17 Jun 2024
![](https://github.com/bh107.png)
https://github.com/casact/rp-bnn-claims
Individual Claims Forecasting with Bayesian Mixture Density Networks
actuarial actuarial-data gpu reserving
Last synced: 16 Jun 2024
![](https://github.com/casact.png)
https://github.com/hellzerg/indicium
Portable, advanced system information utility
audio bios bios-info computer-specs cpu devices gpu info information motherboard operating-system-details os-details pc-specs peripherals ram storage system system-info system-information windows-product-key
Last synced: 16 Jun 2024
![](https://github.com/hellzerg.png)
https://github.com/curtisgray/wingman
Wingman is the fastest and easiest way to run Llama models on your PC or Mac.
ai chatbot chatgpt download downloader gpu gpu-acceleration gpu-monitoring inference inference-engine inference-server linux llama llamacpp llm local macos openai windows
Last synced: 16 Jun 2024
![](https://github.com/curtisgray.png)
https://github.com/pytorch/torchrec
Pytorch domain library for recommendation systems
cuda deep-learning gpu pytorch recommendation-system recommender-system sharding
Last synced: 16 Jun 2024
![](https://github.com/pytorch.png)
https://github.com/DeepRec-AI/HybridBackend
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
deep-learning gpu hybrid-parallelism parquet recommender-system
Last synced: 16 Jun 2024
![](https://github.com/DeepRec-AI.png)
http://diffsharp.github.io/DiffSharp/
DiffSharp: Differentiable Functional Programming
autodiff deep-learning dotnet gpu machine-learning neural-network tensor
Last synced: 16 Jun 2024
![](https://github.com/DiffSharp.png)
https://github.com/AstroAccelerateOrg/astro-accelerate
AstroAccelerate is a many-core accelerated software package for processing time-domain radio-astronomy data.
Last synced: 16 Jun 2024
![](https://github.com/AstroAccelerateOrg.png)
https://github.com/pritul2/yolov5_FaceMask
Detecting person with or without face mask. Trained using YOLOv5.
covid-19 cpu face-mask-detection gpu yolov5
Last synced: 15 Jun 2024
![](https://github.com/pritul2.png)
https://github.com/AntonMu/TrainYourOwnYOLO
Train a state-of-the-art yolov3 object detector from scratch!
annotating-images custom-yolo deep-learning deep-learning-tutorial detector google-colab gpu inference keras keras-models object-detection python tensorflow2 tf2 trainyourownyolo transfer-learning wandb weights-and-biases yolo yolov3
Last synced: 15 Jun 2024
![](https://github.com/AntonMu.png)
https://github.com/yasenh/libtorch-yolov5
A LibTorch inference implementation of the yolov5
Last synced: 15 Jun 2024
![](https://github.com/yasenh.png)
https://github.com/Deyht/CIANNA
Convolutional Interactive Artificial Neural Networks by/for Astrophysicists
astronomy astrophysics convolutional-neural-networks cuda deep-learning deep-neural-networks gpu machine-learning ml neural-network object-detection yolo
Last synced: 15 Jun 2024
![](https://github.com/Deyht.png)
https://github.com/intel/intel-technology-enabling-for-openshift
This project enables Intel® platform technologies (SGX, QAT) and GPUs on Red Hat OpenShift Container Platform
accelerator cloud datacenter device-plugin gpu kmm kubernetes nfd openshift operator qat sgx xeon yaml
Last synced: 15 Jun 2024
![](https://github.com/intel.png)
https://github.com/bheisler/RustaCUDA
Rusty wrapper for the CUDA Driver API
Last synced: 14 Jun 2024
![](https://github.com/bheisler.png)
![](https://github.com/csl-iisc.png)
https://github.com/CRAFT-THU/BSim
High Performance Simulation of Spiking Neural Network on GPGPUs
Last synced: 14 Jun 2024
![](https://github.com/CRAFT-THU.png)
https://github.com/Dr-Noob/peakperf
Achieve peak performance on x86 CPUs and NVIDIA GPUs
assembly avx cpu cpu-frequency cpu-microarchitecture cuda gflop gpu intrinsics microarchitecture microbenchmark nvidia performance
Last synced: 14 Jun 2024
![](https://github.com/Dr-Noob.png)
https://github.com/NVIDIA/gpu-operator
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Last synced: 14 Jun 2024
![](https://github.com/NVIDIA.png)
https://github.com/Tencent/Forward
A library for high performance deep learning inference on NVIDIA GPUs.
cuda deep-learning forward gpu inference inference-engine keras neural-network onnx pytorch tensorflow tensorrt
Last synced: 14 Jun 2024
![](https://github.com/Tencent.png)
https://github.com/vectorch-ai/ScaleLLM
A high-performance inference system for large language models, designed for production environments.
cuda efficiency gpu inference llama llama3 llm llm-inference model performance production serving speculative transformer
Last synced: 13 Jun 2024
![](https://github.com/vectorch-ai.png)
https://github.com/schrodinger/gpusimilarity
A Cuda/Thrust implementation of fingerprint similarity searching
cheminformatics chemistry gpu similarity-analysis
Last synced: 13 Jun 2024
![](https://github.com/schrodinger.png)
https://github.com/scverse/rapids_singlecell
Rapids_singlecell: A GPU-accelerated tool for scRNA analysis. Offers seamless scverse compatibility for efficient single-cell data processing and analysis.
anndata bioinformatics gpu scverse single-cell
Last synced: 13 Jun 2024
![](https://github.com/scverse.png)
https://github.com/chembl/FPSim2
Simple package for fast molecular similarity searches
cheminformatics chemistry gpu python similarity-search
Last synced: 13 Jun 2024
![](https://github.com/chembl.png)
https://github.com/OpenBMB/BMInf
Efficient Inference for Big Models
deep-learning gpu pretrained-language-models
Last synced: 13 Jun 2024
![](https://github.com/OpenBMB.png)
https://github.com/Ipotrick/Daxa
Daxa is a convenient, simple and modern gpu abstraction built on vulkan
Last synced: 13 Jun 2024
![](https://github.com/Ipotrick.png)
https://github.com/andylamp/gpurelperf
A handy utility to find relative GPU performance quickly in multi-gpu boxes
Last synced: 12 Jun 2024
![](https://github.com/andylamp.png)
https://github.com/QaidVoid/Complete-Single-GPU-Passthrough
Single GPU VFIO Passthrough Guide
gpu libvirt-hooks linux passthrough qemu-kvm vfio-pci virtio
Last synced: 12 Jun 2024
![](https://github.com/QaidVoid.png)
https://github.com/m1k1o/go-transcode
On-demand transcoding origin server for live inputs and static files in Go using ffmpeg. Also with NVIDIA GPU hardware acceleration.
demand-transcoding docker ffmpeg golang gpu live-streaming nvidia-cuda streams transcoding
Last synced: 12 Jun 2024
![](https://github.com/m1k1o.png)
https://github.com/NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch
Last synced: 12 Jun 2024
![](https://github.com/NVIDIA.png)