Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/IntelPython/dpnp

Data Parallel Extension for NumPy

dpcpp gpu gpu-acceleration intel mkl numpy oneapi pstl python3 sycl

Last synced: 03 Jul 2024

https://github.com/matthewfeickert/nvidia-gpu-ml-library-test

Simple tests for JAX, PyTorch, and TensorFlow to check whether the installed NVIDIA drivers are being properly picked up

cuda cudnn gpu jax nvidia pytorch setup tensorflow torch

Last synced: 03 Jul 2024

https://github.com/js1010/cusim

Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA)

cuda gensim gpu lda topic-modeling w2v word-embedding

Last synced: 03 Jul 2024

https://github.com/dionhaefner/pyhpc-benchmarks

A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python 🚀

benchmarks cupy gpu high-performance-computing jax parallel-computing python pytorch tensorflow

Last synced: 03 Jul 2024

https://github.com/peci1/nvidia-htop

A tool for enriching the output of nvidia-smi.

command-line gpu nvidia nvidia-smi

Last synced: 03 Jul 2024

https://github.com/EtienneCmb/visbrain

A multi-purpose GPU-accelerated open-source suite for brain data visualization

brain connectivity deep-sources gpu gui mni neuroscience opengl plot python sleep vispy visualization

Last synced: 03 Jul 2024

https://github.com/fastaudio/fastaudio

🔊 Audio and fastai v2

audio deep-learning fastai gpu python pytorch

Last synced: 02 Jul 2024

https://github.com/PKU-DAIR/Hetu

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

artificial-intelligence autograd data-science deep-learning deep-neural-networks distributed-systems distributed-training embeddings gpu high-dimensional machine-learning python state-of-the-art

Last synced: 01 Jul 2024

https://github.com/nihui/dain-ncnn-vulkan

DAIN, Depth-Aware Video Frame Interpolation implemented with ncnn library

dain gpu ncnn video-interpolation vulkan

Last synced: 01 Jul 2024

https://github.com/nihui/cain-ncnn-vulkan

CAIN, Channel Attention Is All You Need for Video Frame Interpolation implemented with ncnn library

cain gpu ncnn video-interpolation vulkan

Last synced: 01 Jul 2024

https://github.com/nihui/realsr-ncnn-vulkan

RealSR super resolution implemented with ncnn library

amd gpu intel ncnn nvidia realsr vulkan

Last synced: 01 Jul 2024

https://github.com/nihui/srmd-ncnn-vulkan

SRMD super resolution implemented with ncnn library

amd gpu intel ncnn nvidia srmd vulkan

Last synced: 01 Jul 2024

https://github.com/idealab-isu/GPView

GPU Accelerated Voxelization Framework for 3D CAD models.

cpp cuda gpu voxelization

Last synced: 01 Jul 2024

https://github.com/PaddlePaddle/Serving

A flexible, high-performance carrier for machine learning models (PaddlePaddle serving deployment framework)

dag deep-learning docker gpu micro-service microservice-toolkit online-service paddle paddle-serving pipeline prediction predictor python rpc-service serving

Last synced: 01 Jul 2024

https://github.com/CVCUDA/CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

bytedance cloud computer-vision cpp cuda cv-cuda gpu image-processing machine-learning nvidia python

Last synced: 01 Jul 2024

https://github.com/pytorch/executorch

On-device AI across mobile, embedded and edge for PyTorch

deep-learning embedded gpu machine-learning mobile neural-network tensor

Last synced: 01 Jul 2024

https://github.com/xtknight/vdpau-va-driver-vp9

Experimental VP9 codec support for vdpau-va-driver (NVIDIA VDPAU-VAAPI wrapper) and chromium-vaapi

4k acceleration chromium chromium-vaapi gpu hardware nvdec nvidia va-api vaapi vdpau vdpau-va-driver video vp9

Last synced: 01 Jul 2024

https://github.com/DiffSharp/DiffSharp

DiffSharp: Differentiable Functional Programming

autodiff deep-learning dotnet gpu machine-learning neural-network tensor

Last synced: 01 Jul 2024

https://github.com/cgi-estonia-space/ALUs

GPU accelerated earth observation data processors

earth-observation eo gpu gpu-computing nvidia nvidia-gpu sentinel-1 sentinel-2

Last synced: 30 Jun 2024

https://github.com/dstackai/dstack

dstack is an open-source orchestration engine for cost-effectively running AI workloads in the cloud as well as on-premises. Discord: https://discord.gg/u8SmfwPpMd

aws azure cloud gcp gpu llms machine-learning orchestration python

Last synced: 29 Jun 2024

https://github.com/pytorch/serve

Serve, optimize and scale PyTorch models in production

cpu deep-learning docker gpu kubernetes machine-learning metrics mlops optimization pytorch serving

Last synced: 29 Jun 2024

https://github.com/T-vK/MobilePassThrough

Make GPU passthrough on notebooks easy and accessible!

automated check compatibility dgpu gpu igpu mediated passthrough script vm

Last synced: 29 Jun 2024

https://github.com/zk4x/zyx

Tensor library for machine learning

deep-learning gpu rust

Last synced: 29 Jun 2024

https://github.com/JunweiLiang/awesome_lists

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (Survival guide for assistant professors and PhD students)

advice ai assistant-professor awesome-list gpu grants-search

Last synced: 29 Jun 2024

https://github.com/Jerc007/Open-GPGPU-FlexGrip-

FlexGripPlus: an open-source GPU model for reliability evaluation and microarchitectural simulation

gpu microarchitecture simulation-model

Last synced: 29 Jun 2024

https://github.com/wavefunction91/GauXC

GauXC is a modern, modular C++ library for the evaluation of quantities related to the exchange-correlation (XC) energy (e.g. the potential) in the Gaussian basis set discretization of Kohn-Sham density functional theory (KS-DFT) on heterogeneous architectures.

dft gpu integrator

Last synced: 28 Jun 2024

https://github.com/jerryji1993/DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

deep-learning dnabert-model genome gpu kmer kmer-format machine-learning natural-language-processing nlp sequence

Last synced: 28 Jun 2024

https://github.com/JuliaAttic/CUDArt.jl

Julia wrapper for CUDA runtime API

cuda gpu julia

Last synced: 28 Jun 2024

https://github.com/JuliaGPU/GPUCompiler.jl

Reusable compiler infrastructure for Julia GPU backends.

compiler gpu hacktoberfest julia

Last synced: 28 Jun 2024

https://github.com/ZenitH-AT/nvidia-update

Checks for a new version of the NVIDIA driver, downloads and installs it.

driver gpu nvidia updater

Last synced: 27 Jun 2024

https://github.com/triagemd/tensorflow-builds

TensorFlow binaries and Docker images compiled with GPU support and CPU optimizations.

bazel cuda cudnn docker gpu machine-learning nvidia python tensorflow tensorflow-serving

Last synced: 27 Jun 2024

https://github.com/selkies-project/docker-nvidia-glx-desktop

KDE Plasma Desktop container designed for Kubernetes supporting OpenGL GLX and Vulkan for NVIDIA GPUs with WebRTC and HTML5, providing an open-source remote cloud graphics or game streaming platform. Spawns its own fully isolated X Server instead of using the host X Server, not requiring /tmp/.X11-unix host sockets or host configuration.

cloud-gaming docker docker-image game-streaming gpu gstreamer html5 kubernetes linux-gaming nvidia nvidia-docker nvidia-gpu opengl remote-access remote-control remote-desktop ubuntu vulkan webrtc wine

Last synced: 27 Jun 2024

https://github.com/harrism/hemi

Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.

c-plus-plus cuda cuda-device cuda-kernels gpu hemi

Last synced: 27 Jun 2024

https://github.com/AlexanderVeselov/RayTracing

Realtime GPU Path tracer based on OpenCL and OpenGL

3d cpp gpgpu gpu opencl opengl pathtracing pbr raytracing render

Last synced: 27 Jun 2024

https://github.com/Yours3lf/rpi-vk-driver

VK driver for the Raspberry Pi (Broadcom Videocore IV)

broadcom broadcom-videocore-iv driver gpu raspberry-pi raspberrypi rpi-vk-driver videocore-iv

Last synced: 26 Jun 2024

https://github.com/taichi-dev/difftaichi

10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)

differentiable-programming gpu graphics robotics simulation taichi

Last synced: 26 Jun 2024

https://github.com/mitmul/pynvvl

A Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python

cuda cupy gpu numpy nvidia-video-loader nvvl python video video-processing

Last synced: 26 Jun 2024

https://github.com/coderobe/VBiosFinder

Extract embedded VBIOS from (almost) any BIOS Update

bios gpu hacktoberfest hardware linux pci pci-passthrough uefi vbios

Last synced: 25 Jun 2024

https://github.com/codingonion/awesome-cuda-tensorrt-fpga

🔥🔥🔥 A collection of some awesome public NVIDIA CUDA, cuBLAS, cuDNN, TensorRT, AMD ROCm and FPGA projects.

awesome blas cublas cuda cudnn fpga gpu hdl large-language-models llama3 llm mojo nvidia pytorch tensorrt web3 yolo yolov10 yolov5 zkp

Last synced: 24 Jun 2024

https://github.com/cair/tmu

Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features, drop clause, Type III Feedback, focused negative sampling, multi-task classifier, autoencoder, literal budget, and one-vs-one multi-class classifier. TMU is written in Python with wrappers for C and CUDA-based clause evaluation and updating.

absorbing-states autoencoder convolution cuda gpu incremental incremental-computation multi-output pattern-recognition propositional-logic regression relational-logic sparse tsetlin-machine

Last synced: 24 Jun 2024

https://github.com/NVIDIA-Genomics-Research/GenomeWorks

SDK for GPU accelerated genome assembly and analysis

alignment cuda genomics gpu mapping nvidia partial-order-alignment poa python-api

Last synced: 23 Jun 2024

https://github.com/Tencent/TurboTransformers

A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.

albert bert decoder gpt2 gpu huggingface-transformers inference machine-translation nlp pytorch roberta transformer

Last synced: 22 Jun 2024

https://intel.github.io/scikit-learn-intelex/

Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application

ai-inference ai-machine-learning ai-training analytics big-data data-analysis gpu intel machine-learning machine-learning-algorithms oneapi python scikit-learn swrepo

Last synced: 21 Jun 2024

https://tlkh.github.io/asitop/

Perf monitoring CLI tool for Apple Silicon

apple-silicon cli cpu gpu m1 macos

Last synced: 21 Jun 2024

https://github.com/binga/cloud-gpus

This repository contains information about Cloud GPU offerings for Machine Learning practitioners.

cloud-gpu colaboratory credits deep-learning gpu kaggle-scripts machine-learning

Last synced: 20 Jun 2024

https://github.com/pmndrs/detect-gpu

Classifies GPUs based on their 3D rendering benchmark score allowing the developer to provide sensible default settings for graphically intensive applications.

adaptive babylonjs benchmarks browser demo detection device gpu hardware pixijs progressive-enhancement threejs webgl webgl2

Last synced: 20 Jun 2024

https://github.com/microsoft/Microsoft-Rocket-Video-Analytics-Platform

A highly extensible software stack to empower everyone to build practical real-world live video analytics applications for object detection and counting with cutting edge machine learning algorithms.

azure counting docker dotnet-core edge-computing gpu object-detection tensorflow video-analytics yolov3

Last synced: 19 Jun 2024

https://github.com/harujoh/KelpNet

Pure C# machine learning framework

csharp deep-learning dotnet gpu machine-learning neural-network onnx opencl

Last synced: 19 Jun 2024

https://github.com/m4rs-mt/ILGPU

ILGPU JIT Compiler for high-performance .NET GPU programs

amd cil compiler cpu cuda dotnet gpgpu gpgpu-computing gpu ilgpu intel jit kernels msil nvidia opencl parallel ptx

Last synced: 19 Jun 2024

https://github.com/discosultan/VulkanCore

Vulkan 1.0 graphics and compute API bindings for .NET Standard

c-sharp dotnet-standard gpgpu gpu graphics netstandard vulkan

Last synced: 19 Jun 2024

https://github.com/keijiro/StableFluids

A straightforward GPU implementation of Jos Stam's "Stable Fluids" on Unity.

compute fluid-dynamics gpu shader unity unity3d

Last synced: 19 Jun 2024

https://github.com/hpcaitech/FastFold

Optimizing AlphaFold Training and Inference on GPU Clusters

alphafold2 cuda evoformer gpu habana-gaudi parallelism protein-folding protein-structure pytorch

Last synced: 18 Jun 2024

https://github.com/rreusser/regl-gpu-lines

Pure GPU, instanced, screen-projected lines for regl

gpu lines regl shaders webgl

Last synced: 18 Jun 2024

https://github.com/bh107/bohrium

Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX

cuda gpu gpu-acceleration multi-core numpy opencl parallel-computing

Last synced: 17 Jun 2024

https://github.com/casact/rp-bnn-claims

Individual Claims Forecasting with Bayesian Mixture Density Networks

actuarial actuarial-data gpu reserving

Last synced: 16 Jun 2024

https://github.com/pytorch/torchrec

PyTorch domain library for recommendation systems

cuda deep-learning gpu pytorch recommendation-system recommender-system sharding

Last synced: 16 Jun 2024

https://github.com/DeepRec-AI/HybridBackend

A high-performance framework for training wide-and-deep recommender systems on heterogeneous clusters

deep-learning gpu hybrid-parallelism parquet recommender-system

Last synced: 16 Jun 2024

http://diffsharp.github.io/DiffSharp/

DiffSharp: Differentiable Functional Programming

autodiff deep-learning dotnet gpu machine-learning neural-network tensor

Last synced: 16 Jun 2024

https://github.com/AstroAccelerateOrg/astro-accelerate

AstroAccelerate is a many-core accelerated software package for processing time-domain radio-astronomy data.

cuda gpu radio-astronomy

Last synced: 16 Jun 2024

https://github.com/pritul2/yolov5_FaceMask

Detects whether a person is wearing a face mask. Trained using YOLOv5.

covid-19 cpu face-mask-detection gpu yolov5

Last synced: 15 Jun 2024

https://github.com/yasenh/libtorch-yolov5

A LibTorch inference implementation of YOLOv5

gpu libtorch yolov5

Last synced: 15 Jun 2024

https://github.com/intel/intel-technology-enabling-for-openshift

This project enables Intel® platform technologies (SGX, QAT) and GPUs on Red Hat OpenShift Container Platform

accelerator cloud datacenter device-plugin gpu kmm kubernetes nfd openshift operator qat sgx xeon yaml

Last synced: 15 Jun 2024

https://github.com/bheisler/RustaCUDA

Rusty wrapper for the CUDA Driver API

cuda cuda-api gpu rust

Last synced: 14 Jun 2024

https://github.com/CRAFT-THU/BSim

High Performance Simulation of Spiking Neural Network on GPGPUs

gpu simulator snn

Last synced: 14 Jun 2024

https://github.com/NVIDIA/gpu-operator

NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

cuda gpu kubernetes nvidia

Last synced: 14 Jun 2024

https://github.com/Tencent/Forward

A library for high performance deep learning inference on NVIDIA GPUs.

cuda deep-learning forward gpu inference inference-engine keras neural-network onnx pytorch tensorflow tensorrt

Last synced: 14 Jun 2024

https://github.com/vectorch-ai/ScaleLLM

A high-performance inference system for large language models, designed for production environments.

cuda efficiency gpu inference llama llama3 llm llm-inference model performance production serving speculative transformer

Last synced: 13 Jun 2024

https://github.com/schrodinger/gpusimilarity

A CUDA/Thrust implementation of fingerprint similarity searching

cheminformatics chemistry gpu similarity-analysis

Last synced: 13 Jun 2024

https://github.com/scverse/rapids_singlecell

Rapids_singlecell: A GPU-accelerated tool for scRNA analysis. Offers seamless scverse compatibility for efficient single-cell data processing and analysis.

anndata bioinformatics gpu scverse single-cell

Last synced: 13 Jun 2024

https://github.com/chembl/FPSim2

Simple package for fast molecular similarity searches

cheminformatics chemistry gpu python similarity-search

Last synced: 13 Jun 2024

https://github.com/OpenBMB/BMInf

Efficient Inference for Big Models

deep-learning gpu pretrained-language-models

Last synced: 13 Jun 2024

https://github.com/Ipotrick/Daxa

Daxa is a convenient, simple and modern GPU abstraction built on Vulkan

gpu vulkan

Last synced: 13 Jun 2024

https://github.com/andylamp/gpurelperf

A handy utility to find relative GPU performance quickly in multi-GPU boxes

gpu load-balancing mxnet

Last synced: 12 Jun 2024

https://github.com/m1k1o/go-transcode

On-demand transcoding origin server for live inputs and static files in Go using ffmpeg. Also with NVIDIA GPU hardware acceleration.

demand-transcoding docker ffmpeg golang gpu live-streaming nvidia-cuda streams transcoding

Last synced: 12 Jun 2024

https://github.com/NVIDIA/DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch

Last synced: 12 Jun 2024