Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by NVIDIA

A curated list of projects in awesome lists by NVIDIA .

https://github.com/NVIDIA/nvidia-docker

Build and run Docker containers leveraging NVIDIA GPUs

cuda docker gpu nvidia-docker

Last synced: 30 Jul 2024

https://github.com/NVIDIA/open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

Last synced: 30 Jul 2024

https://github.com/NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

computer-vision deep-learning drug-discovery forecasting large-language-models mxnet nlp paddlepaddle pytorch recommender-systems speech-recognition speech-synthesis tensorflow tensorflow2 translation

Last synced: 31 Jul 2024

https://github.com/NVIDIA/FastPhotoStyle

Style transfer, deep learning, feature transform

Last synced: 01 Aug 2024

https://github.com/NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

asr deeplearning generative-ai large-language-models machine-translation multimodal neural-networks speaker-diariazation speaker-recognition speech-synthesis speech-translation tts

Last synced: 30 Jul 2024

https://github.com/NVIDIA/TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

deep-learning gpu-acceleration inference nvidia tensorrt

Last synced: 31 Jul 2024

https://github.com/NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

large-language-models model-para transformers

Last synced: 31 Jul 2024

https://github.com/NVIDIA/vid2vid

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

Last synced: 31 Jul 2024

https://github.com/NVIDIA/apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Last synced: 01 Aug 2024

https://github.com/NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Last synced: 30 Jul 2024

https://nvidia.github.io/TensorRT-LLM/

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Last synced: 31 Jul 2024

https://github.com/NVIDIA/cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

cuda cuda-driver-api cuda-kernels cuda-opengl

Last synced: 31 Jul 2024

https://github.com/nvidia/fastertransformer

Transformer related optimization, including BERT, GPT

bert gpt pytorch transformer

Last synced: 02 Aug 2024

https://github.com/NVIDIA/FasterTransformer

Transformer related optimization, including BERT, GPT

bert gpt pytorch transformer

Last synced: 31 Jul 2024

https://github.com/NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

cpp cuda deep-learning deep-learning-library gpu nvidia

Last synced: 30 Jul 2024

https://github.com/NVIDIA/DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch

Last synced: 30 Jul 2024

https://github.com/NVIDIA/tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Last synced: 31 Jul 2024

https://github.com/NVIDIA/thrust

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

algorithms cpp cpp11 cpp14 cpp17 cpp20 cuda cxx cxx11 cxx14 cxx17 cxx20 gpu gpu-computing nvidia nvidia-hpc-sdk thrust

Last synced: 30 Jul 2024

https://github.com/NVIDIA/DIGITS

Deep Learning GPU Training System

caffe deep-learning gpu machine-learning torch

Last synced: 30 Jul 2024

https://github.com/NVIDIA/warp

A Python framework for high performance GPU simulation and graphics

Last synced: 01 Aug 2024

https://github.com/NVIDIA/NeMo-Guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Last synced: 31 Jul 2024

https://github.com/NVIDIA/flownet2-pytorch

Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Last synced: 31 Jul 2024

https://github.com/NVIDIA/nccl

Optimized primitives for collective multi-GPU communication

Last synced: 02 Aug 2024

https://github.com/NVIDIA/k8s-device-plugin

NVIDIA device plugin for Kubernetes

kubernetes

Last synced: 01 Aug 2024

https://nvidia.github.io/libcudacxx/

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

cpp cpp11 cpp14 cpp17 cpp20 cpp23 cuda cxx cxx11 cxx14 cxx17 cxx20 cxx23 gpu libcxx llvm nvidia nvidia-hpc-sdk standard std

Last synced: 01 Aug 2024

https://github.com/NVIDIA/libcudacxx

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

cpp cpp11 cpp14 cpp17 cpp20 cpp23 cuda cxx cxx11 cxx14 cxx17 cxx20 cxx23 gpu libcxx llvm nvidia nvidia-hpc-sdk standard std

Last synced: 02 Aug 2024

https://github.com/NVIDIA/waveglow

A Flow-based Generative Network for Speech Synthesis

Last synced: 31 Jul 2024

https://github.com/NVIDIA/GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

gpu-acceleration large-language-models llm llm-inference microservice nemo rag retrieval-augmented-generation tensorrt triton-inference-server

Last synced: 31 Jul 2024

https://github.com/NVIDIA/Stable-Diffusion-WebUI-TensorRT

TensorRT Extension for Stable Diffusion Web UI

Last synced: 31 Jul 2024

https://github.com/NVIDIA/semantic-segmentation

Nvidia Semantic Segmentation monorepo

Last synced: 02 Aug 2024

https://github.com/NVIDIA/gpu-operator

NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

cuda gpu kubernetes nvidia

Last synced: 02 Aug 2024

https://github.com/NVIDIA/cub

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

algorithms cpp cpp11 cpp14 cpp17 cpp20 cub cuda cxx cxx11 cxx14 cxx17 cxx20 gpu nvidia nvidia-hpc-sdk

Last synced: 31 Jul 2024

https://github.com/NVIDIA/nvidia-container-toolkit

Build and run containers leveraging NVIDIA GPUs

Last synced: 01 Aug 2024

https://github.com/NVIDIA/CUDALibrarySamples

CUDA Library Samples

Last synced: 04 Aug 2024

https://github.com/NVIDIA/trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Last synced: 31 Jul 2024

https://github.com/NVIDIA/stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Last synced: 02 Aug 2024

https://github.com/NVIDIA/VideoProcessingFramework

Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions

Last synced: 31 Jul 2024

https://github.com/NVIDIA/open-gpu-doc

Documentation of NVIDIA chip/hardware interfaces

Last synced: 01 Aug 2024

https://github.com/NVIDIA/deepops

Tools for building GPU clusters

Last synced: 01 Aug 2024

https://github.com/NVIDIA/MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

cuda gpgpu gpu gpu-computing hpc

Last synced: 31 Jul 2024

https://github.com/NVIDIA/nvidia-container-runtime

NVIDIA container runtime

Last synced: 01 Aug 2024

https://github.com/NVIDIA/sentiment-discovery

Unsupervised Language Modeling at scale for robust sentiment classification

Last synced: 03 Aug 2024

https://github.com/NVIDIA/gpu-monitoring-tools

Tools for monitoring NVIDIA GPUs on Linux

Last synced: 01 Aug 2024

https://github.com/NVIDIA/tensorflow

An Open Source Machine Learning Framework for Everyone

Last synced: 31 Jul 2024

https://github.com/NVIDIA/jetson-gpio

A Python library that enables the use of Jetson's GPIOs

Last synced: 03 Aug 2024

https://github.com/NVIDIA/flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

speech-synthesis

Last synced: 02 Aug 2024

https://github.com/NVIDIA/retinanet-examples

Fast and accurate object detection with end-to-end GPU optimization

deep-learning neural-network object-detection python pytorch retinanet tensorrt

Last synced: 07 Aug 2024

https://github.com/NVIDIA/modulus

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

deep-learning machine-learning nvidia-gpu physics pytorch

Last synced: 31 Jul 2024

https://github.com/NVIDIA/cuda-python

CUDA Python Low-level Bindings

Last synced: 31 Jul 2024

https://github.com/NVIDIA/gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

gpu-memory gpudirect-rdma kernel-mode-driver libraries linux nvidia

Last synced: 02 Aug 2024

https://github.com/NVIDIA/libnvidia-container

NVIDIA container runtime library

Last synced: 01 Aug 2024

https://github.com/NVIDIA/dcgm-exporter

NVIDIA GPU metrics exporter for Prometheus leveraging DCGM

Last synced: 03 Aug 2024

https://github.com/NVIDIA/nccl-tests

NCCL Tests

Last synced: 04 Aug 2024

https://github.com/NVIDIA/spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

big-data gpu rapids spark

Last synced: 01 Aug 2024

https://github.com/NVIDIA/nv-wavenet

Reference implementation of real-time autoregressive wavenet inference

Last synced: 02 Aug 2024

https://github.com/NVIDIA/nvvl

A library that uses hardware acceleration to load sequences of video frames to facilitate machine learning training

Last synced: 01 Aug 2024

https://github.com/NVIDIA/caffe

Caffe: a fast open framework for deep learning.

Last synced: 01 Aug 2024

https://github.com/NVIDIA/runx

Deep Learning Experiment Management

Last synced: 02 Aug 2024

https://github.com/NVIDIA/enroot

A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.

Last synced: 01 Aug 2024

https://github.com/NVIDIA/multi-gpu-programming-models

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Last synced: 01 Aug 2024

https://github.com/NVIDIA/PyProf

A GPU performance profiling tool for PyTorch models

Last synced: 04 Aug 2024

https://github.com/NVIDIA/libglvnd

The GL Vendor-Neutral Dispatch library

Last synced: 30 Jul 2024

https://github.com/NVIDIA/MDL-SDK

NVIDIA Material Definition Language SDK

Last synced: 02 Aug 2024

https://github.com/NVIDIA/nvbench

CUDA Kernel Benchmarking Library

benchmark cuda cuda-kernels gpu kernel-benchmark nvidia performance

Last synced: 04 Aug 2024

https://github.com/NVIDIA/NeMo-Aligner

Scalable toolkit for efficient model alignment

Last synced: 01 Aug 2024

https://github.com/NVIDIA/NvPipe

NVIDIA-accelerated zero latency video compression library for interactive remoting applications

Last synced: 03 Aug 2024

https://github.com/NVIDIA/cuda-quantum

C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

Last synced: 03 Aug 2024

https://github.com/NVIDIA/video-sdk-samples

Samples demonstrating how to use various APIs of NVIDIA Video Codec SDK

Last synced: 31 Jul 2024

https://github.com/NVIDIA/cuQuantum

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

cuda cuquantum custatevec cutensornet nvidia quantum-computing

Last synced: 01 Aug 2024

https://github.com/NVIDIA/cnmem

A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory

Last synced: 30 Jul 2024

https://github.com/NVIDIA/egl-wayland

The EGLStream-based Wayland external platform

Last synced: 03 Aug 2024

https://github.com/NVIDIA/NeMo-Curator

Scalable toolkit for data curation

Last synced: 08 Aug 2024

https://github.com/NVIDIA/NeMo-text-processing

NeMo text processing for ASR and TTS

inverse-text-n text-normalization

Last synced: 07 Aug 2024

https://github.com/NVIDIA/container-canary

A tool for testing and validating container requirements against versioned manifests

automation ci containers docker kubernetes podman utilities versioning

Last synced: 01 Aug 2024

https://github.com/NVIDIA/VisRTX

NVIDIA OptiX based implementation of ANARI

Last synced: 03 Aug 2024

https://github.com/NVIDIA/transformer-ls

Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).

efficient-transformers long-sequence transformer vision-transformer

Last synced: 03 Aug 2024

https://github.com/NVIDIA/grcuda

Polyglot CUDA integration for the GraalVM

Last synced: 01 Aug 2024

https://github.com/NVIDIA/ContrastiveLosses4VRD

Implementation for the CVPR2019 paper "Graphical Contrastive Losses for Scene Graph Generation"

Last synced: 01 Aug 2024

https://github.com/NVIDIA/k8s-dra-driver

Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes

Last synced: 31 Jul 2024

https://github.com/NVIDIA/JAX-Toolbox

JAX-Toolbox

Last synced: 04 Aug 2024

https://github.com/NVIDIA/nvtrust

Ancillary open source software to support confidential computing on NVIDIA GPUs

Last synced: 31 Jul 2024

https://github.com/NVIDIA/gds-nvidia-fs

NVIDIA GPUDirect Storage Driver

Last synced: 01 Aug 2024

https://github.com/NVIDIA/GMAT

A toolkit showing GPU's all-round capability in video processing

codec cpp cuda deep-learning ffmpeg gpu image-processing nvidia video video-codec

Last synced: 01 Aug 2024

https://github.com/NVIDIA/earth2mip

Earth-2 Model Intercomparison Project (MIP) is a python framework that enables climate researchers and scientists to inter-compare AI models for weather and climate.

climate deep-learning weather

Last synced: 01 Aug 2024

https://github.com/NVIDIA/mig-parted

MIG Partition Editor for NVIDIA GPUs

Last synced: 02 Aug 2024

https://github.com/NVIDIA/Deep-Learning-Accelerator-SW

NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.

Last synced: 31 Jul 2024

https://github.com/NVIDIA/ipyparaview

iPython widget for server-side ParaView rendering in Jupyter.

Last synced: 30 Jul 2024

https://github.com/NVIDIA/nvtx-plugins

Python bindings for NVTX

Last synced: 05 Aug 2024

https://github.com/NVIDIA/eglexternalplatform

The EGL External Platform interface

Last synced: 03 Aug 2024

https://github.com/NVIDIA/cuda-profiler

Tools and extensions for CUDA profiling

Last synced: 01 Aug 2024