Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nvidia/transformerengine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
- Host: GitHub
- URL: https://github.com/nvidia/transformerengine
- Owner: NVIDIA
- License: apache-2.0
- Created: 2022-09-20T15:20:26.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-05-22T18:52:38.000Z (4 months ago)
- Last Synced: 2024-05-22T19:11:43.076Z (4 months ago)
- Topics: cuda, deep-learning, fp8, gpu, jax, machine-learning, python, pytorch
- Language: Python
- Homepage: https://docs.nvidia.com/deeplearning/transformer-engine/user-guide/index.html
- Size: 4.38 MB
- Stars: 1,482
- Watchers: 32
- Forks: 229
- Open Issues: 121
Metadata Files:
- Readme: README.rst
- Contributing: CONTRIBUTING.rst
- License: LICENSE
- Security: SECURITY.md