An open API service indexing awesome lists of open source software.

https://github.com/nikhilrout/thetensorcoreproject

Microarchitecture implementation of Nvidia's Tensor Cores
https://github.com/nikhilrout/thetensorcoreproject

cuda floating-point gpgpu hybrid-precision-training tensorcore

Last synced: about 1 year ago
JSON representation

Microarchitecture implementation of Nvidia's Tensor Cores

Awesome Lists containing this project

README

          

# TheTensorCoreProject
Microarchitecture implementation of Nvidia's CUDA and Tensor Cores

## Tensor Core Versions
TensorCore v0 --> Volta Architecture (Hybrid Precision - FP16MUL FP32ADD) \
TensorCore v1 --> Ampere Architecture (TF32MUL FP32ADD / BF16MUL FP32ADD) \
TensorCore v2 --> Hopper Architecture (FP8 E5M2 / E4M3 FP16ADD)