Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/macaycz/nn

A lightweight, GPU-accelerated machine learning library built with CUDA.
https://github.com/macaycz/nn

cuda deep-learning gpu machine-learning neural-network

Last synced: about 2 hours ago
JSON representation

A lightweight, GPU-accelerated machine learning library built with CUDA.

Awesome Lists containing this project

README

        

## Resources

[Neural networks](https://www.youtube.com/watch?v=aircAruvnKk&list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi)
[Neural Network From Scratch](https://github.com/TheIndependentCode/Neural-Network)
[How to Implement Performance Metrics in CUDA C/C++](https://developer.nvidia.com/blog/how-implement-performance-metrics-cuda-cc/)
[Fast Dynamic Indexing of Private Arrays in CUDA](https://developer.nvidia.com/blog/fast-dynamic-indexing-private-arrays-cuda/)
[Tiny CUDA Neural Networks](https://github.com/NVlabs/tiny-cuda-nn)
[What’s the max register number that causes slowdown](https://forums.developer.nvidia.com/t/whats-the-max-register-number-that-causes-slowdown/234969/2)
[CUDA Crash Course](https://youtube.com/playlist?list=PLxNPSjHT5qvtYRVdNN1yDcdSl39uHV_sU&si=hfWfZ5R7q7iYNYvs)
[BFloat16: The secret to high performance on Cloud TPUs](https://cloud.google.com/blog/products/ai-machine-learning/bfloat16-the-secret-to-high-performance-on-cloud-tpus)
[Programming Tensor Cores in CUDA 9](https://developer.nvidia.com/blog/programming-tensor-cores-cuda-9/)
[CUDA C++ Programming Guide - Warp Matrix Functions](https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#wmma)
[Zen, CUDA, and Tensor Cores, Part I: The Silicon](https://www.computerenhance.com/p/zen-cuda-and-tensor-cores-part-i)