https://github.com/zwliew/readings
- Host: GitHub
- URL: https://github.com/zwliew/readings
- Owner: zwliew
- Created: 2025-07-02T06:01:02.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2025-07-21T05:36:03.000Z (11 months ago)
- Last Synced: 2025-07-21T07:24:15.015Z (11 months ago)
- Size: 6.84 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Readings
## Memory models
* [A Formal Analysis of the NVIDIA PTX Memory Consistency Model](https://dl.acm.org/doi/10.1145/3297858.3304043)
* [RISC-V Unprivileged ISA Manual: Chapter 17 and Appendix A on RVWMO](https://riscv.github.io/riscv-isa-manual/snapshot/unprivileged/#memorymodel)
## Memory barriers
* [Memory Barriers: a Hardware View for Software Hackers](http://www.rdrop.com/users/paulmck/scalability/paper/whymb.2010.07.23a.pdf)
* [Linux Kernel Memory Barriers](https://www.kernel.org/doc/Documentation/memory-barriers.txt)
## Parallel programming
* [Rust Atomics and Locks](https://marabos.nl/atomics/)
* [Is Parallel Programming Hard, And, If So, What Can You Do About It?](https://www.kernel.org/pub/linux/kernel/people/paulmck/perfbook/perfbook.html)
## GPGPU ISA and uarch design
* [Vortex GPGPU ISA](https://arxiv.org/pdf/2110.10857)
* [Analyzing Modern NVIDIA GPU cores](https://arxiv.org/abs/2503.20481)
* [Apple G13 GPU Architecture Reference](https://dougallj.github.io/applegpu/docs.html)
* [MTIA: First Generation Silicon Targeting Meta’s Recommendation Systems](http://firoozshahian.com/publications/3579371.3589348.pdf)
* [Meta’s Second Generation AI Chip: Model-Chip Co-Design and Productionization Experiences](https://dl.acm.org/doi/pdf/10.1145/3695053.3731409)
## GPGPU compiler design
* [Convergence and Scalarization for Data-Parallel Architectures](https://research.nvidia.com/index.php/publication/2013-02_convergence-and-scalarization-data-parallel-architectures)
## DL compiler design
* [Triton: An Intermediate Language and Compiler for Tiled Neural Network Computations](https://www.eecs.harvard.edu/~htk/publication/2019-mapl-tillet-kung-cox.pdf)
## Sanitizers
* [AddressSanitizer: A Fast Address Sanity Checker](https://www.usenix.org/system/files/conference/atc12/atc12-final39.pdf)
* [ThreadSanitizer - data race detection in practice](https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/35604.pdf)
* [Dynamic Race Detection with LLVM Compiler](https://static.googleusercontent.com/media/research.google.com/en//pubs/archive/37278.pdf)