Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/tweska/cern-ssp
CERN Summer Student Project
https://github.com/tweska/cern-ssp
Last synced: 10 days ago
JSON representation
CERN Summer Student Project
- Host: GitHub
- URL: https://github.com/tweska/cern-ssp
- Owner: tweska
- Created: 2024-07-04T14:07:12.000Z (4 months ago)
- Default Branch: master
- Last Pushed: 2024-09-16T12:16:04.000Z (about 2 months ago)
- Last Synced: 2024-09-17T07:12:07.751Z (about 2 months ago)
- Language: C++
- Size: 170 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# ROOT GPU Optimization Experiments for High Energy Physics Analysis
## Dependencies and Compilers
| Package | Version | Notes |
|---------------------------------------|----------------------------------------------------------------------------------------------------------|-------------------|
| ROOT | 6.33.01 ([5b7b6cc0](https://github.com/root-project/root/tree/5b7b6cc0253a145ee59b8890b6e26d81ac7198b4)) | Build from source |
| Google Test | 1.15.2 ([b514bdc8](https://github.com/google/googletest/tree/b514bdc898e2951020cbdca1304b75f5950d1f59)) | Build from source |
| GNU Compiler Collection (`gcc`/`g++`) | 12.3.0 | |
| CUDA Toolkit (includes `nvcc`) | 12.5 | |## Performance Results
The following results are obtained on a machine with an `AMD Ryzen 7 5700g`
processor and `NVIDIA GeForce RTX 3060` GPU, by running the
`./runBenchmarks.sh` script in the root directory.Note that some runtimes might not add up to the total time reported. This is
due to the intermediate times being rounded.### Batched Histogram
| | CPU Runtime | GPU Runtime | GPU Percentage | Speedup |
|:----------:|------------:|------------:|---------------:|--------:|
| Transfer | N/A | 543ms | 97.8% | N/A |
| Fill | 3383ms | 12ms | 2.2% | 281.9x |
| Result | N/A | 0ms | 0.0% | N/A |
| Total | 3383ms | 555ms | 100.0% | 6.1x |### DiMuon
| | CPU Runtime | GPU Runtime | GPU Percentage | Speedup |
|:-------------:|------------:|------------:|---------------:|--------:|
| Transfer | N/A | 163ms | 58.3% | N/A |
| Define + Fill | 788ms | 116ms | 41.7% | 6.8x |
| Result | N/A | 0ms | 0.0% | N/A |
| Total | 788ms | 278ms | 100.0% | 2.8x |### FoldedWMass
| | CPU Runtime | GPU Runtime | GPU Percentage | Speedup |
|:-------------:|------------:|------------:|---------------:|--------:|
| Transfer | N/A | 0ms | 0.0% | N/A |
| Define + Fill | 17792ms | 172ms | 98.9% | 103.4x |
| Result | N/A | 1ms | 0.6% | N/A |
| Total | 17792ms | 174ms | 100.0% | 102.3x |