https://github.com/ikergarcia1996/matrix-benchmark

A cupy (GPU) / numpy benchmark to measure how fast different hardware can perform matrix operations.
https://github.com/ikergarcia1996/matrix-benchmark

benchmark cuda cupy embedding gpu matrix numpy python word-embeddings

Last synced: 3 days ago
JSON representation

A cupy (GPU) / numpy benchmark to measure how fast different hardware can perform matrix operations.

Host: GitHub
URL: https://github.com/ikergarcia1996/matrix-benchmark
Owner: ikergarcia1996
License: mit
Created: 2020-09-22T14:02:58.000Z (about 5 years ago)
Default Branch: master
Last Pushed: 2021-10-04T12:28:57.000Z (about 4 years ago)
Last Synced: 2025-01-10T23:45:29.611Z (9 months ago)
Topics: benchmark, cuda, cupy, embedding, gpu, matrix, numpy, python, word-embeddings
Language: Python
Homepage:
Size: 62.5 KB
Stars: 7
Watchers: 5
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Matrix Benchmark

A cupy (GPU) / numpy (Numpy) benchmark to measure how fast different hardware can perform matrix operations. The benchmark tests operations commonly used in the word embedding research field.

## Tests
* Matrix dot
* Squared Distance
* Euclidean distance
* K-nearest neighbours (dot)
* K-nearest neighbours (euclidean distance)

## Requeriments
* python3
* numpy
* cupy (GPU support)
* tqdm

## Usage
```
python3 run_benchmark.py
```
You can test different batch sizes with the --gpy_batch_sizes parameter
```
python3 run_benchmark.py --gpu_batch_sizes 100 500 1000 2000
```

You can test different matrix sizes with the --matrix_size parameter

```
python3 run_benchmark.py --gpu_batch_sizes 100 500 1000 2000 --matrix_size 10000
```

To run the kkn / euclian distance / squared distance benchmarks use the --full_benchmark flag

```
python3 run_benchmark.py --full_benchmark
```

You can run the benchmark in FP16, Fp32 or FP64 (FP16 not supported for knn in current cupy version)

```
python3 run_benchmark.py --fp16
python3 run_benchmark.py --fp32
python3 run_benchmark.py --fp64
```

## Sample Output
```
---> Running benchmark <---
Device: GeForce RTX 2080 SUPER. FP32. Matrix size: 10000 x 300

Running dot task. Batch size: 1000. fp32. Time: 0.3407488663991292 seconds.
Running squared_distance task. Batch size: 1000. fp32. Time: 3.1240174770355225 seconds.
Running euclidean_distance task. Batch size: 1000. fp32. Time: 3.0890189011891684 seconds.
Running knn_dot task. Batch size: 1000. fp32. Time: 1.4538000424702961 seconds.
Running knn_euclidean_distance task. Batch size: 1000. fp32. Time: OUT OF MEMORY
```

## Benchmarks

![alt text](Results.png "Benchmarks")

To reproduce results:
```
python3 run_benchmark.py --gpu_batch_sizes 2000 --matrix_size 50000 --fp 16
python3 run_benchmark.py --gpu_batch_sizes 2000 --matrix_size 50000 --fp 32
python3 run_benchmark.py --gpu_batch_sizes 2000 --matrix_size 50000 --fp 64
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ikergarcia1996/matrix-benchmark

Awesome Lists containing this project

README