An open API service indexing awesome lists of open source software.

https://github.com/yuhui-zh15/gpu-smart

Interactive Automatic GPU Manager
https://github.com/yuhui-zh15/gpu-smart

cuda cudnn deep-learning gpu manager neural-network nvidia pytorch tensorflow

Last synced: 12 months ago
JSON representation

Interactive Automatic GPU Manager

Awesome Lists containing this project

README

          

# GPU-Smart: Interactive Automatic GPU Manager

###### Author: Yuhui Zhang, yuhui-zh15@mails.tsinghua.edu.cn, www.yuhui.ac.cn, Tsinghua University

## Version 1.1

### Usage

`python3 main.py`

### Requirements

gpustat==0.4.1

### Advantages:

- [x] Interactive and easy to use
- [x] Simply add bash command into your waiting lists
- [x] Automatically detect and select idle GPU to run command in your waiting lists
- [x] No need to modify any part of your code
- [x] Clearly show waiting lists and running history
- [x] Concisely show GPU status: memory, temperature, usage, etc.
- [x] Allow manually set number of GPUs reserved for others

### Menu
```
Welcome to Smart GPU Queue
--------------------------
[1] New Command
[2] Running History
[3] Waiting List
[4] GPU Status
[5] Reserve Number
--------------------------
```

### New Command
```
Please input command

python idle.py
```

### Running History
```
{GPU ID}{Enqueue Time->Dequeue Time}: Command
[0]{GPU: 0}(Thu Aug 23 08:10:54 2018->Thu Aug 23 08:11:56 2018): python idle.py
[1]{GPU: 1}(Thu Aug 23 08:10:58 2018->Thu Aug 23 08:12:59 2018): python idle.py
[2]{GPU: 2}(Thu Aug 23 08:11:02 2018->Thu Aug 23 08:14:01 2018): python idle.py
[3]{GPU: 3}(Thu Aug 23 08:11:06 2018->Thu Aug 23 08:15:03 2018): python idle.py
```

### Waiting List
```
{Enqueue Time}: Command
[0](Thu Aug 23 08:10:54 2018): python idle.py
[1](Thu Aug 23 08:10:58 2018): python idle.py
[2](Thu Aug 23 08:11:02 2018): python idle.py
[3](Thu Aug 23 08:11:06 2018): python idle.py
```

### GPU Status
```
Thu Aug 23 10:27:52 2018
[0] Tesla K20c | 43'C, 0 % | 0 / 4742 MB |
[1] Tesla K20c | 44'C, 0 % | 0 / 4742 MB |
[2] Tesla K20c | 43'C, 0 % | 0 / 4742 MB |
[3] Tesla K20c | 39'C, 96 % | 0 / 4742 MB |
```

### Reserve Number
```
Current Reserve Number: 0
Please input how many GPUs you want to reserve

2
Set successfully
```