https://github.com/yuhui-zh15/gpu-smart
Interactive Automatic GPU Manager
https://github.com/yuhui-zh15/gpu-smart
cuda cudnn deep-learning gpu manager neural-network nvidia pytorch tensorflow
Last synced: 12 months ago
JSON representation
Interactive Automatic GPU Manager
- Host: GitHub
- URL: https://github.com/yuhui-zh15/gpu-smart
- Owner: yuhui-zh15
- License: gpl-3.0
- Created: 2018-08-22T22:22:11.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2018-08-23T07:18:11.000Z (almost 8 years ago)
- Last Synced: 2025-06-06T20:12:04.843Z (about 1 year ago)
- Topics: cuda, cudnn, deep-learning, gpu, manager, neural-network, nvidia, pytorch, tensorflow
- Language: Python
- Homepage:
- Size: 26.4 KB
- Stars: 8
- Watchers: 2
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# GPU-Smart: Interactive Automatic GPU Manager
###### Author: Yuhui Zhang, yuhui-zh15@mails.tsinghua.edu.cn, www.yuhui.ac.cn, Tsinghua University
## Version 1.1
### Usage
`python3 main.py`
### Requirements
gpustat==0.4.1
### Advantages:
- [x] Interactive and easy to use
- [x] Simply add bash command into your waiting lists
- [x] Automatically detect and select idle GPU to run command in your waiting lists
- [x] No need to modify any part of your code
- [x] Clearly show waiting lists and running history
- [x] Concisely show GPU status: memory, temperature, usage, etc.
- [x] Allow manually set number of GPUs reserved for others
### Menu
```
Welcome to Smart GPU Queue
--------------------------
[1] New Command
[2] Running History
[3] Waiting List
[4] GPU Status
[5] Reserve Number
--------------------------
```
### New Command
```
Please input command
python idle.py
```
### Running History
```
{GPU ID}{Enqueue Time->Dequeue Time}: Command
[0]{GPU: 0}(Thu Aug 23 08:10:54 2018->Thu Aug 23 08:11:56 2018): python idle.py
[1]{GPU: 1}(Thu Aug 23 08:10:58 2018->Thu Aug 23 08:12:59 2018): python idle.py
[2]{GPU: 2}(Thu Aug 23 08:11:02 2018->Thu Aug 23 08:14:01 2018): python idle.py
[3]{GPU: 3}(Thu Aug 23 08:11:06 2018->Thu Aug 23 08:15:03 2018): python idle.py
```
### Waiting List
```
{Enqueue Time}: Command
[0](Thu Aug 23 08:10:54 2018): python idle.py
[1](Thu Aug 23 08:10:58 2018): python idle.py
[2](Thu Aug 23 08:11:02 2018): python idle.py
[3](Thu Aug 23 08:11:06 2018): python idle.py
```
### GPU Status
```
Thu Aug 23 10:27:52 2018
[0] Tesla K20c | 43'C, 0 % | 0 / 4742 MB |
[1] Tesla K20c | 44'C, 0 % | 0 / 4742 MB |
[2] Tesla K20c | 43'C, 0 % | 0 / 4742 MB |
[3] Tesla K20c | 39'C, 96 % | 0 / 4742 MB |
```
### Reserve Number
```
Current Reserve Number: 0
Please input how many GPUs you want to reserve
2
Set successfully
```