https://github.com/enhuiz/pytorch-training-utilities
A set of PyTorch training utilities (WIP).
https://github.com/enhuiz/pytorch-training-utilities
Last synced: about 2 months ago
JSON representation
A set of PyTorch training utilities (WIP).
- Host: GitHub
- URL: https://github.com/enhuiz/pytorch-training-utilities
- Owner: enhuiz
- License: mit
- Created: 2022-09-22T02:14:07.000Z (over 2 years ago)
- Default Branch: deepspeed
- Last Pushed: 2023-05-11T15:44:29.000Z (about 2 years ago)
- Last Synced: 2025-04-03T06:42:04.002Z (2 months ago)
- Language: Python
- Homepage:
- Size: 74.2 KB
- Stars: 6
- Watchers: 2
- Forks: 8
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# PyTorch Training Utilities (WIP)
This is a collection of PyTorch training utilities that I am currently using. Please note that the interfaces may change frequently. If you have checked out the correct commit, you should not encounter any issues.
To run the training utility, use one of the following commands:
```
deepspeed .py
```or
```
python .py
```## Warning
- Be cautious when using the deepspeed branch to train GANs. The gradients of different engines must be properly managed, as the zero_grad function is not called before feedforward to support gradient accumulation.