Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/GreenAITorch/GATorch
GATorch is a tool seamlessly integrated with PyTorch that enables ML developers to generate an energy consumption report. By attaching your model, the tool automatically tracks the energy consumption of your model's training and generates graphs and plots to gain in-depth insights into the energy consumption of your model.
https://github.com/GreenAITorch/GATorch
energy-consumption green-ai pytorch sustainability
Last synced: 2 months ago
JSON representation
GATorch is a tool seamlessly integrated with PyTorch that enables ML developers to generate an energy consumption report. By attaching your model, the tool automatically tracks the energy consumption of your model's training and generates graphs and plots to gain in-depth insights into the energy consumption of your model.
- Host: GitHub
- URL: https://github.com/GreenAITorch/GATorch
- Owner: GreenAITorch
- License: mit
- Created: 2023-03-31T10:02:35.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-04-01T16:41:05.000Z (almost 2 years ago)
- Last Synced: 2024-10-17T12:39:30.958Z (3 months ago)
- Topics: energy-consumption, green-ai, pytorch, sustainability
- Language: Python
- Homepage:
- Size: 358 KB
- Stars: 8
- Watchers: 2
- Forks: 1
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-green-ai - GATorch - Aware PyTorch Extension.<br> ![Linux](https://img.shields.io/badge/Linux-black?style=flat&logo=linux) ![GPU](https://img.shields.io/badge/GPU-black?style=flat&logo=nvidia) (🛠Tools / Code-Based Tools)
README
# GAtorch
Green AI torch tries to create awareness over energy consumption within the Pytorch ML framework. Its goal is to measure energy consumption throughout the complete process of AI engineers and can give overviews and indications for performance gains with respect to energy consumption.
Currently it supports energy measurement of the training passes per layer.
You can find the full API documentation [here](https://gatorch.readthedocs.io/en/latest/).
# Installation
You can install GATorch using pip.
To install GATorch, at the command line, run:```bash
pip install GATorch
```# Source Installation
- cuda >=11.7
- cuDNN >=8Create a virtualenv like `virtualenv .venv` and activate it using `source ./.venv/bin/activate` and install the other requirements with `pip install requirements.txt`.
## Basic example
```python
from GA import GA# Create the profiler object and attach a model to it
ga_measure = GA()
ga_measure.attach_model(model)# Let's try to do a single forward pass and a backward pass
x = torch.zeros([1, 1, 28, 28]).to(device)
y = torch.zeros([1, 10]).to(device)
pred = model(x) # forwardloss = loss_fn(pred, y)
optimizer.zero_grad()
loss.backward() # backward
optimizer.step()# Now lets print the mean measurements
print(ga_measure.get_mean_measurements())
```or run the example scripts in the `examples` directory.
## Compatibility
Some older hardware might not support energy consumption measurements:
- NVML requires Tesla Architecture NVIDIA GPUs or newer to work.
- RAPLs DRAM measurements are only available for XENON CPUs.In case you get compatibility errors due to older hardware you can disable the failing measurement application, use the `disable_measurements` parameter in the `attach_model` function. This parameter accepts a list of disabled measurements out of `['cpu', 'ram', 'gpu']`, default is `[]`. You need to use at least one measurement that is not disabled. The program will indicate that the disabled devices are unavailable.
## Permissions
Due to [Platypus attack](https://platypusattack.com) Intel RAPL requires root permission for energy readings. In order to run this program with the correct permissions, do NOT make Intel RAPL readable for any user as this introduces vulnerability. Instead use Python with sudo instead:
```bash
sudo ./.venv/bin/python .py
```## Tensorboard
This tool can automatically generate energy consumption reports and display these in Tensorboard. To use Tensorboard run `tensorboard --logdir=runs` and open the browser to view the graphs. This tool further allows for custom graph generation and tensorboard integration, but is not complete and needs to be extended.
# Roadmap
The current architecture of this tool uses the integrated hooks of the PyTorch library, which restricts the current implementation towards the final goal of complete coverage including data loading, pre-processing, saving and loading a model etc. To give a more thorough analysis of the impact of energy consumption in ML development, this still needs to be developed.
This tool differs from other tools by measuring in-depth layers and system components and could be expanded to provide energy consumption data that can lead to recommendations for eliminating certain layers due to high energy consumption compared to accuracy gain.
PyJoules measures the energy consumption per individual hardware components and this data could be separated in order to provide a relative component view. Another improvement could be to measure the system component utilization over time, which can be an indicator of wasted energy.