https://github.com/helmholtz-ai-energy/ab-training

Last synced: 6 months ago
JSON representation

Host: GitHub
URL: https://github.com/helmholtz-ai-energy/ab-training
Owner: Helmholtz-AI-Energy
License: bsd-3-clause
Created: 2024-04-25T11:20:13.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-04-29T14:01:17.000Z (over 1 year ago)
Last Synced: 2024-04-29T15:26:37.199Z (over 1 year ago)
Language: Python
Size: 557 KB
Stars: 0
Watchers: 4
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# madonna

# Description
Orthogonal DPNN training methods

# Quickstart

## Create the pipeline environment and install the madonna package
Before using the template, one needs to install the project as a package.
* First, create a virtual environment.
> You can either do it with conda (preferred) or venv.
* Then, activate the environment
* Finally, install the project as a package. Run:
```
pip install -e .
```
## Run the MNIST example
This pipeline comes with a toy example (MNIST dataset with a simple feedforward neural network). To run the training (resp. testing) pipeline, simply run:
```
python scripts/train.py
# or python scripts/test.py
```
Or, if you want to submit the training job to a submit (resp. interactive) cluster node via slurm, run:
```
sbatch job_submission.sbatch
# or sbatch job_submission_interactive.sbatch
```
> * The experiments, evaluations, etc., are stored under the `logs` directory.
> * The default experiments tracking system is mlflow. The `mlruns` directory is contained in `logs`. To view a user friendly view of the experiments, run:
> ```
> # make sure you are inside logs (where mlruns is located)
> mlflow ui --host 0000
> ```
> * When evaluating (running `test.py`), make sure you give the correct checkpoint path in `configs/test.yaml`

# Project Organization
```
├── configs
│ ├── callbacks
│ ├── data
│ ├── debug
│ ├── experiment
│ ├── hparams_search
│ ├── local
│ ├── log_dir
│ ├── logger
│ ├── model
│ ├── trainer
│ │
│ ├── test.yaml
│ └── train.yaml
│
├── data
│ └── MNIST
│ ├── processed
│ └── raw
│
├── docs
├── models
├── notebooks
├── reports
│ └── figures
├── scripts
│ ├──
│ ├──
│ ├── test.py
│ └── train.py
│
├── src/madonna
│ ├── datamodules
│ ├── models
│ ├── utils
│ │
│ ├──
│ └──
│
├── tests
│ ├── helpers
│ ├── shell
│ └── unit
│
├── .gitignore
├── .pre-commit-config.yaml
├── requirements.txt
├── setup.cfg
├── LICENSE.txt
├── pyproject.toml
│
├── setup.cfg
├── setup.py
│
└── README.md
``` <- Hydra configuration files <- Callbacks configs <- Datamodule configs <- Debugging configs <- Experiment configs <- Hyperparameter search configs <- Local configs <- Logging directory configs <- Logger configs <- Model configs <- Trainer configs <- Main config for testing <- Main config for training <- Project data <- Processed data <- Raw data <- Directory for Sphinx documentation in rst or md. <- Trained and serialized models, model predictions <- Jupyter notebooks. <- Generated analysis as HTML, PDF, LaTeX, etc. <- Generated plots and figures for reports. <- Scripts used in project job_submission.sbatch <- Submit training job to slurm job_submission_interactive.sbatch <- Submit training job to slurm (interactive node) <- Run testing <- Run training <- Source code <- Lightning datamodules <- Lightning models <- Utility scripts testing_pipeline.py <- Model evaluation workflow training_pipeline.py <- Model training workflow <- Tests of any kind <- A couple of testing utilities <- Shell/command based tests <- Unit tests <- List of files/folders ignored by git <- Configuration of pre-commit hooks for code formatting <- File for installing python dependencies <- Configuration of linters and pytest <- License as chosen on the command-line. <- Build configuration. Don't change! Use `pip install -e .` to install for development or to build `tox -e build`. <- Declarative configuration of your project. <- [DEPRECATED] Use `python setup.py develop` to install for development or `python setup.py bdist_wheel` to build.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/helmholtz-ai-energy/ab-training

Awesome Lists containing this project

README