https://github.com/codecuttech/hydra-demo

Last synced: 6 months ago
JSON representation

Host: GitHub
URL: https://github.com/codecuttech/hydra-demo
Owner: CodeCutTech
Created: 2023-05-23T20:41:55.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2025-05-17T20:50:52.000Z (8 months ago)
Last Synced: 2025-06-30T05:35:53.028Z (6 months ago)
Language: Python
Size: 933 KB
Stars: 31
Watchers: 2
Forks: 10
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Hydra Demo

[![View the Article](https://img.shields.io/badge/CodeCut-View%20the%20Article-blue)](https://codecut.ai/stop-hard-coding-in-a-data-science-project-use-configuration-files-instead/)

## Set up the environment

1. Install [uv](https://github.com/astral-sh/uv)
1. Set up the environment:

```bash
uv sync
```

## Download the data

1. Download the dataset from [Kaggle](https://www.kaggle.com/datasets/uciml/red-wine-quality-cortez-et-al-2009?resource=download)

2. Move the downloaded file to `data/raw/`

## Run the scripts

Run the data processing script:

```bash
uv run src/process.py
```

Run the model training script:

```bash
uv run src/train_model.py
```

Both scripts use Hydra for configuration management. The default configurations are in the `conf/main.yaml` file. You can override any configuration parameter using the command line. For example:

```bash
# Override test size in process.py
uv run src/process.py process.test_size=0.3

# Override hyperparameters in train_model.py
uv run src/train_model.py train.hyperparameters.svm__C=10
```

To see all available configuration options, you can use the `--help` flag:

```bash
# View configuration options for process.py
uv run src/process.py --help

# View configuration options for train_model.py
uv run src/train_model.py --help
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/codecuttech/hydra-demo

Awesome Lists containing this project

README