https://github.com/thiswillbeyourgithub/taguchigridsearchconverter

TaguchiGridSearchConverter: Optimize hyperparameter search with Taguchi array principles, reducing experiments while effectively exploring the parameter space.
https://github.com/thiswillbeyourgithub/taguchigridsearchconverter

Last synced: about 1 year ago
JSON representation

TaguchiGridSearchConverter: Optimize hyperparameter search with Taguchi array principles, reducing experiments while effectively exploring the parameter space.

Host: GitHub
URL: https://github.com/thiswillbeyourgithub/taguchigridsearchconverter
Owner: thiswillbeyourgithub
License: gpl-3.0
Created: 2025-01-10T21:05:25.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2025-01-13T12:19:13.000Z (over 1 year ago)
Last Synced: 2025-04-12T07:52:43.707Z (about 1 year ago)
Language: Python
Size: 35.2 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# TaguchiGridSearchConverter

A Python package for optimizing hyperparameter search using Taguchi array principles. Inspired by [NightHawkInLight's video on Taguchi arrays](https://www.youtube.com/watch?v=5oULEuOoRd0&pp=ygUPdGFndXNoaSBhcnJhYXlz).

**Do fewer experiments than grid search, but do the right ones using Taguchi orthogonal arrays!**

## Why use TaguchiGridSearchConverter?

This library is designed to work seamlessly with scikit-learn's ParameterGrid, providing a drop-in replacement that can significantly reduce your hyperparameter search space.

When tuning machine learning models, traditional grid search can require an exponentially large number of experiments. TaguchiGridSearchConverter helps reduce the number of experiments needed while still effectively exploring the parameter space.

Instead of testing every possible combination of parameters (which can be computationally expensive), this package uses Taguchi array principles to:
1. Reduce the number of experiments needed
2. Maintain good coverage of the parameter space
3. Identify significant parameters efficiently

## Getting started

### Installation

* From PyPI:
* Via uv: `uv pip install TaguchiGridSearchConverter`
* Via pip: `pip install TaguchiGridSearchConverter`
* From GitHub:
* Clone this repo then `pip install .`

### Basic Usage

```python
from sklearn.model_selection import ParameterGrid
from TaguchiGridSearchConverter import TaguchiGridSearchConverter

grid_converter = TaguchiGridSearchConverter()

sample_grid = {
'kernel': ['linear', 'rbf', 'poly'],
'C': [0.1, 1, 10],
'gamma': ['scale', 'auto'],
'verbose': [True], # also handles length 1 lists for fixed params
}

full_grid = ParameterGrid(sample_grid)

reduced_grid = grid_converter.fit_transform(sample_grid)
# Alternative way:
# reduced_grid = grid_converter.fit_transform(full_grid)

# Use the reduced grid in your experiments
for params in reduced_grid:
# Your training/evaluation code here
print(params)
```

In the above experiment, the reduced experiments list is only this:
```python
[{'C': 0.1, 'gamma': 'scale', 'kernel': 'linear', 'verbose': True},
{'C': 1, 'gamma': 'auto', 'kernel': 'rbf', 'verbose': True},
{'C': 10, 'gamma': 'scale', 'kernel': 'poly', 'verbose': True}]
```

# The full experiments list would have been that long:
```python
[{'C': 0.1, 'gamma': 'scale', 'kernel': 'linear', 'verbose': True},
{'C': 0.1, 'gamma': 'scale', 'kernel': 'rbf', 'verbose': True},
{'C': 0.1, 'gamma': 'scale', 'kernel': 'poly', 'verbose': True},
{'C': 0.1, 'gamma': 'auto', 'kernel': 'linear', 'verbose': True},
{'C': 0.1, 'gamma': 'auto', 'kernel': 'rbf', 'verbose': True},
{'C': 0.1, 'gamma': 'auto', 'kernel': 'poly', 'verbose': True},
{'C': 1, 'gamma': 'scale', 'kernel': 'linear', 'verbose': True},
{'C': 1, 'gamma': 'scale', 'kernel': 'rbf', 'verbose': True},
{'C': 1, 'gamma': 'scale', 'kernel': 'poly', 'verbose': True},
{'C': 1, 'gamma': 'auto', 'kernel': 'linear', 'verbose': True},
{'C': 1, 'gamma': 'auto', 'kernel': 'rbf', 'verbose': True},
{'C': 1, 'gamma': 'auto', 'kernel': 'poly', 'verbose': True},
{'C': 10, 'gamma': 'scale', 'kernel': 'linear', 'verbose': True},
{'C': 10, 'gamma': 'scale', 'kernel': 'rbf', 'verbose': True},
{'C': 10, 'gamma': 'scale', 'kernel': 'poly', 'verbose': True},
{'C': 10, 'gamma': 'auto', 'kernel': 'linear', 'verbose': True},
{'C': 10, 'gamma': 'auto', 'kernel': 'rbf', 'verbose': True},
{'C': 10, 'gamma': 'auto', 'kernel': 'poly', 'verbose': True}]
```

So only 3 experiments instead of 18!

## How it works

The converter takes a parameter grid (similar to scikit-learn's ParameterGrid) and:
1. Determines the number of levels for each parameter
2. Calculates the minimum number of experiments needed
3. Creates a reduced set of parameter combinations using modulo cycling
4. Ensures the reduced set is smaller than the full grid

This approach is particularly useful when:
- You have limited computational resources
- You need quick insights into parameter importance
- You want to reduce experiment time without sacrificing too much accuracy

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/thiswillbeyourgithub/taguchigridsearchconverter

Awesome Lists containing this project

README