# ConfOpt

[![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
[![arXiv](https://img.shields.io/badge/arXiv-ACHO-cyan)](https://doi.org/10.48550/arXiv.2207.03017)

ConfOpt is an inferential hyperparameter optimization package designed to
speed up model hyperparameter tuning.

The package currently implements Adaptive Conformal Hyperparameter Optimization (ACHO), as detailed
in [the original paper](https://doi.org/10.48550/arXiv.2207.03017).

## Installation

You can install ConfOpt from [PyPI](https://pypi.org/project/confopt) using `pip`:

```bash
pip install confopt
```

## Getting Started

As an example, we'll tune a Random Forest model with data from a regression task.

Start by setting up your training and validation data:

```python
from sklearn.datasets import fetch_california_housing

# Load a regression dataset and split it 50/50 into training and validation sets
X, y = fetch_california_housing(return_X_y=True)
split_idx = int(len(X) * 0.5)
X_train, y_train = X[:split_idx, :], y[:split_idx]
X_val, y_val = X[split_idx:, :], y[split_idx:]
```

Then import the Random Forest model to tune and define a search space for
its parameters (a dictionary mapping each of the model's parameter names to
the list of values to search over):

```python
from sklearn.ensemble import RandomForestRegressor

parameter_search_space = {
    "n_estimators": [10, 30, 50, 100, 150, 200, 300, 400],
    "min_samples_split": [0.005, 0.01, 0.1, 0.2, 0.3],
    "min_samples_leaf": [0.005, 0.01, 0.1, 0.2, 0.3],
    "max_features": [None, 0.8, 0.9, 1],
}
```

Now import the `ConformalSearcher` class and initialize it with:

- The model to tune.
- The training and validation X and y data.
- The parameter search space.
- A `prediction_type` string specifying whether this is a regression or classification problem.

Hyperparameter tuning can be kicked off with the `search` method and a specification
of how long the tuning should run for (in seconds):

```python
from confopt.tuning import ConformalSearcher

searcher = ConformalSearcher(
    model=RandomForestRegressor(),
    X_train=X_train,
    y_train=y_train,
    X_val=X_val,
    y_val=y_val,
    search_space=parameter_search_space,
    prediction_type="regression",
)

searcher.search(
    runtime_budget=120  # How many seconds to run the search for
)
```

Once done, you can retrieve the best parameters obtained during tuning using:

```python
searcher.get_best_params()
```
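The returned parameters can also be used to refit a fresh estimator directly. A minimal sketch, assuming `get_best_params` returns a plain dictionary of parameter names and values:

```python
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error

# Assumption: get_best_params() returns a dict of parameter name -> value
best_params = searcher.get_best_params()

# Refit a fresh model with the tuned configuration and check it on the validation set
best_rf = RandomForestRegressor(**best_params)
best_rf.fit(X_train, y_train)
val_mse = mean_squared_error(y_val, best_rf.predict(X_val))
print(f"Validation MSE: {val_mse:.3f}")
```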

Or obtain a model already initialized with the best parameters:

```python
searcher.get_best_model()
```
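The returned estimator can then be used like any other scikit-learn model. A short sketch; it refits on the training data before predicting, which is safe whether or not the returned model has already been fitted:

```python
# get_best_model() returns an estimator configured with the best parameters
# found during the search; refit it before predicting to be safe
best_model = searcher.get_best_model()
best_model.fit(X_train, y_train)
predictions = best_model.predict(X_val)
```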

More information on specific parameters and overrides not mentioned
in this walk-through can be found in the docstrings or in the `examples`
folder of the main repository.