https://github.com/syrte/robustgp
Robust Gaussian Process with Iterative Trimming
https://github.com/syrte/robustgp
gaussian-processes gpy itgp machine-learning nonlinear-regression outlier-detection python regression robust-regresssion robust-statistics
Last synced: 10 months ago
JSON representation
Robust Gaussian Process with Iterative Trimming
- Host: GitHub
- URL: https://github.com/syrte/robustgp
- Owner: syrte
- License: mit
- Created: 2020-05-19T18:19:59.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2021-06-13T17:09:47.000Z (almost 5 years ago)
- Last Synced: 2025-07-28T19:49:21.839Z (10 months ago)
- Topics: gaussian-processes, gpy, itgp, machine-learning, nonlinear-regression, outlier-detection, python, regression, robust-regresssion, robust-statistics
- Language: Jupyter Notebook
- Homepage:
- Size: 910 KB
- Stars: 23
- Watchers: 3
- Forks: 5
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# robustgp
[Robust Gaussian Process Regression Based on Iterative Trimming (ITGP)](https://arxiv.org/abs/2011.11057)
- Nonparametric, flexible, and robust nonlinear regression based on Gaussian process.
- Outperforming the robust Gaussian process with Student’s-t likelihood significantly in many test cases.
The Gaussian process (GP) regression can be severely biased when the data are contaminated by outliers. ITGP is a new robust GP regression algorithm that iteratively trims the most extreme data points. While the new algorithm retains the attractive properties of the standard GP as a nonparametric and flexible regression method, it can greatly improve the model accuracy for contaminated data even in the presence of extreme or abundant outliers. It is also easier to implement compared with previous robust GP variants that rely on approximate inference. Applied to a wide range of experiments with different contamination levels, the proposed method significantly outperforms the standard GP and the popular robust GP variant with the Student-t likelihood in most test cases.
## Install
```
pip install robustgp
```
Dependency: [GPy](https://github.com/SheffieldML/GPy/)
## Quick start
One can start with examples in [this notebook](https://nbviewer.jupyter.org/github/syrte/robustgp/blob/master/notebook/Example_Neal_Dataset.ipynb).
## Usage
```python
from robustgp import ITGP
# train ITGP
res = ITGP(X, Y, alpha1=0.5, alpha2=0.975, nsh=2, ncc=2, nrw=1)
gp, consistency = res.gp, res.consistency
# make prediction
y_avg, y_var = gp.predict(x_new)
y_var *= consistency
```
Here `gp` is a `GPy.core.GP` object, whose usage is further illustrated [here](https://nbviewer.jupyter.org/github/SheffieldML/notebook/blob/master/GPy/basic_gp.ipynb).
- See the [docstring of ITGP](https://github.com/syrte/robustgp/blob/master/robustgp/robustgp.py) for detailed usage and full API.
- See this [notebook](https://github.com/syrte/robustgp/blob/master/notebook/Example_Neal_Dataset.ipynb) for a complete example.
## Benchmark
Comparison with standard Gaussian process and t-likelihood Gaussian process in terms of RMSE.
See the algorithm paper for details.

## References
- Algorithm paper:
[Robust Gaussian Process Regression Based on Iterative Trimming](https://arxiv.org/abs/2011.11057),
Zhao-Zhou Li, Lu Li, & Zhengyi Shao, 2020
- First application:
[Modeling Unresolved Binaries of Open Clusters in the Color-Magnitude Diagram. I. Method and Application of NGC 3532](https://ui.adsabs.harvard.edu/abs/2020ApJ...901...49L/),
Lu Li, Zhengyi Shao, Zhao-Zhou Li, Jincheng Yu, Jing Zhong, & Li Chen, 2020
## License
The MIT License