Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/andreazoccatelli/light_permanova
A lightweight implementation of PERMANOVA based on Euclidean distance from centroid
https://github.com/andreazoccatelli/light_permanova
computervision numpy permanova pytorch statistics tabular-data
Last synced: 3 days ago
JSON representation
A lightweight implementation of PERMANOVA based on Euclidean distance from centroid
- Host: GitHub
- URL: https://github.com/andreazoccatelli/light_permanova
- Owner: AndreaZoccatelli
- Created: 2024-08-07T16:27:33.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2024-09-12T22:26:38.000Z (2 months ago)
- Last Synced: 2024-09-14T07:36:45.903Z (2 months ago)
- Topics: computervision, numpy, permanova, pytorch, statistics, tabular-data
- Language: Jupyter Notebook
- Homepage: https://light-permanova.readthedocs.io
- Size: 2.58 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# LightPERMANOVA
A lightweight implementation of PERMANOVA based on Euclidean distance from centroid.## Overview
One known problem of machine learning models in production that affects their predictive ability is covariate shift. It is defined as a change in the distribution of one or more independent variables used to train the model.ANOVA is often adopted to assess if two samples are from the same population by comparing the variance of their means (H0: all $$\mu$$’s are equal; H1: at least one pair of $$\mu$$’s are
not equal). This test relies, however, on the normality assumption of the samples, which makes it a non-viable solution to effectively monitor batches of data.PERMANOVA is a multivariate version of ANOVA based on the pseudo-F statistic, which makes use of permutations, allowing for a non-parametric estimation.
In the case of covariates shift monitoring, the test compares the original sample $$s_0$$ used at time $$t_0$$ to train the model with a new, unseen sample $$s_1$$ on which the model made predictions at time $$t_1$$.
## Useful links
Read the docs [here](https://light-permanova.readthedocs.io/en/latest/index.html).This project is part of "[Root.](https://andrea-zoccatelli.gitbook.io/me/v/root.)".