An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with oversampling

A curated list of projects in awesome lists tagged with oversampling .

https://github.com/analyticalmindsltd/smote_variants

A collection of 85 minority oversampling techniques (SMOTE) for imbalanced learning with multi-class oversampling and model selection features

imbalanced-data imbalanced-learning oversampling smote

Last synced: 21 Oct 2025

https://github.com/jfilter/split-folders

🗂 Split folders with files (i.e. images) into training, validation and test (dataset) folders

dataset deep-learning machine-learning oversampling python python-package splitting test training validation

Last synced: 28 Jan 2026

https://github.com/maxhalford/pytorch-resample

🎲 Iterable dataset resampling in PyTorch

imbalanced-learning oversampling pytorch resampling undersampling

Last synced: 17 Jul 2025

https://github.com/MaxHalford/pytorch-resample

🎲 Iterable dataset resampling in PyTorch

imbalanced-learning oversampling pytorch resampling undersampling

Last synced: 08 May 2025

https://github.com/anaxagor/applybn

Multi-purpose data analysis framework based on Bayesian networks and Causal models

bayesian-networks causal-models concept-analysis feature-extraction feature-selection outlier-detection oversampling tabular-data time-series

Last synced: 26 Feb 2026

https://github.com/rajoy99/osman

osman: OverSampling by Deep Generative Models A pip package which oversamples class imbalance binary data by Deep Generative Models.

deep-learning gan generative-model oversampling variational-autoencoder wgan-gp

Last synced: 14 Jan 2026

https://github.com/joaopfonseca/publications

Repository containing most of the source code (LaTeX, Python, etc.) of all experiments and papers I have been involved with.

active-learning data-augmentation machine-learning oversampling research

Last synced: 13 Jun 2026

https://github.com/predict-idlab/tpehgdb-experiments

Experiments conducted on the TPEHGDB dataset to reproduce the reported results from "A critical look at studies applying over-sampling on the TPEHGDB dataset"

data-leakage imbalanced-data oversampling tpehgdb-dataset

Last synced: 07 Jul 2025

https://github.com/kingabzpro/malawi-news-classification

Using text classifier to predict various categories in Malawi News articles using SMOTE and SGDClassifier.

africa multiclass-classification nlp-machine-learning oversampling

Last synced: 19 Apr 2026

https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset

BBM467*SDSP - Small Data Science Project - Things to consider in cross validation and resampling when dealing with Imbalanced Data : What is the right way?

bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote

Last synced: 21 Jun 2025

https://github.com/alessandrosocc/machine-learning-project-2022

Final project for the Machine Learning course at the University of Cagliari in 2022. Analysis of a dataset, use of Machine Learning techniques with Oversampling and Undersampling techniques. Final report with the results obtained.

imblearn machine-learning matplotlib-pyplot oversampling pandas scikit-learn spambase-dataset undersampling

Last synced: 18 Jan 2026

https://github.com/hase3b/class-imbalance-classification-performance-analysis

This repository contains the code, documentation, and datasets for a comprehensive exploration of machine learning techniques to address class imbalance. The project investigates the impact of various methods, like ADASYN, KMeansSMOTE, and Deep Learning Generator, on classification performance while effectively demonstrating benefits of pipelining.

adasyn class-imbalance classification cross-validation data-cleaning data-pipeline data-preprocessing deep-learning eda feature-selection hyperparameter-tuning kmeanssmote oversampling pipeline synthetic-data

Last synced: 04 Apr 2025

https://github.com/jianninapinto/bandersnatch

This project implements a machine learning model using Random Forest, XGBoost, and Support Vector Machines algorithms with oversampling and undersampling techniques to handle imbalanced classes for classification tasks in the context of predicting the rarity of monsters.

altair imbalanced-classification imblearn machine-learning mongodb oversampling pycharm-ide pymongo python random-forest-classifier scikit-learn smote support-vector-machines undersampling xgboost

Last synced: 29 Sep 2025

https://github.com/rakibhhridoy/handlingimbalanceddataset-business

It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase but by others illegally. Some huge transactions can also done by suspicious figure, it need to catch em.

auc business-intelligence fraud-detection imbalanced-data imbalanced-learning machine-learning oversampling precision recall smote transcations

Last synced: 14 May 2025

https://github.com/alessandrosocc/deep-learning-project-2023

A study of oversampling techniques using GAN and CycleGAN: an overview using a binary classifier. University of Cagliari, 2022.

cyclegan deep-learning gan keras oversampling tensorflow2

Last synced: 18 Jan 2026