Projects in Awesome Lists tagged with oversampling
A curated list of projects in awesome lists tagged with oversampling.
https://github.com/analyticalmindsltd/smote_variants
A collection of 85 minority oversampling techniques (SMOTE) for imbalanced learning with multi-class oversampling and model selection features
imbalanced-data imbalanced-learning oversampling smote
Last synced: 21 Oct 2025
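For readers new to the tag, here is a minimal sketch of the interpolation idea behind SMOTE-style oversampling. It is written in plain Python and does not use the smote_variants API. The function name `smote_like` and its parameters are hypothetical, chosen only for illustration.

```python
import math
import random


def smote_like(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbours.

    Illustrative only: `smote_like` is a hypothetical helper, not a library call.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest minority neighbours of the base point (excluding itself)
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # random point on the segment between base and neighbour
        u = rng.random()
        synthetic.append(tuple(b + u * (n - b) for b, n in zip(base, neighbour)))
    return synthetic


minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
new_points = smote_like(minority, n_new=5)
```

Each synthetic point lies on a line segment between two existing minority samples, so it stays inside the region the minority class already occupies.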
https://github.com/jfilter/split-folders
🗂 Split folders with files (e.g. images) into training, validation and test (dataset) folders
dataset deep-learning machine-learning oversampling python python-package splitting test training validation
Last synced: 28 Jan 2026
https://github.com/maxhalford/pytorch-resample
🎲 Iterable dataset resampling in PyTorch
imbalanced-learning oversampling pytorch resampling undersampling
Last synced: 17 Jul 2025
https://github.com/dmey/synthia
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
augmentation climate copula data-augmentation data-generation data-generator data-modelling data-science dependency-analysis dependency-modeling finance fpca functional-data machine-learning oversampling principal-component-analysis statistics synthetic-data weather xarray
Last synced: 01 Feb 2026
https://github.com/priyavrat-misra/xrays-and-gradcam
Classification and Gradient-based Localization of Chest Radiographs using PyTorch.
cnn covid-19 deep-learning densenet121 early-stopping fine-tuning gradcam imbalanced-data localization oversampling pneumonia pytorch-implementation radiographs resnet18 transfer-learning vgg16 xrays
Last synced: 14 Apr 2025
https://github.com/ncordon/imbalance
binary-classification imbalanced-data oversampling r
Last synced: 13 Apr 2025
https://github.com/georgedouzas/imbalanced-learn-extra
Implementation of novel oversampling algorithms.
clustering-based-oversampling data-science g-somo geometric-smote imbalanced-learning kmeans-smote machine-learning oversampling python scikit-learn smote
Last synced: 06 Feb 2026
https://github.com/anaxagor/applybn
Multi-purpose data analysis framework based on Bayesian networks and Causal models
bayesian-networks causal-models concept-analysis feature-extraction feature-selection outlier-detection oversampling tabular-data time-series
Last synced: 26 Feb 2026
https://github.com/rajoy99/osman
osman: OverSampling by Deep Generative Models. A pip package that oversamples class-imbalanced binary data using deep generative models.
deep-learning gan generative-model oversampling variational-autoencoder wgan-gp
Last synced: 14 Jan 2026
https://github.com/joaopfonseca/publications
Repository containing most of the source code (LaTeX, Python, etc.) of all experiments and papers I have been involved with.
active-learning data-augmentation machine-learning oversampling research
Last synced: 13 Jun 2026
https://github.com/chaitanyac22/fraud_analytics_credit_card_fraud_detection
The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.
adasyn banking credit-card-fraud-detection data-analysis decision-tree-classifier fraud-analytics hyperparameter-optimization hyperparameter-tuning imblearn kneighborsclassifier logistic-regression machine-learning-algorithms oversampling pipelines power-transformers random-forest-classifier randomoversampler smote svm-classifier xgboost-classifier
Last synced: 13 Apr 2025
https://github.com/predict-idlab/tpehgdb-experiments
Experiments conducted on the TPEHGDB dataset to reproduce the reported results from "A critical look at studies applying over-sampling on the TPEHGDB dataset"
data-leakage imbalanced-data oversampling tpehgdb-dataset
Last synced: 07 Jul 2025
https://github.com/alfurka/synloc
A Python Package to Create Synthetic Tabular Data
clustering constrained-clustering copulas data-augmentation distributions k-means knn local-sampling machine-learning multivariate-distributions nonparametric-distribution oversampling python resampling sampling semi-parametric-modeling statistics synthetic synthetic-data synthetic-dataset-generation
Last synced: 14 Jan 2026
https://github.com/ditronix/pvim-precision-voltage-iot-monitor
DitroniX PVIM ESP32 AD7606 Precision Voltage IoT Monitor SDK Board
16bit accurate ad7606 adc analogue balanced compact current digital-filter dsp eeprom esp32 iot low-noise oversampling precision synchronous-data-acquisition unbalanced voltage
Last synced: 02 Mar 2026
https://github.com/kingabzpro/malawi-news-classification
Using text classifier to predict various categories in Malawi News articles using SMOTE and SGDClassifier.
africa multiclass-classification nlp-machine-learning oversampling
Last synced: 19 Apr 2026
https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset
BBM467*SDSP - Small Data Science Project - Things to consider in cross-validation and resampling when dealing with imbalanced data: what is the right way?
bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote
Last synced: 21 Jun 2025
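One widely recommended answer to that question is to oversample only the training portion of each fold, so that duplicated or synthetic samples never reach the held-out data. The sketch below illustrates that idea in plain Python. It is not taken from the repository, and both helper names are hypothetical.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate random rows of smaller classes until every class matches the largest."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


def folds_with_train_only_oversampling(rows, labels, n_folds=3):
    """Yield (train_rows, train_labels, test_rows, test_labels) for each fold."""
    indices = list(range(len(rows)))
    for f in range(n_folds):
        test_idx = indices[f::n_folds]
        test_set = set(test_idx)
        train_idx = [i for i in indices if i not in test_set]
        # Resample AFTER splitting, so the test fold keeps its original distribution.
        tr_rows, tr_labels = random_oversample(
            [rows[i] for i in train_idx], [labels[i] for i in train_idx], seed=f
        )
        yield tr_rows, tr_labels, [rows[i] for i in test_idx], [labels[i] for i in test_idx]


rows = list(range(12))                            # toy samples, identified by index
labels = [1 if i % 4 == 0 else 0 for i in rows]   # 3 positives, 9 negatives
splits = list(folds_with_train_only_oversampling(rows, labels))
```

Oversampling before the split would let copies of a test sample leak into the training data, which tends to inflate validation scores.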
https://github.com/alessandrosocc/machine-learning-project-2022
Final project for the Machine Learning course at the University of Cagliari in 2022. Analysis of a dataset, use of Machine Learning techniques with Oversampling and Undersampling techniques. Final report with the results obtained.
imblearn machine-learning matplotlib-pyplot oversampling pandas scikit-learn spambase-dataset undersampling
Last synced: 18 Jan 2026
https://github.com/hase3b/class-imbalance-classification-performance-analysis
This repository contains the code, documentation, and datasets for a comprehensive exploration of machine learning techniques to address class imbalance. The project investigates the impact of various methods, such as ADASYN, KMeansSMOTE, and Deep Learning Generator, on classification performance, and demonstrates the benefits of pipelining.
adasyn class-imbalance classification cross-validation data-cleaning data-pipeline data-preprocessing deep-learning eda feature-selection hyperparameter-tuning kmeanssmote oversampling pipeline synthetic-data
Last synced: 04 Apr 2025
https://github.com/avinandanbose/credit-card-fraud-detection-machine-learning-
Credit Card Fraud Detection using Python and Machine Learning.
artificial-intelligence credit-card creditcard decision-trees logistic-regression machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot oversampling pca python quantile-transformer random-over-sampling random-under-sampling seaborn smote-sampling stratified-cross-validation support-vector-machines tomek-link-elimination
Last synced: 19 May 2026
https://github.com/mmsaki/credit-risks-ml
Using the imbalanced-learn and Scikit-learn libraries to build and evaluate machine learning models.
balanced-accuracy-scores classification-models credit-risk imbalanced-classification imbalanced-learning loan-prediction-analysis logistic-regression machine-learning oversampling predictive-modeling resampling sklearn smote-oversampler smoteenn-combination undersampling-technique
Last synced: 28 Apr 2026
https://github.com/tanyachutani/credit-card-fraud-detection
Applied undersampling and oversampling using SMOTE.
credit-card-fraud-detection data-imbalance fraud-detection machine-learning oversampling smote undersampling
Last synced: 10 Jun 2026
https://github.com/jianninapinto/bandersnatch
This project implements a machine learning model using Random Forest, XGBoost, and Support Vector Machines algorithms with oversampling and undersampling techniques to handle imbalanced classes for classification tasks in the context of predicting the rarity of monsters.
altair imbalanced-classification imblearn machine-learning mongodb oversampling pycharm-ide pymongo python random-forest-classifier scikit-learn smote support-vector-machines undersampling xgboost
Last synced: 29 Sep 2025
https://github.com/rakibhhridoy/handlingimbalanceddataset-business
It is important that credit card companies can recognize fraudulent credit card transactions so that customers are not charged for items that others purchased illegally. Very large transactions made by suspicious parties also need to be caught.
auc business-intelligence fraud-detection imbalanced-data imbalanced-learning machine-learning oversampling precision recall smote transcations
Last synced: 14 May 2025
https://github.com/alessandrosocc/deep-learning-project-2023
A study of oversampling techniques using GAN and CycleGAN: an overview using a binary classifier. University of Cagliari, 2022.
cyclegan deep-learning gan keras oversampling tensorflow2
Last synced: 18 Jan 2026