Projects in Awesome Lists tagged with imbalanced-classification
A curated list of projects in awesome lists tagged with imbalanced-classification .
https://github.com/YyzHarry/imbalanced-regression
[ICML 2021, Long Talk] Delving into Deep Imbalanced Regression
computer-vision healthcare icml icml-2021 imbalance imbalanced-classification imbalanced-data imbalanced-learning imbalanced-regression long-tail natural-language-processing regression
Last synced: 09 May 2025
https://github.com/YyzHarry/imbalanced-semi-self
[NeurIPS 2020] Semi-Supervision (Unlabeled Data) & Self-Supervision Improve Class-Imbalanced / Long-Tailed Learning
class-imbalance imbalanced-classification imbalanced-data imbalanced-learning long-tail long-tailed-recognition neurips neurips-2020 self-supervised-learning semi-supervised-learning unlabeled-data
Last synced: 08 May 2025
https://github.com/zhiningliu1998/imbalanced-ensemble
🛠️ Class-imbalanced Ensemble Learning Toolbox. | 类别不平衡/长尾机器学习库
class-imbalance classification data-mining data-science ensemble ensemble-imbalanced-learning ensemble-learning ensemble-model imbalanced-classification imbalanced-data imbalanced-learning long-tail machine-learning multi-class-classification python python3 scikit-learn sklearn
Last synced: 15 May 2025
https://github.com/ZhiningLiu1998/imbalanced-ensemble
🛠️ Class-imbalanced Ensemble Learning Toolbox. | 类别不平衡/长尾机器学习库
class-imbalance classification data-mining data-science ensemble ensemble-imbalanced-learning ensemble-learning ensemble-model imbalanced-classification imbalanced-data imbalanced-learning long-tail machine-learning multi-class-classification python python3 scikit-learn sklearn
Last synced: 11 Apr 2025
https://github.com/solegalli/machine-learning-imbalanced-data
Code repository for the online course Machine Learning with Imbalanced Data
data-science imbalanced-classification imbalanced-data imbalanced-learning machine-learning python
Last synced: 16 May 2025
https://github.com/jiawei-ren/BalancedMetaSoftmax-Classification
[NeurIPS 2020] Balanced Meta-Softmax for Long-Tailed Visual Recognition
imbalanced-classification imbalanced-learning
Last synced: 05 Apr 2025
https://github.com/YyzHarry/multi-domain-imbalance
[ECCV 2022] Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization, and Beyond
deep-learning domain-adaptation domain-generalization eccv eccv-2022 imbalance imbalanced-classification imbalanced-data imbalanced-learning long-tail long-tailed-recognition multi-domain multi-domain-learning ood ood-generalization
Last synced: 08 May 2025
https://github.com/TACJu/Bi-Sampling
This is the official PyTorch implementation of the paper "Rethinking Re-Sampling in Imbalanced Semi-Supervised Learning" (Ju He, Adam Kortylewski, Shaokang Yang, Shuai Liu, Cheng Yang, Changhu Wang, Alan Yuille).
imbalanced-classification semi-supervised-learning
Last synced: 08 May 2025
https://github.com/LirongWu/GraphMixup
Code for ECML-PKDD 2022 paper "GraphMixup: Improving Class-Imbalanced Node Classification by Reinforcement Mixup and Self-supervised Context Prediction"
graph-algorithms graph-self-supervised-learning imbalanced-classification imbalanced-data reinforcement-learning
Last synced: 15 Aug 2025
https://github.com/lirongwu/graphmixup
Code for ECML-PKDD 2022 paper "GraphMixup: Improving Class-Imbalanced Node Classification by Reinforcement Mixup and Self-supervised Context Prediction"
graph-algorithms graph-self-supervised-learning imbalanced-classification imbalanced-data reinforcement-learning
Last synced: 27 Jul 2025
https://github.com/wildoctopus/cbloss
Pytorch implementation of Class Balanced Loss based on Effective number of Samples
cbloss class-balanced-loss classbalancedloss focal-loss focalloss imbalanced-classification pytorch-implementation
Last synced: 26 Oct 2025
https://github.com/theochem/b3clf
Predictors for Blood-Brain Barrier Permeability with resampling strategies based on B3DB database.
bioinformatics blood-brain-barrier classification cns-drug drug-design imbalanced-classification imbalanced-learning molecular-modeling permeability
Last synced: 24 Oct 2025
https://github.com/amajji/multi-class-classification
Deployment of a classification model on a webapp using FLASK for the backend and html/CSS/JS for frontend
analyse-data app classification data flask flask-application imbala imbalanced-classes imbalanced-classification imbalanced-data machine-learning machine-learning-algorithms preprocessing webapp webapplication
Last synced: 24 Sep 2025
https://github.com/solegalli/imbalanced-data-myths-mistakes-solutions
Code repository for the book "Imbalanced Data: Myths, Mistakes and Modern Solutions".
cost-sensitive-learning data-preparation data-preprocessing data-science imbalanced-classification imbalanced-data imbalanced-learning imblearn machine-learning machine-learning-algorithms python scikit-learn
Last synced: 22 Jun 2026
https://github.com/ejw-data/ml-myopia
A variety of machine learning techniques used to identify nearsighted patients
cross-validation gridsearchcv imbalanced-classification kmeans knn machine-learning pca pipeline python random-forest scikit-learn svc tensorflow tsne
Last synced: 11 Jul 2025
https://github.com/splch/qbs
An effective and flexible Quantile-Based Balanced Sampling algorithm for addressing class imbalance in datasets while preserving the underlying data distribution, improving model performance across various machine learning applications.
classification data-analysis imbalanced-classification imbalanced-data machine-learning resampling
Last synced: 01 Apr 2025
https://github.com/k1nght/online_cl_logit_adjusted_softmax
official code repository for TMLR paper "Online Continual Learning via Logit Adjusted Softmax"
continual-learning imbalanced-classification logit-adjusted-softmax online-continual-learning
Last synced: 05 Apr 2025
https://github.com/mmsaki/credit-risks-ml
Using the imbalanced-learn and Scikit-learn libraries to build and evaluate machine learning models.
balanced-accuracy-scores classification-models credit-risk imbalanced-classification imbalanced-learning loan-prediction-analysis logistic-regression machine-learning oversampling predictive-modeling resampling sklearn smote-oversampler smoteenn-combination undersampling-technique
Last synced: 28 Apr 2026
https://github.com/ahmetzamanis/usedcarkicksclassification
Imbalanced classification with scikit-learn and PyTorch Lightning.
class-weights classification classification-metrics data-science deep-learning focal-loss hyperparameter-optimization imbalanced-classification logistic-regression machine-learning neural-network optuna python pytorch pytorch-lightning scikit-learn sensitivity-analysis stochastic-gradient-descent support-vector-machines xgboost
Last synced: 10 May 2026
https://github.com/antoniof1704/imbalanced-classification-example
An example of a model I built where the dataset contained a very imbalanced class. Due to data governance rules, I have replaced the original dataset used in the modelling with a credit card fraud dataset from Kaggle.
fraud-detection imbalanced-classification jupiter-notebook modelling smote
Last synced: 10 Sep 2025
https://github.com/mwombeki6/multiclass-machine-learning-web-app
This is a multiclass classification project to classify severity of road accidents into three categories. this project is based on real-world data and dataset is also highly imbalanced.
classification data-mining data-science imbalanced-classification machine-learning
Last synced: 03 Aug 2025
https://github.com/ekellbuch/longtail_ensembles
Evaluating ensemble performance in long-tailed datasets (Neurips 2023 Heavy Tails Workshop)
class-imbalance ensemble-learning fairness-ml imbalanced-classes imbalanced-classification imbalanced-data imbalanced-learning
Last synced: 15 May 2026
https://github.com/celineboutinon/credit-scoring
Source code for OpenClassrooms - Data Scientist Project 7 - Implement a Scoring Model
drift-detection evidently fraud-detection imbalanced-classification lightgbm mlflow mlflow-model pyfunc sklearn smote-oversampler streamlit xgboost-classifier
Last synced: 06 May 2026
https://github.com/mohammad95labbaf/outlier-imbalanced-fraud-detection
The Credit Card Fraud Detection project uses statistical techniques and machine learning for identifying fraudulent transactions. It includes data preprocessing, outlier detection using Boxplots and Z-scores, and a decision tree model. Evaluation goes beyond accuracy, considering precision, recall, F1-score, and ROC AUC.
boxplot classification-model credit-card credit-card-fraud credit-card-fraud-detection decision-tree decision-tree-classifier fraud-detection imbalanced-classification imbalanced-data outlier-detection outlier-removal z-score
Last synced: 10 Jun 2026
https://github.com/jianninapinto/bandersnatch
This project implements a machine learning model using Random Forest, XGBoost, and Support Vector Machines algorithms with oversampling and undersampling techniques to handle imbalanced classes for classification tasks in the context of predicting the rarity of monsters.
altair imbalanced-classification imblearn machine-learning mongodb oversampling pycharm-ide pymongo python random-forest-classifier scikit-learn smote support-vector-machines undersampling xgboost
Last synced: 29 Sep 2025
https://github.com/mehrab-kalantari/vehicle-claim-fraud-detection
Vehicle insurance claim fraud detection dataset analysis and modeling
classification data-preprocessing data-understanding imbalanced-classification imbalanced-data machine-learning supervised-learning
Last synced: 14 Apr 2026
https://github.com/ahmetzamanis/loanrequestclassification
Imbalanced classification with loan clients dataset.
classification data-science hyperparameter-optimization imbalanced-classification k-nearest-neighbours logistic-regression machine-learning mlr3 naive-bayes performance-metrics regularization support-vector-machines xgboost
Last synced: 22 Aug 2025
https://github.com/halacoded/bodyperformance_imbalanced
exploring Imbalanced classification and Techniques to Handle Imbalance.
coded imbalanced-classification imbalanced-data machine-learning
Last synced: 21 Jun 2025
https://github.com/andreazoccatelli/tabular_data_augmentation_continuous
This repository contains the scripts used to write my master degree thesis project: "Augmentation of tabular data with continuous features for binary imbalanced classification problems"
cgan copula data-augmentation imbalanced-classification imbalanced-data imbalanced-learning
Last synced: 21 Apr 2026
https://github.com/lkethridge/supervised_learning
Supervised Learning project from TripleTen
class-imbalance-handling confusion-matrix data-upload downsampling f1-score feature-prep feature-scaling fpr imbalanced-classification label-encoding one-hot-encoding ordinal-encoding pr-curve precision recall regression-metrics roc-curve supervised-learning tpr upsampling
Last synced: 28 Mar 2025
https://github.com/jiagengchang/fhr
Multi-omics integration for the classification of functional high risk patients in multiple myeloma.
dimensional-reduction imbalanced-classification multi-omics-integration scikit-learn-pipelines
Last synced: 19 Oct 2025
https://github.com/0zean/hellingerforest
A Python library built in Rust for implementing the Hellinger distance splitting criteria in a Random Forest Classifier to address imbalanced data. Work in progress.
decision-trees imbalanced-classification imbalanced-data random-forest-classifier
Last synced: 05 Jun 2026