An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with imblearn

A curated list of projects in awesome lists tagged with imblearn.

https://github.com/sorna-fast/fraud-detection

Predicts transaction fraud using classification models such as Gradient Boosting, with a user interface built in Streamlit. Reported score: 98% AUC-ROC.

adaboostclassifier eda gradientboostingclassifier imblearn lgbmclassifier matplotlib-pyplot numpy pandas-dataframe pickle-file plotly-express randomforestclassifier scipy-stats seaborn-plots sklearn-library streamlit-webapp xgbclassifier

Last synced: 28 Apr 2026

https://github.com/mahnoorsheikh16/Credit-Card-Default-Prediction

This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.

chi-square-test encoding hiplot imblearn json knn-imputer matplotlib numpy pandas pca-analysis pillow plotly robust-scalar scipy seaborn sklearn smote streamlit ttest visualization

Last synced: 01 Mar 2025

https://github.com/sayamalt/fraudulent-transactions-prediction

A trained machine learning model that predicts whether a given transaction is fraudulent.

data-visualization exploratory-data-analysis imblearn machine-learning model-based-testing model-building predictive-analytics sklearn

Last synced: 29 Apr 2026

https://github.com/mahnoorsheikh16/credit-card-default-prediction

This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.

encoding hiplot imblearn json knn-imputer logistic-regression matplotlib numpy pandas pca-analysis plotly scipy seaborn sklearn smote streamlit support-vector-machines timeseries-forecasting visualization xgboost-classifier

Last synced: 06 Apr 2026

https://github.com/alessandrosocc/machine-learning-project-2022

Final project for the Machine Learning course at the University of Cagliari in 2022. Analyzes a dataset using machine learning with oversampling and undersampling techniques, and includes a final report with the results obtained.

imblearn machine-learning matplotlib-pyplot oversampling pandas scikit-learn spambase-dataset undersampling

Last synced: 18 Jan 2026

https://github.com/Fedesgh/Asteorid_RandomForest_Classifier

Classifier model trained on an unbalanced dataset, ready for deployment.

imblearn pandas pickle seaborn sklearn

Last synced: 05 Oct 2025

https://github.com/ricardorobledo/malicious_server_hack_detection

Predictive model to detect malicious hacking patterns in banking servers. Utilizes advanced Machine Learning techniques such as SMOTE, Gradient Boosting, and probability calibration to predict attacks befor

anaconda cibersecurity imbalanced-data imbalanced-learning imblearn kaggle matplotlib numpy pandas pandas-library python3 sklearn

Last synced: 14 Apr 2026

https://github.com/fedesgh/asteorid_randomforest_classifier

Classifier model trained on an unbalanced dataset, ready for deployment.

imblearn pandas pickle seaborn sklearn

Last synced: 15 Feb 2026

https://github.com/fedesgh/building_credit_risk_classifier_using_bagging_kneighbors

Problem statement about modeling the target vector, and an attempt to improve metrics.

feature-selection imblearn information-value sklearn

Last synced: 12 Feb 2026

https://github.com/manjit-baishya-datascience/spam-email-detection

This project demonstrates how to build a spam detection system using Natural Language Processing (NLP) and machine learning techniques.

imblearn nlp nlp-machine-learning nltk scikit-learn spam-detection

Last synced: 12 Feb 2026

https://github.com/rajivaleaakash/customer-churn-prediction

A machine learning project focused on predicting customer churn using various data analysis and modeling techniques. The repository includes data preprocessing, feature engineering, exploratory data analysis (EDA), model training, evaluation, and visualization to help businesses identify customers at risk of leaving.

churn-prediction classification customer-churn data-analysis data-science gridsearchcv imblearn machine-learning numpy pandas pyhton randomsearchcv scikit-learn

Last synced: 28 Apr 2026

https://github.com/egorumaev/2023-telekom-customers-churn

Predicting customer churn for a telecom operator

catboost classification imblearn lightgbm pandas phik pipeline python3 sklearn xgboost

Last synced: 30 Apr 2026

https://github.com/viniciusds2020/ml_balaceamento_allknn

This repository contains machine learning code that uses the AllKNN algorithm from the imblearn package to balance data.

allknn imbalanced-data imblearn machine-learning sklearn

Last synced: 01 May 2026

https://github.com/egorumaev/2023-ods-turnstiles

Identifying a visitor based on the characteristic time of their entry onto the organization's premises

catboost featureengineering imblearn multiclass-classification pandas pipeline python3 sklearn

Last synced: 06 May 2026

https://github.com/christabelsakyi/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance. Dataset

imblearn machine-learning nltk numpy python sklearn

Last synced: 07 May 2026

https://github.com/jianninapinto/bandersnatch

This project implements a machine learning model using Random Forest, XGBoost, and Support Vector Machines algorithms with oversampling and undersampling techniques to handle imbalanced classes for classification tasks in the context of predicting the rarity of monsters.

altair imbalanced-classification imblearn machine-learning mongodb oversampling pycharm-ide pymongo python random-forest-classifier scikit-learn smote support-vector-machines undersampling xgboost

Last synced: 29 Sep 2025

https://github.com/paulomppatricio/projeto_challenge_telecomx-br_parte-2

Challenge TelecomX-BR_Parte-2 project, from the Data Science track of the ONE (Oracle Next Education) program in partnership with Alura.

data-science imblearn joblib machine-learning matplotlib modelos-preditivos numpy pandas python scipy seaborn sklearn statsmodels xgboost

Last synced: 12 Apr 2026

https://github.com/Fedesgh/Building_Credit_Risk_Classifier_Using_Bagging_Kneighbors

Problem statement about modeling the target vector, and an attempt to improve metrics.

feature-selection imblearn information-value sklearn

Last synced: 05 Oct 2025

https://github.com/egorumaev/2023-cirrhosis-outcomes

Predicting treatment outcomes for patients with liver cirrhosis

catboost imblearn iqr lda matplotlib numpy pandas pca phik pipeline sklearn t-sne xgboost

Last synced: 08 May 2026

https://github.com/antarmukhopadhyaya/fraud-warden

Fraudulent Credit Transaction detection system using SMOTE, Random Forest Classifier and Streamlit

data-science imblearn machine-learning pandas python sklearn streamlit

Last synced: 05 Jan 2026