Projects in Awesome Lists tagged with data-imputation
A curated list of projects in awesome lists tagged with data-imputation .
https://github.com/tatevkaren/mathematics-statistics-for-data-science
Mathematical & Statistical topics to perform statistical analysis and tests; Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality reduction techniques (PCA, FA, CCA), Imputation techniques, Statistical Tests (Kolmogorov Smirnov), Robust Estimators (FastMCD) and more in Python and R.
bootstrap canonical-correlation clustering data-imputation dimensionality-reduction factor-analysis-methods importance-sampling inverse-transform-method linear-regression monte-carlo-simulation pca-analysis probability-distribution python3 r regression-analysis rejection-sampling statistcal-tests
Last synced: 10 Apr 2025
https://github.com/kennethleungty/datawig-missing-data-imputation
Imputation of Missing Data in Tables
data-imputation data-science datawig deep-learning imputation machine-learning
Last synced: 12 Jul 2025
https://github.com/se-jaeger/data-imputation-paper
Research code for the paper "A Benchmark for Data Imputation Methods".
data-imputation data-quality machine-learning
Last synced: 24 Jun 2025
https://github.com/tawfikhammad/data-imputation-methods
Imputation methods aim to estimate the missing values based on the available information in the dataset.
data-cleaning data-imputation machine-learning missing-data null-safety
Last synced: 28 Feb 2025
https://github.com/ehsan-behzadi/breast-cancer-prediction-model
This project implements a machine learning model to predict breast cancer diagnosis. Utilizing techniques such as data preprocessing, feature selection, and various algorithms, the model aims to assist in early detection and improve healthcare outcomes. Explore the repository to understand the methodology and technologies used in this project.
breast-cancer-prediction breast-cancer-wisconsin data-imputation data-preprocessing dbscan-clustering feature-selection handling-missing-values iqr-method k-nearest-neighbours machine-learning mice-algorithm mutual-information outlier-detection predictive-modeling randomizedsearchcv standardization tuning-hyperparameters
Last synced: 02 Apr 2025
https://github.com/shreshthvashisht/bank-loan-case-study
Risk Analytics using Python
data-cleaning data-imputation data-normalization data-standardization exploratory-data-analysis jupyter-notebook loan-analytics loan-default-prediction matplotlib-pyplot numpy pandas python-data-analysis python-data-science risk-analysis seaborn-plots
Last synced: 25 Oct 2025
https://github.com/yuji1702/ai--powered-triage-system
This project implements a machine learning-based triage system for emergency rooms, which classifies patients based on their symptoms and vitals using a Random Forest Classifier. The system features real-time patient data integration, a user-friendly GUI built with Tkinter, and secure patient data encryption using Fernet from the cryptography lib
cryptography data-imputation data-preprocessing data-security encryption gui healthcare machine-learning matplotlib medical-data python random-forest realt-time scikit-learn seaborn tkinter triage-system
Last synced: 05 Apr 2025
https://github.com/bcebere/genentech-404-challenge
6th place entry for the Genentech – 404 Challenge
automl data-imputation imputation-methods kaggle-competition tabular-data
Last synced: 04 Mar 2025
https://github.com/ehsan-behzadi/a-machine-learning-approach-using-the-pima-indians-diabetes-dataset
This repository features a machine learning project utilizing the Pima Indians Diabetes Dataset to predict diabetes risk. It explores data preprocessing, model training, and evaluation using techniques such as Naive Bayes and K-Nearest Neighbors (KNN) . The aim is to highlight the impact of various health factors on diabetes prediction.
data-imbalance data-imputation data-preprocessing diabetes-prediction feature-selection k-nearest-neighbours leave-one-out-cross-validation machine-learning model-validation naive-bayes-classifier outlier-detection pima-indians-diabetes predictive-modeling recursive-feature-elimination standardization
Last synced: 09 Aug 2025