An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-imputation

A curated list of projects in awesome lists tagged with data-imputation .

https://github.com/tatevkaren/mathematics-statistics-for-data-science

Mathematical & Statistical topics to perform statistical analysis and tests; Linear Regression, Probability Theory, Monte Carlo Simulation, Statistical Sampling, Bootstrapping, Dimensionality reduction techniques (PCA, FA, CCA), Imputation techniques, Statistical Tests (Kolmogorov Smirnov), Robust Estimators (FastMCD) and more in Python and R.

bootstrap canonical-correlation clustering data-imputation dimensionality-reduction factor-analysis-methods importance-sampling inverse-transform-method linear-regression monte-carlo-simulation pca-analysis probability-distribution python3 r regression-analysis rejection-sampling statistcal-tests

Last synced: 10 Apr 2025

https://github.com/se-jaeger/data-imputation-paper

Research code for the paper "A Benchmark for Data Imputation Methods".

data-imputation data-quality machine-learning

Last synced: 24 Jun 2025

https://github.com/tawfikhammad/data-imputation-methods

Imputation methods aim to estimate the missing values based on the available information in the dataset.

data-cleaning data-imputation machine-learning missing-data null-safety

Last synced: 28 Feb 2025

https://github.com/ehsan-behzadi/breast-cancer-prediction-model

This project implements a machine learning model to predict breast cancer diagnosis. Utilizing techniques such as data preprocessing, feature selection, and various algorithms, the model aims to assist in early detection and improve healthcare outcomes. Explore the repository to understand the methodology and technologies used in this project.

breast-cancer-prediction breast-cancer-wisconsin data-imputation data-preprocessing dbscan-clustering feature-selection handling-missing-values iqr-method k-nearest-neighbours machine-learning mice-algorithm mutual-information outlier-detection predictive-modeling randomizedsearchcv standardization tuning-hyperparameters

Last synced: 02 Apr 2025

https://github.com/yuji1702/ai--powered-triage-system

This project implements a machine learning-based triage system for emergency rooms, which classifies patients based on their symptoms and vitals using a Random Forest Classifier. The system features real-time patient data integration, a user-friendly GUI built with Tkinter, and secure patient data encryption using Fernet from the cryptography lib

cryptography data-imputation data-preprocessing data-security encryption gui healthcare machine-learning matplotlib medical-data python random-forest realt-time scikit-learn seaborn tkinter triage-system

Last synced: 05 Apr 2025

https://github.com/bcebere/genentech-404-challenge

6th place entry for the Genentech – 404 Challenge

automl data-imputation imputation-methods kaggle-competition tabular-data

Last synced: 04 Mar 2025

https://github.com/ehsan-behzadi/a-machine-learning-approach-using-the-pima-indians-diabetes-dataset

This repository features a machine learning project utilizing the Pima Indians Diabetes Dataset to predict diabetes risk. It explores data preprocessing, model training, and evaluation using techniques such as Naive Bayes and K-Nearest Neighbors (KNN) . The aim is to highlight the impact of various health factors on diabetes prediction.

data-imbalance data-imputation data-preprocessing diabetes-prediction feature-selection k-nearest-neighbours leave-one-out-cross-validation machine-learning model-validation naive-bayes-classifier outlier-detection pima-indians-diabetes predictive-modeling recursive-feature-elimination standardization

Last synced: 09 Aug 2025