An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/souravxbera/credit-card-approval-predictor

End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture

credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit

Last synced: 15 May 2026

https://github.com/ssreeramj/hackerearth-exhibit-art

This is my solution to the HackerEarth Exhibit Art Machine Learning Challenge

gradient-boosting machine-learning pandas regression scikit-learn

Last synced: 20 Apr 2026

https://github.com/vishal-verma-96/Pre-Owned-Car-Price-prediction-using-Streamlit-App

Capstone project by Skill Academy: exploratory analysis, visualization, and prediction of used car prices. Deploys the highest-scoring model as a Streamlit web app.

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit

Last synced: 02 Mar 2025

https://github.com/jtorrescarbajal/ev-charger-accessibility-analysis

Capstone project analyzing EV charger sufficiency and equity across ZIP codes on the U.S. West Coast using Python and public datasets.

jupyter-notebook python scikit-learn wilcoxon-signed-rank

Last synced: 18 Apr 2026

https://github.com/imnotannamaria/ia-ml-scikit-learn

Learning traditional machine learning with scikit-learn.

numpy pandas python scikit-learn seaborn

Last synced: 06 Apr 2026

https://github.com/leticiamilan/formacao-python-developer-dio

Python Developer training track (Formação Python Developer) - Digital Innovation One

django flask pandas pymongo python scikit-learn sqlalchemy sqlite

Last synced: 06 Apr 2026

https://github.com/mathealgou/ml-jobs

This project is a machine learning exercise: the application receives a set of skills from the user and returns a job title that matches them. It uses the Random Forest algorithm to make the prediction based on a jobs dataset.

machine-learning python random-forest-classifier scikit-learn

Last synced: 23 Jun 2026

https://github.com/hariprasath-v/zindi_umojahack-india-income-prediction-challenge

Create a machine learning model to predict whether an individual earns above 50,000 in a specific currency or not.

catboost-classifier classification exploratory-data-analysis f1-score matplotlib numpy pandas python scikit-learn seaborn shap zindi

Last synced: 13 Apr 2026

https://github.com/hariprasath-v/hackerearth_get-a-room-ml-hackathon

Build a Machine Learning model to identify the habitability score of the property based on the property's basic information and location-based information.

exploratory-data-analysis machine-learning numpy pandas pycaret python randomforestregressor regression scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/lefteris-souflas/ai-fashion

"DressMeUp" project utilizes fashion images and color combinations to achieve image classification for clothing combinations. Algorithms include SGD (SVM), Passive Aggressive Classifier, ResNet50 CNN, and EfficientNetV2-S CNN with K-Means for color analysis. Achieved accuracy exceeds 90%. Built with Python, Scikit-Learn, TensorFlow, and Streamlit.

business-idea cnn-classification efficientnetv2 image-classification k-means-clustering machine-learning outfit-matching outfit-planner passive-aggressive-classifier python3 resnet-50 scikit-learn sgd-classifier sqlite3 streamlit tensorflow2

Last synced: 02 May 2026

https://github.com/sauravokos/deep-learning

Repository for a private project on deep learning with multilevel neural networks

keras numpy pillow python python-3-5 scikit-image scikit-learn scipy tensorflow tensorflow-examples tkinter tkinter-graphic-interface

Last synced: 02 Apr 2026

https://github.com/leftcoastnerdgirl/deep_learning

This project introduces neural networks, deep learning, and TensorFlow.

deep-learning jupyter-notebook neural-networks pandas-python scikit-learn tensorflow

Last synced: 04 May 2026

https://github.com/ramanpreet6262/ml_internity

Repository of code implementations of various machine learning algorithms from my Internity internship.

matplotlib numpy pandas python scikit-learn

Last synced: 06 Apr 2026

https://github.com/hrolive/deep-learning-nanodegree

As one of the top 3% of students in the first phase of the "PyTorch Scholarship Challenge" by Facebook AI, I earned a full scholarship to Udacity’s Deep Learning Nanodegree program

api-gateway aws aws-lambda aws-sagemaker computer-vision convolutional-neural-networks deep-learning deployment machine-learning natural-language-processing numpy pandas python pytorch scikit-learn

Last synced: 12 Apr 2026

https://github.com/adamouization/superconductors-critical-temperature-predictions-regression

📈 Machine Learning & Data Visualisation/Processing techniques for predicting the critical temperatures required for different superconductors to conduct electrical current with no resistance.

jupyter jupyter-lab jupyter-notebook machine-learning matplotlib matplotlib-pyplot numpy pandas prediction python python-3 regression regression-algorithms scikit-learn scikitlearn-machine-learning scipy

Last synced: 12 Apr 2026

https://github.com/barrettotte/anilist-ml

Training a binary classifier model to predict if I would recommend an anime using my Anilist user data.

anilist binary-classification data-visualization machine-learning scikit-learn

Last synced: 15 May 2026

https://github.com/j-nobull/noble_ana-500

Apply advanced Python programming to data science problems

dash machine-learning-algorithms neural-network numpy pandas plotly scikit-learn tensorflow

Last synced: 03 Mar 2025

https://github.com/alan-oliveir/cnn_digitos_manuscritos

Final project for the Neural Networks course (2023/1): a convolutional neural network for handwritten digit recognition.

convolutional-neural-networks jupyter-notebook scikit-learn

Last synced: 08 May 2026

https://github.com/joekakone/prostate-cancer-prediction

Predict Prostate Cancer Diagnosis using ML

machine-learning scikit-learn

Last synced: 15 May 2026

https://github.com/andriamahay11master/machinelearning

This project is created to practice supervised ML using the libraries existing in Python

linear-regression multiple-linear-regression python scikit-learn scikitlearn-machine-learning

Last synced: 15 May 2026

https://github.com/aml-hassan-abd-el-hamid/divorce-analysis-and-prediction

Exploratory data analysis and modelling of a dataset of questionnaire answers from couples who divorced and couples who stayed together, using Python, pandas, pandas-profiling, and scikit-learn.

machine-learning pandas-profiling scikit-learn

Last synced: 03 Apr 2025

https://github.com/nickklos10/Compressive-Strenght-Prediction

This project predicts concrete compressive strength using a neural network regression model built with Keras.

jupyter-notebook keras matplotlib numpy pandas python scikit-learn

Last synced: 12 Mar 2025

https://github.com/mindlessmuse666/features-scaling

A feature-scaling project on the Iris dataset using Python, Pandas, Scikit-learn, Seaborn, and Plotly. Includes data visualization, application of several scaling methods, and evaluation of a logistic regression model's performance.

data-scaling data-visualization feature-engineering iris-dataset machine-learning pandas plotly python scikit-learn seaborn student-project

Last synced: 16 Jun 2025

https://github.com/mindlessmuse666/iris-knn

This project demonstrates the k-nearest neighbors (KNN) algorithm for classifying the Iris dataset. It covers data loading, model training, performance evaluation, and visualization of results using Pandas, Scikit-learn, Matplotlib, Seaborn, and Plotly.

algorithm classification data-analysis data-visualization iris-dataset knn lazy-learning machine-learning python scikit-learn

Last synced: 17 Aug 2025
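
For readers unfamiliar with the technique named in the entry above, k-nearest neighbors can be sketched in plain Python as follows. This is an illustrative sketch only, not code from the repository (which uses scikit-learn); the function name `knn_predict` and the toy measurements are assumptions made for the example.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Euclidean distance from the query to every training point.
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    # Keep the k closest neighbours and count their labels.
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: (petal length, petal width) in cm.
points = [(1.4, 0.2), (1.3, 0.2), (1.5, 0.3), (4.7, 1.4), (4.5, 1.5), (4.9, 1.5)]
labels = ["setosa", "setosa", "setosa", "versicolor", "versicolor", "versicolor"]

prediction = knn_predict(points, labels, query=(1.6, 0.25), k=3)
```

KNN is called a lazy learner because there is no training step: all the work happens at prediction time, when distances to the stored points are computed.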

https://github.com/boomerspine/selflearning_chatbot

Self-learning chatbot using Python

python scikit-learn

Last synced: 10 May 2026

https://github.com/henriquepmartins/ml-number-prediction

Number prediction using Logistic Regression

logistic-regression machine-learning scikit-learn

Last synced: 15 May 2026

https://github.com/bilalm04/email-spam-classifier

A machine learning project that classifies emails as spam or not spam using Logistic Regression, with a deployable Flask API for real-time classification.

api flask jupyter-notebook machine-learning matplotlib nlp numpy pandas python scikit-learn

Last synced: 06 Mar 2026

https://github.com/vishal-038/healthcare

The AI Healthcare System is a web-based application that integrates machine learning with Django to assist users in disease prediction, booking appointments with doctors, purchasing medicines, and managing lab tests.

django django-rest-framework pandas scikit-learn sqllite

Last synced: 05 May 2026

https://github.com/mohd-faizy/preprocess_ml

This repository hosts Python code that utilizes the Scikit-learn preprocessing API for data preprocessing. The code presents a comprehensive range of tools that handle missing data, scale data, encode categorical variables, and perform other functions.

data-science feature-engineering feature-engineering-algorithm feature-extraction feature-selection machine-learning outlier-detection preprocessing-data preprocessor scikit-learn

Last synced: 16 May 2026

https://github.com/rririanto/thesis-projects

The computer science thesis project that I worked on when I was a student and was looking for a part time job

bag machine-learning python2 python27 scikit-learn surf

Last synced: 02 Feb 2026

https://github.com/khanovico/energy-data-analysis

A cloud-based model that analyzes a real-world dataset with BigQuery and other big-data analysis tools. I built a Docker image so the app runs in cross-platform environments.

big-data-processing bigquery docker google-app-engine jupyter-notebook mlflow python scikit-learn seaborn xgboost

Last synced: 17 Feb 2026

https://github.com/raghavendranhp/industrial_copper_modelling

Industrial Copper Modeling optimizes pricing decisions using advanced ML. Predict sales with accuracy, classify leads, and streamline decision-making.

classification-models copper decision-tree-classifier decision-tree-regression pickle-file predictive-modeling regression-models scikit-learn

Last synced: 16 May 2026

https://github.com/daniel-furman/RecFeatureSelect

Feature selection functions (1) using the multi-collinearity matrix and recursively proceeding to a spearman threshold and (2) using Forward Stepwise Selection running on an ensemble sklearner (with options for HPO).

correlation-threshold machine-learning modeling multicollinearity recursion recursive-algorithm scikit-learn spearman-rho

Last synced: 09 Jul 2025
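
The first idea in the entry above, recursively dropping features until no pair exceeds a Spearman threshold, might look roughly like the sketch below. It is not the repository's implementation: the names `ranks`, `spearman`, and `select_features`, the tie-breaking rule (drop whichever member of the worst pair is more correlated with the remaining features), and the toy columns are all assumptions. It also assumes no column is constant, since that would make the correlation undefined.

```python
import math
from statistics import mean

def ranks(values):
    """Average ranks (1-based); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for idx in order[i:j + 1]:
            result[idx] = (i + j) / 2 + 1
        i = j + 1
    return result

def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the rank-transformed data."""
    rx, ry = ranks(x), ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)

def select_features(columns, threshold=0.8):
    """Recursively drop features until no pair has |rho| above `threshold`."""
    names = list(columns)
    if len(names) < 2:
        return names
    rho = {(a, b): abs(spearman(columns[a], columns[b]))
           for a in names for b in names if a != b}
    (a, b), worst = max(rho.items(), key=lambda item: item[1])
    if worst <= threshold:
        return names

    def avg_corr(name):
        return mean(rho[(name, other)] for other in names if other != name)

    # Assumed rule: drop the member of the worst pair that is, on average,
    # more correlated with every other feature.
    drop = a if avg_corr(a) >= avg_corr(b) else b
    remaining = {n: columns[n] for n in names if n != drop}
    return select_features(remaining, threshold)

# Hypothetical toy data: "size_sq" is a monotone function of "size".
columns = {
    "size": [1, 2, 3, 4, 5, 6],
    "size_sq": [1, 4, 9, 16, 25, 36],
    "noise": [3, 1, 6, 2, 5, 4],
}
selected = select_features(columns, threshold=0.8)
```

Because Spearman's rho works on ranks, it flags `size` and `size_sq` as redundant even though their relationship is not linear.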

https://github.com/swat1563/recommendation-system

This repository features a recommendation system and analytics engine using datasets on users, organizations, contents, contacts, events, and recommendations. It includes data preprocessing, building a recommendation system, and creating visual reports with Power BI.

analytics data-analysis data-visualization engine kaggle numpy pandas powerbi powerbi-dashboards powerbi-desktop powerbi-reports python recommendation-engine recommendation-system recommender-systems scikit-learn scipy

Last synced: 07 Jan 2026

https://github.com/jbizzlefoshizzle/linear-and-ridge-regression

The purpose of this project was to analyze and predict housing prices using attributes or features such as square footage, number of bedrooms, number of floors, and so on.

linear-regression machine-learning machine-learning-algorithms regression-analysis regression-models ridge-regression scikit-learn scikitlearn-machine-learning train-test-split train-test-using-sklearn

Last synced: 16 May 2026

https://github.com/chris-santiago/tsfeast

A collection of Scikit-Learn compatible time series transformers and tools.

data-science feature-engineering python scikit-learn time-series timeseries-features transformers

Last synced: 01 May 2026

https://github.com/senaayy/adhd-network-efficiency

🧠 End-to-end fMRI analysis pipeline comparing ADHD brain topology vs. Healthy Controls using Graph Theory (Global Efficiency & Clustering). Built with Nilearn, NetworkX, and Docker for reproducible neuroscience.

adhd bioinformatics brain-networks computational-neuroscience data-science docker fmri graph-theory network-analysis networkx neuroscience nilearn python scikit-learn

Last synced: 17 Jun 2026

https://github.com/ranimeshehata/softmax-regression-on-mnist

A PyTorch-based project for classifying the MNIST dataset using Softmax Regression, including training, validation, results and visualization.

matplotlib mnist python3 pytorch scikit-learn softmax-regression torchvision

Last synced: 15 Apr 2026

https://github.com/usmana5809/quran-recitation-audio-classification

Quran Recitation Audio Classification project aims to classify different recitations of the Quran using machine learning techniques. It involves preprocessing audio data, extracting features, training models, and evaluating their performance

audio-classification classification-model islamic-studies librosa machine-learning python quran scikit-learn

Last synced: 20 Mar 2025

https://github.com/umasivakumar14/real_estate_ml_model

Predicts the price of a home in Bengaluru, Karnataka based on location, urbanization, total square feet, bedrooms, bathrooms, and balconies.

aws flask gridsearchcv http-requests machine-learning machine-learning-algorithms nginx pandas python scikit-learn

Last synced: 02 Feb 2026

https://github.com/celineboutinon/faux-billets

OpenClassrooms Data Analyst 2022-2023 - Project 10

machine-learning python scikit-learn statsmodels

Last synced: 16 May 2026

https://github.com/crispengari/python-sklearn

💎 Introduction to machine learning with scikit-learn in Python. A quick walk through the sklearn library for machine learning and understanding different machine learning algorithms.

ai artificial-intelligence classification clustering datascience jupyter-notebook machine-learning ml-python nlp python regression scikit-learn

Last synced: 13 May 2026

https://github.com/alphacrypto246/grape-quality-prediction

The Grape Quality Prediction project uses machine learning to predict the quality of grapes based on chemical properties like acidity, sugar content, and alcohol levels. It applies regression models to forecast the quality score, helping in wine production and quality assessment.

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 19 Apr 2026

https://github.com/lilivalgo/machine-learning-projects

This repository hosts the machine learning project developed during my learning journey. It showcases my progress and the skills acquired in the field of machine learning

lag-feature linear-regression ml-models scikit-learn scipy-stats seaborn-plots

Last synced: 28 Mar 2025

https://github.com/gfyoung/tree-decode

Package for removing the black-box around decision trees

blackbox decision-tree machine-learning python scikit-learn

Last synced: 20 Jan 2026

https://github.com/striderzz/ml-heart-disease-classification

Machine Learning - Heart Disease Classification Project using scikit-learn

classification-machine-learning machine-learning machine-learning-projects scikit-learn

Last synced: 16 May 2026

https://github.com/yuweaec/wine_quality_prediction

The Wine Quality Prediction project aims to predict the quality of wine based on its chemical properties using machine learning algorithms.

flask jupyter-notebook machine-learning python scikit-learn

Last synced: 11 Apr 2025

https://github.com/oneapi-src/predictive-asset-health-analytics

AI Starter Kit for Predictive Asset Maintenance using Intel® optimized version of XGBoost

machine-learning scikit-learn

Last synced: 04 Apr 2025

https://github.com/akimuddinshaikh/domain-application-of-predictive-analysis

Data-Driven House Price Prediction: predicting house prices using machine learning techniques

feature-engineering pca python random-forest scikit-learn

Last synced: 09 May 2026

https://github.com/prashantsaini1525/heart-disease-predicition

Project 1: Heart Disease Prediction. A machine learning project that uses logistic regression to predict the likelihood of heart disease based on clinical data. This repository includes data preprocessing, model training, evaluation, and an interactive prediction system.

cardiology classification data-science healthcare heart-disease logistic-regression machine-learning predictive-modeling python scikit-learn

Last synced: 04 Jul 2025

https://github.com/cs50victor/cspaint

handwritten-text recognition application

ai machine-learning python scikit-learn

Last synced: 09 May 2026

https://github.com/themihirmathur/machine-failure-predictor

The 'Machine Failure Predictor' is a web application designed to enable predictive maintenance for industrial equipment. Leveraging machine learning techniques, this application predicts whether equipment will fail and identifies the type of failure based on key process parameters.

cicd digitalocean docker fastapi flask github github-actions machine-learning matplotlib mlops numpy plotly python randomforest scikit-learn seaborn streamlit

Last synced: 10 Apr 2026

https://github.com/vipinchaudhary31122002/bytephone

A data science project that explores, visualizes, and models smartphone features from a futuristic 2025 dataset to predict mobile prices using machine learning.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/adiseshu-sdet/ai-model-testing

This project demonstrates a complete workflow for developing, testing, and deploying an AI model. The primary objective is to build and test a machine learning model, expose it via a REST API, and automate the entire process using CI/CD pipelines. Below is a detailed summary of the work done.

dockerization flask-rest-api logistic-regression machine-learning python3 scikit-learn

Last synced: 16 May 2026

https://github.com/shreeparab1890/laptop-price-predictor

This notebook builds a model that predicts the price of a laptop from its specifications.

machine-learning-algorithms matplotlib numpy pandas price-prediction python scikit-learn sklearn

Last synced: 06 Apr 2026

https://github.com/mkdirer/depression-data-analysis

This project analyzes a Kaggle depression dataset using data preprocessing, clustering, classification, and outlier detection techniques. Python libraries like pandas, numpy, matplotlib, seaborn, and scikit-learn are used to extract insights.

classification clustering matplotlib numpy pandas scikit-learn seaborn vizualization

Last synced: 12 Apr 2025

https://github.com/alejoduarte23/si_bayesianmixturemodel

Implementation of a two-stage fast Bayesian system identification for separated Modes. This repository expands the usage of this technique by adding a mixture model fit to obtain modal parameters from the posterior distribution.

matplotlib numpy scikit-learn scipy

Last synced: 20 Feb 2026

https://github.com/rishavp15/aivshuman_text

This project helps the user decide whether text entered in a text box is human-generated or computer-generated.

django pandas python scikit-learn

Last synced: 11 Feb 2026

https://github.com/ntanwir10/ai-anamoly-detector

A highly efficient, real-time anomaly detection system for distributed microservices architecture that leverages Redis's advanced probabilistic data structures (RedisBloom) for memory-efficient data collection and RedisGears for in-database processing. An AI model analyzes data patterns to identify and predict system failures before they cascade.

client-sdks docker docker-compose expressjs flask kubertnetes microservices nodejs python reactjs redis-bloomfilter redis-gears redis-pubsub redis-stack redis-streams scikit-learn

Last synced: 06 Apr 2026

https://github.com/akhileshmanitiwari06/gesture-to-command-system

AI that recognizes your gestures via webcam and maps them to system commands (play, pause, next, scroll, etc.)

pyautogui-automation python scikit-learn tkinter-gui

Last synced: 16 May 2026

https://github.com/davidyslu/pokemonrecognition

Recognize Pokémon images using scikit-learn in Python

knn-model python scikit-learn svm-model

Last synced: 02 May 2026

https://github.com/sckonung/crab-age-regression

ML regression model for the crab age dataset competition on Kaggle

keras machine-learning pandas python scikit-learn tensorflow

Last synced: 17 Feb 2026

https://github.com/lakshitalearning/spamfortress

A machine learning-based project to detect SMS spam messages with high accuracy, using the SMS Spam Collection Dataset and techniques like supervised learning, text preprocessing, and model comparison.

data-science google-colab machine-learning nlp scikit-learn sms-spam-detection

Last synced: 28 Apr 2026

https://github.com/kosmoswerner/scikitlearn.net

C# bindings for Scikit-Learn (WIP - Refactor)

machine-learning scikit scikit-learn sklearn

Last synced: 20 Jun 2025

https://github.com/aysh2603/credit-card-fraud-detection

The Credit Card Fraud Detection project aims to identify fraudulent transactions from a dataset of credit card transactions. The project addresses the challenge of class imbalance and employs advanced machine learning techniques to build an effective fraud detection model.

ensemble-learning hyperparameter-tuning numpy pandas python3 scikit-learn streamlit

Last synced: 04 Feb 2026

https://github.com/lorenzorottigni/ml-interview

Machine learning Python bootcamp: K-Nearest Neighbors on a job interview simulation dataset

ipynb k-nearest-neighbours machine-learning numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/florianwoelki/stock_price_prediction

A simple Jupyter notebook for stock price prediction using linear, ridge, and lasso regression models.

lasso-regression linear-regression python ridge-regression scikit-learn stock-price-prediction

Last synced: 07 Apr 2026

https://github.com/alpkanoz/ibm_data_science_professional_certificate

The repository contains projects and training materials carried out throughout the IBM data science professional course.

classification clustering data-analysis data-science data-visualization dataframe ibm ibm-watson machine-learning mathplotlib pandas predictive-modeling python scikit-learn

Last synced: 07 Mar 2026

https://github.com/koradapavani/customer-churn-ml-project

Machine learning project to predict customer churn in telecom

churn-prediction machine-learning python scikit-learn telecom

Last synced: 04 May 2026

https://github.com/sabin74/spam_mail_detection

A machine learning project to classify SMS messages as Spam or Ham (Not Spam) using Natural Language Processing (NLP) techniques and Scikit-learn. This binary classification task uses the UCI SMS Spam Collection Dataset and implements various models including Naive Bayes, SVM, and Logistic Regression with performance tuning.

gridsearchcv nltk python scikit-learn smote sms-spam-detection uci-machine-learning

Last synced: 04 May 2026

https://github.com/manshreet27/mrs

This Movie Recommendation System is a web-based application built using Python and Streamlit, designed to provide movie recommendations based on user preferences. It utilizes TMDb API for fetching real-time movie details and Kaggle's TMDB 5000 Movies dataset for content-based filtering.

numpy pandas python scikit-learn streamlit tmdb-5000-movies-dataset-from-kaggle tmdb-api-for-fetching-real-time-movie-data

Last synced: 07 Apr 2026

https://github.com/muhdhammad/machine-learning

Crafted for hands-on learning and implementation of ML with scikit-learn

data-science jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/fikri-rouzan/energy-consumption-prediction

Final Project for the AI/ML Weekly Class by Google Developer Group on Campus (GDGoC) UIN Jakarta.

jupyter-notebook matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 07 Apr 2026

https://github.com/akapich/clustermatic

Python AutoML library for clustering tasks

automl clustering machine-learning scikit-learn

Last synced: 11 Feb 2026

https://github.com/dmarks84/coursework_project_ml-classifier-eval-selection

Project for University of Michigan Applied Data Science Specialization -- Predicted viewer engagement based on features related to video metrics; evaluated a large set of classifiers under different scoring metrics to select the "optimal" one.

classification cross-validation data-modeling data-reporting data-visualization databases dataframes eda grid-search matplotlib numpy pandas python scikit-learn statistics supervised-ml

Last synced: 02 Apr 2026

https://github.com/abdiasarsene/developpement_tableau_de_bord_de_la_chaine_approvisionnement_power_bi

Develop a complete solution to visualize, analyze, and predict supply chain data.

ci-cd docker fastapi github-actions mysql-database randomizedsearchcv scikit-learn seaborn-plots

Last synced: 23 Jun 2025

https://github.com/aneeshmurali-n/ann-diabetes-prediction

Predicting diabetes progression using an Artificial Neural Network (ANN). This project leverages the scikit-learn diabetes dataset for training and evaluation. Includes data preprocessing, model building, and performance visualization.

ann data-preprocessing data-visualization deep-learning diabetes-prediction exploratory-data-analysis keras machine-learning matplotlib neural-network numpy pandas regression scikit-learn seaborn tensorflow visualization

Last synced: 07 Apr 2026

https://github.com/pramodyasahan/learn-ml

This repository serves as both a personal learning diary and a resource for others interested in understanding and applying machine learning concepts. The projects are categorized based on the type of ML model and are implemented in Python using libraries like scikit-learn, pandas, and numpy.

classification clustering machine-learning matplotlib numpy pandas regression scikit-learn supervised-learning unsupervised-learning

Last synced: 07 Apr 2026

https://github.com/praatibhsurana/breast-cancer-prediction-svm

An SVM classifier coded in Python using Scikit-Learn to classify whether a patient's tumor is malignant or benign.

kaggle-dataset linear-classifier machine-learning-algorithms python scikit-learn svm-classifier

Last synced: 16 May 2026

https://github.com/anthippi/naive-bayes-imdb-classification

A custom Naive Bayes classifier for sentiment analysis of movie reviews from the IMDb dataset, utilizing feature selection based on Information Gain and comparing its performance with scikit-learn's BernoulliNB.

classification imdb matplotlib naive-bayes-classifier numpy pandas scikit-learn sklearn

Last synced: 09 Apr 2026
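
Information Gain, the feature-selection criterion mentioned in the entry above, can be illustrated with a small sketch for binary word-presence features. This is not the repository's code: the function names, the toy corpus, and the choice to keep only the single best word are assumptions made for illustration.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum(
        (count / total) * math.log2(count / total)
        for count in Counter(labels).values()
    )

def information_gain(feature, labels):
    """IG(C; X) = H(C) - sum over values v of P(X=v) * H(C | X=v)."""
    total = len(labels)
    conditional = 0.0
    for value in set(feature):
        subset = [lab for f, lab in zip(feature, labels) if f == value]
        conditional += len(subset) / total * entropy(subset)
    return entropy(labels) - conditional

# Hypothetical toy corpus: 1 = word present in the review, 0 = absent.
labels = ["pos", "pos", "pos", "neg", "neg", "neg"]
features = {
    "great": [1, 1, 1, 0, 0, 0],  # perfectly separates the two classes
    "movie": [1, 0, 1, 1, 0, 1],  # appears equally often in both classes
}
gains = {word: information_gain(col, labels) for word, col in features.items()}

# Keep the highest-scoring word as the selected vocabulary.
top_words = sorted(gains, key=gains.get, reverse=True)[:1]
```

In this toy corpus "great" carries a full bit of information about the class while "movie" carries none, so a selector based on Information Gain would keep the former and discard the latter.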