An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/richardbmk/datascience_machinelearning

projects related with data science and machine learning projects.

data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 11 Apr 2026

https://github.com/agnivchtj/ann

Develop an Artificial Neural Network that can classify inputs based on a number of features

backpropagation-algorithm jupyter-notebooks python scikit-learn

Last synced: 07 May 2026

https://github.com/hariprasath-v/av-dataverse-hack---insurance-claim-prediction

Create a machine learning model to predict if the policyholder will file a claim in the next 6 months or not based on the set of car and policy features.

analyticsvidhya classification exploratory-data-analysis f1-score matplotlib numpy pandas python randomforest-classification scikit-learn seaborn shap

Last synced: 11 Apr 2026

https://github.com/lorenzorottigni/dl-lending-club

Deep Learning python bootcamp: deep learning on Lending Club dataset

deep-learning ipynb keras machine-learning numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/pb319/california_house-price-prediction

This is going to be my first end to end ML project implementation covering all required stages taking guidence from book called "Hands On Machine Learning".

evaluation-metrics hyperparameter-tuning jupyter-notebook kfold-cross-validation machine-learning matplotlib numpy pandas python scikit-learn seaborn train-test-split

Last synced: 11 Apr 2026

https://github.com/atharvapathak/size_estimator_project

This project in Python aims to provide a tool for estimating the size of objects in images or videos. Using computer vision techniques, the project analyzes the input media, detects objects of interest, and provides an estimation of their size based on known reference points or objects.

cicd cnn opencv python pytorch rnn scikit-learn sql tensorflow

Last synced: 11 Apr 2026

https://github.com/nicolas-giacomelli/modelo-previsao-colesterol-com-gradio

Modelo de predição do colesterol com base em informações Modelo conta com pipelines para tratamento de dados Disponibilizado via Gradio

gradio machine-learning matplotlib pandas pingouin python3 saude scikit-learn scipy seaborn

Last synced: 11 Apr 2026

https://github.com/devinw03/movie-genre-nlp

🎬 Classify movie genres from plot summaries using various models, including Transformers, with clear EDA and MLflow tracking for reproducible results.

cosine-similarity countvectorizer datascience distilroberta gru huggingface imdb machine-learning multi-label-classification nlp python pytorch recommendation-engine scikit-learn slack tabulate text-classification word2vec

Last synced: 11 Apr 2026

https://github.com/gayathri2200/car-price-prediction---machine-learning

Car price prediction Machine Learning --- Which is used to predict the price of used cars based on the features.

data-science machine-learning modeldeployment pandas price-prediction python regression scikit-learn streamlit visual-studio visualization

Last synced: 11 Apr 2026

https://github.com/ejw-data/ml-classification-exoplanet

Classification of planets identified by Kepler telescope using multiple models and tuning using GridSearchCV

classification python scikit-learn

Last synced: 09 May 2026

https://github.com/ejw-data/ml-clustering-personality

Analaysis of the big-5 personality test survey results with clustering techniques.

clustering machine-learning python scikit-learn unsupervised-learning

Last synced: 04 May 2026

https://github.com/ejw-data/ml-classification-grants

Compares several machine learning classification models including a neural network to determine whether to approve or reject a grant applicant

classification neural-network python scikit-learn

Last synced: 10 May 2026

https://github.com/kaguya163/marketing_campaigns

Анализ маркетинговой эффективности в спортивном ритейле.

ab-testing machine-learning matplotlib numpy pandas python scikit-learn scipy sqlite3

Last synced: 11 Apr 2026

https://github.com/nauxqouh/python-for-data-science-labs

This repo contains weekly practical codes for my Python for Data Science course at university.

data-science jupyter-notebook numpy pandas python pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/dyarleniber/hands-on-machine-learning

This repository contains code examples, exercises, and projects related to the concepts covered in the book "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition" by Aurélien Géron.

ai artificial-intelligence keras machine-learning matpolotlib numpy pandas scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 20 Jan 2026

https://github.com/stella4444/linear-regression

learning about linear regression (currently a work in progress) ~ working with data

linear-regression machine-learning numpy scikit-learn

Last synced: 20 Jan 2026

https://github.com/abdiasarsene/healthpredict-api-smart-medical-diagnosis-system

Ce projet propose une API intelligente construite avec FastAPI pour prédire des maladies à partir de données médicales de patients. L'application repose sur un modèle de machine learning (Logistic Regression) géré via MLflow, et peut facilement être déployée grâce à Docker.

bentoml docker-compose dockerfiles jenkinsfiles mlflow pandas ray-serve scikit-learn taskfile

Last synced: 11 Apr 2026

https://github.com/vickshan001/tweet-sentiment-classifier-nlp-svm-project

NLP coursework project using SVM to classify tweet sentiments. Features custom preprocessing, error analysis, and cross-validation.

natural-language-processing nlp python scikit-learn sentiment-analysis svm text-classification tweets

Last synced: 31 Mar 2025

https://github.com/adi3042/sensor-fault-detection

🔍⚙️ Ensure Reliable Operations! Detect anomalies and prevent disruptions with our Sensor Fault Detection system. Explore advanced classification and regression techniques to identify and address sensor faults effectively. Your path to robust and accurate sensor data begins here! 🚨🔧 SensorFaultTech

classification css datetime fault-detection flask functools html ipykernel jupternotebook machine-learning numpy pandas python3 readme regression scikit-learn sensor setuptools venv

Last synced: 11 Apr 2026

https://github.com/lorenzorottigni/ml-spam-filter

Machine Learning python bootcamp: natural language processing with spam filter purposes

machine-learning npl numpy pandas python scikit-learn seaborn spam-filtering

Last synced: 11 Apr 2026

https://github.com/csengupta1101/housing-price-prediction

Based on given features , this project deals with the prediction of house prices. From data extraction till deployment , everything is detailed in here.

jupyter-notebook matplotlib numpy pandas python python3 scikit-learn vscode

Last synced: 11 Apr 2026

https://github.com/cmdecker95/learning-ml

My first foray into machine learning, comprised of my early undergrad work (circa 2021-2022) 🤓

jupyter-notebook keras machine-learning scikit-learn tensorflow2

Last synced: 08 Apr 2025

https://github.com/prakashjha1/sensor-based-human-intervention-detection

Objective: Develop an on-device machine-learning model for Android that classifies sensor sequences into four scenarios; walking, handheld still, on-desk with interaction, and on-desk idle using sensor data.

classification feature-engineering feature-extraction lightgbm machine-learning-algorithms matplotlib neural-networks pandas python3 scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/frauvate/cheatsheet

This project provides ready-to-use templates for machine learning base models. This repo, which contains templates of common models in Jupyter notebook format, aims to help users quickly implement these models and easily adapt them to their own projects.

machine-learning machinelearning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/karimosman89/customer-churn-prediction

Predict which customers are likely to stop using a service.Build a model to predict which customers are likely to stop using a service or product. This can help companies take proactive measures to retain customers.Provide actionable insights and retention strategies based on the model’s predictions.

matplotlib-pyplot pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/saurabhjalendra/hrf-retinal-edge-retrieval

🔬 Content-Based Image Retrieval system for retinal fundus images using HRF dataset. Implements handcrafted feature engineering (HOG, LBP, Edge Detection, Gabor, GLCM) with ML models for medical image classification and analysis.

computer-vision content-based-image-retrieval edge-detection feature-engineering gabor-filters hog-features jupyter-notebook machine-learning medical-imaging opencv retinal-analysis scikit-image scikit-learn

Last synced: 12 May 2026

https://github.com/aadrianleo/book-recommendation-system

Book Recommender System using the Book-Crossing dataset. Compares content-based (TF-IDF + cosine similarity) and collaborative filtering (SVD) methods for book recommendations. Includes data cleaning, EDA, and model evaluation (Precision@5, RMSE) in Python.

artificial-intelligence collaborative-filtering content-based-filtering cosine-similarity data-preprocessing jupyter jupyter-notebook machine-learning matrix-factorization numpy pandas predictive-modeling python3 recommendation-system recommender-system scikit-learn sckit-surprise

Last synced: 10 Apr 2026

https://github.com/nickklos10/league-of-legends-match-predictor

Predicting League of Legends Match Outcomes Using Logistic Regression and PyTorch.

jupyter-notebook matplotlib numpy pandas pytorch scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/yahiazakaria445/sentiment-analysis-using-lstm

A Sentiment analysis NLP project using LSTM model

matplotlib nltk numpy pandas scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/alaazameldev/text-based-search-engine

Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval

chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer

Last synced: 20 Jan 2026

https://github.com/dmarks84/coursework_project_ml-classification

Project for IBM Data Science course on Machine Learning -- Trained ML models for classification, evaluating based on a variety of metrics

classification communication data-modeling dataframes numpy pandas python scikit-learn supervised-ml

Last synced: 11 Apr 2026

https://github.com/ojhaharsh/rockvsmine-prediction

Streamlit App for Predicting Rock vs. Mine using SONAR Data. The app allows users to input SONAR readings and returns predictions based on trained data.

machine-learning python scikit-learn

Last synced: 03 Jan 2026

https://github.com/sdpdas/sm_sentiment_analysis

Using Natural Language Processing (NLP) and pandas, numpy, scikit-learn for classification and applying logistic regression as it is a supervised model, lastly NLTK. Pickle library used for saving and running the model anywhere.

logistic-regression machine-learning nlp scikit-learn sentiment-analysis stemming vectorizer

Last synced: 03 Jan 2026

https://github.com/emms21/thoughtful_ai

Technical Screening: an agent that responds to semantically similar questions with pre-defined answers, falling back to generic LLM responses otherwise

openai python scikit-learn streamlit

Last synced: 02 Jan 2026

https://github.com/tszon/data-science-projects

Included are all the worth-noting Data Science projects in my learning journey with DataCamp.

data-analysis data-science exploratory-data-analysis feature-engineering machine-learning modelling preprocessing-data scikit-learn supervised-learning

Last synced: 15 Mar 2025

https://github.com/thbaylson/datascience

All of my past data science assignments put into one singular notebook. Most of this comes from my Machine Learning course.

data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn

Last synced: 09 May 2026

https://github.com/shahzaibdev355/house-recommendation-backend

House Recommendation Backend is a Flask-based API that provides house recommendations based on user input. It includes endpoints to check the backend's functionality, fetch state data, and get personalized house recommendations based on criteria like price, property size, number of beds, number of baths, and state.

flask flask-api flask-cors numpy pandas scikit-learn tensorflow

Last synced: 05 Jan 2026

https://github.com/diegogallegos4/boston_housing

Udacity Machine Learning Nanodegree Project

machine-learning python scikit-learn

Last synced: 11 Apr 2026

https://github.com/pranav-tank/heart-disease-prediction-model

I have created this project as my Python term assignment. In this project I have trained a ML model to predict the heart disease using Scikit-learn library in python.

google-colaboratory jupyter-notebook machine-learning prediction-model python scikit-learn

Last synced: 20 Jan 2026

https://github.com/pakagronglb/istanbul-restaurant-tips-prediction

This project focuses on predicting restaurant tips in Istanbul using machine learning techniques, specifically Linear Regression. The analysis includes data preprocessing, exploratory data analysis (EDA), model training, and evaluation.

kaggle matplotlib numpy pandas prediction python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/ralolooafanxyaiml/neural-evolution-engine

A modular AI evolution simulation engine built with Python and TensorFlow. Uses Deep Learning to predict organism adaptations against environmental threats.

artificial-intelligence deep-learning evolution-simulation keras machine-learning neural-networks numpy oop pandas python scikit-learn simulation tensorflow

Last synced: 07 Jan 2026

https://github.com/itssahilwhat/AI-Fundamentals

A curated collection of fundamental AI concepts, algorithms, and code implementations — including Machine Learning, Deep Learning, and Computer Vision — built from scratch and with practical examples.

computer-vision deep-learning machine-learning numpy pandas python pytorch scikit-learn

Last synced: 02 Jul 2025

https://github.com/saragarcia6123/fraud-classification

Credit Card fraud prediction with SkLearn Classification models.

jupyter machine-learning python scikit-learn

Last synced: 25 Apr 2026

https://github.com/chrispsang/customerchurnanalysis

Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.

customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau

Last synced: 31 Jan 2026

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 20 Jan 2026

https://github.com/aldotestino/word-freq-email-classification

Simple email classifier using word frequency and Logistic Regression

docker email-classification fastapi logistic-regression python react scikit-learn

Last synced: 12 Apr 2026

https://github.com/siddhesh-agarwal/cognizant-prodigi-22

Supply chain event predictability and Visibility

fastapi javascript python react scikit-learn solidity sqlite typescript

Last synced: 12 Apr 2026

https://github.com/lorenzorottigni/ml-titanic

Machine Learning python bootcamp: logistic regression on titanic dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/amirkh777/data-science-yandex-practicum

My projects from the Yandex Practicum Data Science course.

data-science matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 12 Apr 2026

https://github.com/iamkushvanth/marketing-campaign-analysis

Data analysis comparing the performance of Facebook and AdWords ad campaigns in 2019, Using Statical Methods such as A/B test focusing on clicks, conversions, cost-effectiveness, and seasonal trends.

ab-testing hypothesis-testing linear-regression matplotlib-pyplot numpy pandas python regression-analysis scikit-learn scipy stastistics

Last synced: 07 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_05

Метод опорних векторів -Support Vector Machine, SVM. Дерева рішень - RandomForestClassifier, XGBClassifier

classification numpy pandas python random-forest-classifier reporting scikit-learn svm-classifier svm-model xgbclassifier

Last synced: 12 Apr 2026

https://github.com/vijaykumarr1452/ipl-first-innings-score-prediction

Developed a machine learning model to predict probable IPL match score ranges using advanced data analytics and historical cricket performance data.

adaboost ann decision-trees deployment machine-learning ml-regression pandas python random-forest scikit-learn

Last synced: 12 Apr 2026

https://github.com/stefagnone/-air-france-sponsored-search-campaign-optimization

Optimized Air France's online advertising campaigns using advanced regression modeling and data analysis, uncovering actionable insights to maximize ROI and enhance ad performance.

advertising-analytics air-france data-science digital-marketing marketing-campaigns predictive-modeling python r regression-analysis roi-optimization scikit-learn sponsored-search

Last synced: 12 Apr 2026

https://github.com/musty-ess/shopping-prediction-ai-using-machine-learning

This project is an AI model designed to predict whether customers on an online shopping website will complete a purchase. It uses machine learning, specifically a k-nearest neighbors (KNN) classifier, to analyze user behavior and predict their purchasing intent.

ai-prediction classification-model k-nearest-neighbors knn-classifier machine-learning online-shopping python scikit-learn sensitivity-specificity user-behavior-prediction

Last synced: 12 Apr 2026

https://github.com/velinovjovan/turbo-fpl

A modern Fantasy Premier League ⚽ analytics and prediction platform built with Next.js, Supabase, and machine learning — featuring live stats, price predictions, and AI-powered 🤖 point forecasts.

fpl fpl-analysis fpl-api nextjs14 scikit-learn scikitlearn-machine-learning tailwindcss

Last synced: 12 Apr 2026

https://github.com/id-andyyy/alfahack

📈💰 Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/enricobolzonello/ml_homeworks

Homeworks for the Machine Learning Course 2022/23 @ Unipd

linear-regression machine-learning neural-network scikit-learn svm

Last synced: 11 Jun 2025

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 12 Apr 2026

https://github.com/vishnu-vamshii/fraud-detection-using-machine-learning

Developed a machine learning pipeline to detect fraudulent credit card transactions, handling imbalanced data with SMOTE and scaling. Trained models like Logistic Regression and Random Forest. Conducted EDA to identify fraud patterns.

pandas python scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/mindlessmuse666/train-test-splitter

Анализ данных о пассажирах Титаника и разбиение на обучающую и тестовую выборки. Практическое задание по дисциплине "Основы применения методов искусственного интеллекта в программировании".

data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split

Last synced: 12 Apr 2026

https://github.com/mark1708/neurointerfaces-of-information-systems

Laboratory work on the discipline "Neurointerfaces of information systems"

numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/nicolas-giacomelli/modelo-polinomial-api-streamlit

Codigo focado em treinamento de modelo de previsão de salario por tempo de empresa e nivel dentro da empresa disponibilizado por api e usando front-end streamlit para calcular basta inserir tempo de empresa e nivel e calcular o resultado de salario compativel

api fastapi joblib machine-learning matplotlib numpy pandas pingouin pydantic scikit-learn seaborn streamlit uvicorn

Last synced: 12 Apr 2026

https://github.com/jaspreetsingh-exe/sign-language-recognition-system

Sign Language Recognition System is an AI-powered application that enables real-time sign language recognition using MediaPipe and an MLP model. It captures hand gestures, extracts landmark features, and predicts sign language letters dynamically. The project also explores MobileNetV2 and aims to expand into Text-to-Sign Language generation.

deep-learning machine-learning mediapipe mobilenetv2 neural-networks scikit-learn sign signlanguage signlanguagedetection signlanguagerecognition tensorflow

Last synced: 01 May 2026

https://github.com/aishwaryagm1999/insurance-workflow-management

This project is an Insurance Workflow Management System designed to streamline policy management, claims processing, and fraud detection. It includes user account management, customer feedback analysis via NLP, alert notifications through SMS, and a fraud detection model, providing a secure, efficient solution for insurance operations.

css fraud-detection html json labelimg machine-learning natural-language-processing nlp opencv python qr-code-generator random-forest-classifier scikit-learn sms-notification tensorflow textblob twilio user-interface

Last synced: 26 Dec 2025

https://github.com/urvee1810/eda-time-series

A comprehensive time series analysis of French retail quarterly sales data from 2012 to 2017. The project focuses on analyzing sales patterns, seasonal decomposition, and trend analysis using various statistical techniques and visualizations.

arima-modeling data-visualization exploratory-data-analysis matplotlib numpy pandas pmdarima python scikit-learn seaborn statsmodels time-series-analysis trend-analysis

Last synced: 12 Apr 2026

https://github.com/manojkp08/student-performance-analysis

The Student Performance Analyzer is your go-to solution for understanding and improving student performance. By blending the power of machine learning with interactive visualizations, this tool provides educators and learners with personalized insights into learning styles, performance gaps, and actionable improvements.

machine-learning numpy pandas python requests scikit-learn streamlit

Last synced: 12 Apr 2026

https://github.com/brianlesko/maze-runner

Developed a Python-based maze-crawling application using a PS5 controller interface. This project highlights skills in software-hardware integration and low-code UI design, demonstrating expertise ideal for advanced software engineering.

communication dualsense engineer engineering hacking hardware hardware-hacking interface low-code-ui mechanical-engineer mechanical-engineering protocol ps5 python robotics-engineer scikit-learn software sony streamlit ui

Last synced: 12 Apr 2026

https://github.com/raduldev/ml-projects

End To End Machine Learning Project guided by Krish Naik from Ineuron.

catboost dill flask-application numpy pandas python scikit-learn xgboost

Last synced: 12 Apr 2026

https://github.com/dakohhh/politicians-face-recognition

A machine learning model where we classify famous Nigerian politicians. We restrict classification to only 4 people

gridsearchcv jupyter-notebook machine-learning opencv python pywavelets scikit-learn

Last synced: 16 Apr 2026