An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/dharma-acha/imageclassification

This project is an interactive Streamlit web application using the VGG-13 model to classify images from the CIFAR-10 dataset. Users can upload images to receive real-time predictions and visual explanations of the model's decisions. The goal is to accurately classify images into one of the ten CIFAR-10 classes: airplanes, automobiles, birds, cats,

colab-notebook matplotlib numpy pandas python3 pytorch scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/jpcano/boston_housing

Predicting Boston Housing Prices using supervised Machine Learning algorithms

cross-validation machine-learning numpy pandas python regression-models scikit-learn

Last synced: 12 Apr 2026

https://github.com/ccastleberry/sk-autobots

Custom data transformers using the scikit-learn API.

scikit-learn sklearn sklearn-api

Last synced: 08 Feb 2026

https://github.com/themihirmathur/soiligator

Soiligator is an advanced machine learning project designed to optimize irrigation management by predicting whether irrigation is necessary based on environmental and soil-related data.

auc-score logistic-regression machine-learning matplotlib numpy pandas python random-forest-classifier roc-curve scikit-learn seaborn standardscaler support-vector-machine

Last synced: 12 Apr 2026

https://github.com/rohansoni45/movie-recommendation-system

This project is a Content-Based Recommender System that suggests movies to users based on their preferences and watched history. The system leverages cosine similarity to find and recommend movies similar to a selected title. It is built using Python and libraries like Pandas, NumPy, and Scikit-learn.

content-based-filtering cosine-similarity data-analysis data-science machine-learning numpy pandas python recommender-system render scikit-learn

Last synced: 17 Apr 2026

https://github.com/vipulbunny/house-price-prediction

House Price Prediction is a machine learning project that analyzes real estate data to predict house prices based on various features like location, size, and amenities. It involves data preprocessing, exploratory data analysis (EDA), feature engineering, and model training using regression algorithms to provide accurate price estimates. πŸš€πŸ“ŠπŸ‘

ai-in-real-estate data-science data-visualization eda feature-engineering house-price-prediction housing-market-analysis machine-learning predictive-modeling python real-estate-analytics regression-models scikit-learn

Last synced: 03 May 2026

https://github.com/leticiamilan/formacao-python-developer-dio

FormaΓ§Γ£o Python Developer - Digital Innovation One

django flask pandas pymongo python scikit-learn sqlalchemy sqlite

Last synced: 06 Apr 2026

https://github.com/vishant007/annadataa

A Website For Farmers To Guide Them Regarding Crop Prouction In Their Native Language

django flask-application google-collab kaggle machine-learning-algorithms numpy pandas python3 scikit-learn

Last synced: 13 Apr 2026

https://github.com/touhoue/oilpumpvibration

The project employs signal processing techniques like Hilbert transforms to extract amplitude envelopes and instantaneous frequencies, facilitating insights into the mechanical health and performance of the system.

python scikit-learn

Last synced: 07 May 2026

https://github.com/gititsid/visaverdict

A ML project to predict possibility of US Visa approval

classification python3 random-forest-classifier scikit-learn

Last synced: 03 Feb 2026

https://github.com/imnotannamaria/ia-ml-scikit-learn

Learning tradicional machine learning with scikit-learn.

numpy pandas python scikit-learn seaborn

Last synced: 06 Apr 2026

https://github.com/jtorrescarbajal/ev-charger-accessibility-analysis

Capstone project analyzing EV charger sufficiency and equity across ZIP codes on the U.S. West Coast using Python and public datasets.

jupyter-notebook python scikit-learn wilcoxon-signed-rank

Last synced: 18 Apr 2026

https://github.com/lorenzorottigni/ml-universities

Machine Learning python bootcamp: K mean clustering with public/private universities dataset

k-mean-clustering machine-learning numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/jprmaulion/bayesopt-gb-seismic-liquefaction-liq7

Bayesian-optimized gradient boosting for seismic liquefaction prediction with geographic stratified CV on the LIQ/7/2833 global database.

bayesian-optimization binary-classification gradient-boosting lightgbm liquefaction machine-learning python scikit-learn shap shear-wave-velocity soil-mechanics xgboost

Last synced: 29 May 2026

https://github.com/massimilianoviola/entity-matching-dblp-acm

Entity matching on the DBLP-ACM dataset

scikit-learn sentence-transformers

Last synced: 13 Jun 2026

https://github.com/mpoojithavigneswari/bangalore-house-price-prediction

This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.

data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/veerchaudhary0708/credit-fraud-detection

An end-to-end machine learning project to detect credit fraud using XGBoost.

datascience fintech fraud-detection machinelearning scikit-learn xgboost

Last synced: 18 May 2026

https://github.com/aleksandarbuk/machine-learning

The Machine Learning Library repository provides a collection of scripts and tools leveraging Scikit-Learn, Pandas, and NumPy for various machine learning tasks and data analysis.

matplotlib numpy python scikit-learn tensorflow

Last synced: 16 Apr 2026

https://github.com/chaman2003/parkinson-detection

Al-powered Parkinson's Disease Detection System leveraging smartphone sensors (voice and motion) for real-time analysis. Combines ensemble machine learning models (SVM, Random Forest, Gradient Boosting, XGBoost) with advanced feature extraction to provide accurate early detection, sub-second processing, and detailed reporting.

ai flask html-css-javascript librosa ml numpy pandas pydup python scikit-learn

Last synced: 08 Apr 2026

https://github.com/vishal-verma-96/Pre-Owned-Car-Price-prediction-using-Streamlit-App

Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit

Last synced: 02 Mar 2025

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/ssreeramj/hackerearth-exhibit-art

This is my solution to the HackerEarth Exhibit Art Machine Learning Challenge

gradient-boosting machine-learning pandas regression scikit-learn

Last synced: 20 Apr 2026

https://github.com/namratha2301/bangalorehousepricepredictor

Predicting house price in Bangalore based on the key features of the house like number of rooms, size in square feet etc.

azure bashscript docker flake8 flask github-actions scikit-learn

Last synced: 12 Apr 2026

https://github.com/charlescro/reddit-classification-nlp

Analyzing subreddit language via Reddit API and NLP techniques.

data-analysis data-science data-visualization nlp-machine-learning reddit-api scikit-learn

Last synced: 03 Apr 2025

https://github.com/filsan95/project-iot_malware_identification

This repository contains the code and data for a project that detects malware from IoT devices using a publish-subscribe model with Confluent and Databricks. The project streams IoT device data to Kafka, analyzes it, and detects malware using machine learning models such as Random Forest and Gradient Boosted Trees.

apache-kafka classification confluent databricks machine-learning-algorithms scikit-learn sql

Last synced: 16 Mar 2025

https://github.com/altescy/xsklearn

Expanded scikit-learn for my research

python scikit-learn

Last synced: 21 Mar 2025

https://github.com/zahediparsa/ml_birkaracademy

Developed exercises and practical tasks to help students grasp key machine learning topics in a course hosted by Birkar Academy and ICDS.ai

decision-trees iris-dataset knn machine-learning mlp-classifier scikit-learn

Last synced: 02 Jan 2026

https://github.com/karimosman89/health-risk-assessment

Predict health risks based on patient data.Create a machine learning model that predicts health risks (like diabetes or heart disease) based on patient data.Help healthcare providers identify at-risk patients for early intervention.

ehr-data pandas python scikit-learn

Last synced: 06 May 2026

https://github.com/gangula-karthik/bank-transaction-classification

Classifying bank transactions with precisionβ€”your first step towards smarter finance management πŸ’³πŸ€–πŸ“Š

finance machine-learning nlp scikit-learn

Last synced: 09 Apr 2025

https://github.com/fahrettinsolak/ai-map-based-geographic-clustering-project

This project focuses on clustering crime incidents in San Francisco using the K-Means algorithm. The dataset is obtained from Kaggle and contains information about crime types, geographical coordinates, and other relevant features. The goal is to identify crime hotspots through geographic clustering and visualize the clusters on an interactive map.

artificial-intelligence deep-learning elbow-method jupyter-notebook machine-learning numpy openstreetmap pandas phyton plotly scikit-learn standardscaler

Last synced: 05 Apr 2026

https://github.com/ozcankyo28/ds-ml-bootcamp

πŸ“Š Master data science and machine learning in one month with hands-on projects, covering the complete ML workflow from data collection to deployment.

data-science datascience jose-portilla lgbm lgbmregressor machine-learning matplotlib-pyplot python regression-models scikit-learn seaborn tensorflow udemy-course-project udemy-machine-learning

Last synced: 14 Apr 2026

https://github.com/davgiles/ML-Projects

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 20 Jan 2026

https://github.com/codecraft-sanju/medvisionai-medical-image-ai-vision.

MedVisionAI is an AI-powered platform that analyzes ultrasound images to detect PCOS and provide actionable recommendations. Using CNN-based deep learning and generative AI, it ensures fast, accurate diagnosis, reduces errors, and supports clinicians with instant insights all while maintaining patient privacy and compliance.

deep-learning fastapi gemini-api genai keras-tensorflow machine-learning matplotlib python react scikit-learn seaborn tailwindcss tensorflow

Last synced: 07 Sep 2025

https://github.com/rohit-2301/hiresense

HireSense is an AI-powered resume classifier that uses NLP and Machine Learning to predict the best-fit job role from a PDF resume. Built with Streamlit, it features a clean UI for uploading resumes and instantly suggests roles like Data Scientist, Full Stack Developer, and DevOps Engineer.

joblib ml nlp pymupdf python scikit-learn streamlit tfidfvectorizer

Last synced: 22 Jul 2025

https://github.com/souravxbera/credit-card-approval-predictor

End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture

credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit

Last synced: 15 May 2026

https://github.com/otuemre/housepricingml

A machine learning project predicting house prices using regression models. Covers data preprocessing, feature engineering, and model comparison to achieve accurate results. Developed for a Kaggle competition, focusing on effective ML workflows and model interpretability.

eda encoding evaluation-metrics kaggle-competition lightgbm-regressor machine-learning matplotlib-pyplot neural-networks numpy pandas preprocessing python ridge-regression scikit-learn seaborn tensorflow xgboost-regression

Last synced: 13 Apr 2026

https://github.com/hilalozdemirbuyukasik/deep-learning

A collection of deep learning projects demonstrating RNNs, BiLSTMs, CNNs, and basic neural networks applied to time series forecasting, text sentiment analysis, image classification, and tabular data tasks, with examples of data preprocessing, model training, evaluation, and visualization.

bilstm cnn keras matplotlib nn numpy rnn scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/the-developer-306/fake-review-detector

This project is a machine learning-based review classification system that predicts whether a product review is GENUINE or FAKE. It preprocesses review text, analyzes sentiment, and uses numerical features like ratings and helpfulness to make predictions. The model is deployed via a Flask web application for user interaction.

classification flask logistic-regression machine-learning numpy pandas python renderdeploy scikit-learn sentiment-analysis

Last synced: 12 Apr 2026

https://github.com/rahulb813/mvscr

An alternative movie recommendation system for the CineMatchSM with better predictions. Trained with a Kaggle dataset of over 450000+ entries.

gridsearchcv knn python scikit-learn scipy-sparse svd tensorflow xgboost

Last synced: 07 May 2026

https://github.com/cesar312/python-data-science-toolbox

A collection of useful data science tools and techniques

data-science jupyter-notebook pandas python scikit-learn statistics visualization

Last synced: 13 Apr 2026

https://github.com/freakwill/dred

πŸ”΄ dred = dimension reducing for machine learning (suit to sklearn)

dimension-reduction scikit-learn sklearn sklearn-estimator

Last synced: 19 May 2026

https://github.com/rosa-lpz/machine-learning-zoomcamp-2025

Machine Learning Zoomcamp 2025 from DataTalksClub. Based on repository: https://github.com/DataTalksClub/machine-learning-zoomcamp/tree/master

aws deep-learning docker flask kserve kubernetes machine-learning machine-learning-algorithms machine-learning-projects metrics-visualization neural-networks numpy pandas python scikit-learn tensorflow xgboost

Last synced: 06 Apr 2026

https://github.com/danielstankw/car_classification_and_regression

Regression & Classification model using a car data. Involves EDA, feature engineering, pipelines and more.

classification learning machine regression scikit-learn tensorflow trim

Last synced: 29 Apr 2026

https://github.com/labex-labs/scikit-learn-for-beginners

This comprehensive course covers the fundamental concepts and practical techniques of Scikit-learn, the essential machine learning library in Python. Learn to build, train, and evaluate machine learning models using various algorithms and preprocessing techniques.

algorithms beginner-friendly classification clustering course data-science feature-engineering hands-on labex labs machine-learning model-evaluation preprocessing programming python python-programming regression scikit-learn supervised-learning unsupervised-learning

Last synced: 14 May 2026

https://github.com/m-rishab/job-recruitment-prediction-and-hr-dashboard-using-plotly

This project features make it ideal for dynamic HR dashboards, offering insights into candidate profiles and recruitment processes.

correlation-analysis flask kmeans-clustering numpy pandas plotly python scikit-learn seaborn standardscaler

Last synced: 12 Apr 2026

https://github.com/abdullahashfaqvirk/sms-spam-detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 02 Apr 2026

https://github.com/dmarks84/coursework_project_ml-model-eval-refine

Project for IBM Data Science course on ML Models & Analysis -- Read in large dataset of home sales and utilized polynomial linear regression analysis to make predictions of future home sales prices

classification communication data-modeling dataframes machine-learning matplotlib numpy pandas programming python regression scikit-learn scipy seaborn supervised-ml visualization

Last synced: 09 Apr 2026

https://github.com/nurulashraf/telco-customer-churn-prediction-model

This repository contains a Telco Customer Churn Prediction project using machine learning. It includes data preprocessing, exploratory data analysis, feature engineering, and model development to predict customer churn. Key tools used are Python, Pandas, NumPy, Matplotlib, Seaborn, and scikit-learn.

churn-prediction classification-model customer-churn data-visualization exploratory-data-analysis machine-learning predictive-analytics python scikit-learn

Last synced: 16 Mar 2025

https://github.com/hvalfangst/azure-functions-pandas

Azure Functions for ETL operations using Pandas. Uploaded CSV files trigger data processing, calculating correlations and storing results in a JSON file. Automated deployment via GitHub Actions and Terraform.

az-204 azure azure-functions azure-functions-python pandas python scikit-learn terraform

Last synced: 12 Apr 2026

https://github.com/raghavendra317/brain-tumor-detection

A deep learning-based project for classifying brain MRI images into Glioma, Meningioma, No Tumor, and Pituitary Tumor using Convolutional Neural Networks (CNNs). This project aims to assist in early tumor detection, improving diagnostic accuracy and healthcare outcomes.

cnn-classification keras numpy opencv scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/indhra/hands_on_machine_learning

Core ML topics: short summaries , code snippets, Interview prep

keras machine-learning python scikit-learn tensorflow2

Last synced: 06 Apr 2026

https://github.com/hrolive/disaster-response-pipeline

A machine learning pipeline that categorizes disaster related messages so that they can be sent to the appropriate disaster relief agency

flask machine-learning natural-language-processing nltk pandas plotly python scikit-learn sql sqlalchemy

Last synced: 07 Apr 2026

https://github.com/asuquoaa/predicting_viewer_engagement_with_educational_videos

This project uses machine learning to predict video engagement based on features such as transcript complexity, speaker speed, and silence periods. By understanding the factors influencing engagement, we can improve content recommendations and educational experiences.

data-visualization exploratory-data-analysis machine-learning scikit-learn

Last synced: 15 May 2026

https://github.com/mmerlyn/analysis-of-tomato-prices

Forecasting tomato prices in Karnataka using machine learning to help farmers make better crop planning and selling decisions.

css flask html matplotlib numpy pandas python scikit-learn seaborn

Last synced: 06 Jul 2025

https://github.com/arijit-7612/sms-spam-detection

A deep learning–based SMS Spam Detector built with BiLSTM and Keras TextVectorization. The model classifies messages as Spam or Ham with high accuracy and is deployed on Streamlit for real-time text classification with a clean and interactive user interface.

pandas python scikit-learn seaborn streamlit tensorflow

Last synced: 12 Apr 2026

https://github.com/emmarhoffmann/starsort

A multi-model machine learning project that achieves up to 100% accuracy in classifying star types using five different AI algorithms

decision-tree naive-bayes neural-network python random-forest scikit-learn svm

Last synced: 20 May 2026

https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer

Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.

breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm

Last synced: 05 Feb 2026

https://github.com/vamsi0333/ai-sentiment-cicd-project

End-to-end CI/CD deployment of an AI-powered Sentiment Analysis API using FastAPI, Docker, Kubernetes, Terraform, and GitHub Actions. Demonstrates complete MLOps + DevOps workflow.

ai cicd devops docker fastapi github-actions kubernetes mlops scikit-learn terraform

Last synced: 12 Apr 2026

https://github.com/affan005-ai/tesla-stock-prediction

This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models

data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn

Last synced: 05 Oct 2025

https://github.com/supriya811106/twitter-sentiment-analysis

Analyzing the mood of tweets! We sort tweets on popular topics into positive, negative, or neutral categories to gauge public opinion. See what Twitter really thinks!

bernoulli-naive-bayes jupyter-notebook matplotlib nlp-machine-learning nltk numpy pandas python scikit-learn seaborn sentiment-analysis text-classification tfidf-vectorizer wordcloud

Last synced: 05 Apr 2026

https://github.com/hrolive/recommendation-systems-ibm

Analyze the interactions that users have with articles on the IBM Watson Studio platform and make recommendations to them about new articles, using various recommendation engines.

machine-learning natural-language-processing pandas python recomendation-system scikit-learn

Last synced: 12 Apr 2026

https://github.com/rubada/machine-learning-with-ruba-dabbas

Advance your skills and start your career here, by taking the online courses on Intuidemy.

course learning machine machine-learning matplotlib matplotlib-pyplot models numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/shreyazh/respire

To deal with Respiratory Challenges

machine-learning matplotlib numpy pandas pillow python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/abdelrahman-amen/housing-price

Predicting housing prices with machine learning regression models. This project implements Linear Regression, Random Forest, and Decision Tree models for accurate predictions.

decision-tree housing-price-prediction linear-regression machine-learning python random-forest regression-analysis scikit-learn

Last synced: 07 May 2026

https://github.com/somjit101/ds-logistic-regression

A simple implementation of the Logistic Regression Classifier on the Breast Cancer Dataset with L1 regularization and GridSearch for hyperparameter tuning.

breast-cancer-prediction breast-cancer-wisconsin grid-search grid-search-cross-validation hyperparameter-tuning logistic-regression machine-learning-algorithms regularization scikit-learn

Last synced: 19 May 2026

https://github.com/adam-maz/virtual_screening

Within this repository I present scripts that can be helpful during virtual screening in drug design & development.

clusterization jupyter-notebook k-means-clustering maestro-schrodinger medicinal-chemistry molecular-fingerprints pandas python rdkit scikit-learn scoring-functions virtual-screening

Last synced: 04 May 2026

https://github.com/murugavl/crop-prediction

This Crop Prediction System utilizes machine learning to recommend suitable crops based on environmental data. It helps farmers make informed decisions by analyzing factors like soil type and climate. The system aims to enhance agricultural efficiency and productivity.

flask machine-learning python scikit-learn

Last synced: 12 Jun 2025

https://github.com/satyavardhan2k4/medical-insurance-predictor

A linear regression model that predicts medical insurance cost based on the features like age, sex, BMI etc. the dataset values is based in US

machine-learning pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/alphacrypto246/zoo-animal-classifier

A project that uses machine learning to classify animals into categories like Mammals, Birds, and Reptiles based on their characteristics.

machine-learning machine-learning-algorithms random-forest scikit-learn

Last synced: 20 May 2026

https://github.com/santiago-giordano/datascienceproject

Data Science Course Project: Causes of death around the world

apis jupyter-notebook matplotlib pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/mianmharoon/sentimentanalysis_coreml_emotionclassifier

Emotion classification iOS app using CoreML and SwiftUI – demo for sentiment and emotion analysis, with the model converted from Scikit-learn using coremltools.

ai coreml coreml-models emotionclassification ios machinelearning nlp python3 scikit-learn sentimentanalysis swift swiftui

Last synced: 12 Apr 2026

https://github.com/heyitsjoealongi/fantasy-football-qbwr-model

Fantasy Football: Quarterback / Wide Receiver - Gaussian Process Regression (GPR) Machine Learning Model

machine-learning matplotlib model numpy python scikit-learn

Last synced: 01 Apr 2025

https://github.com/lukacerr/lovelytics

Lovelytics technical task for AI engineer position

ai-agents deepagents langchain ml python scikit-learn

Last synced: 31 May 2026

https://github.com/qinhanmin2014/sklearn-svm-guide

Rapidly obtain acceptable results using SVM (based on scikit-learn)

machine-learning scikit-learn svm

Last synced: 03 Apr 2025