An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/hazz-i/codexia-chatbot

Discriminative Chatbot

chatbot nlp scikit-learn

Last synced: 19 May 2026

https://github.com/myahninsi/housing-price-prediction-ml

Final project for Big Data Visualization for Business Communications 01 (DSMM Group 1). Analyzes housing data, identifies key price factors, and builds predictive models using machine learning. Includes Power BI dashboards for interactive visualizations and Flask for deployment.

elastic-net lasso-regression linear-regression matplotlib numpy pandas ridge-regression scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/elprofesoriqo/milab

The segmentation of lumbar spine MRI images to identify key anatomical structures related to lumbar spinal stenosis.

machine-learning machine-learning-images machine-learning-models python scikit-learn

Last synced: 02 Jan 2026

https://github.com/sudo-de/hull_tactical_market_prediction_using_hyperopt

This is a market timing model inspired by Hull Tactical’s approach, using Hyperopt for hyperparameter optimization and backtesting to enhance predictive performance.

algorithms catboost lightgbm optimization optuna python3 scikit-learn scipy xgboost

Last synced: 06 May 2026

https://github.com/alphacrypto246/zoo-animal-classifier

A project that uses machine learning to classify animals into categories like Mammals, Birds, and Reptiles based on their characteristics.

machine-learning machine-learning-algorithms random-forest scikit-learn

Last synced: 20 May 2026

https://github.com/emmarhoffmann/starsort

A multi-model machine learning project that achieves up to 100% accuracy in classifying star types using five different AI algorithms

decision-tree naive-bayes neural-network python random-forest scikit-learn svm

Last synced: 20 May 2026

https://github.com/rohit-2301/hiresense

HireSense is an AI-powered resume classifier that uses NLP and Machine Learning to predict the best-fit job role from a PDF resume. Built with Streamlit, it features a clean UI for uploading resumes and instantly suggests roles like Data Scientist, Full Stack Developer, and DevOps Engineer.

joblib ml nlp pymupdf python scikit-learn streamlit tfidfvectorizer

Last synced: 22 Jul 2025

https://github.com/davgiles/ML-Projects

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 20 Jan 2026

https://github.com/zahediparsa/ml_birkaracademy

Developed exercises and practical tasks to help students grasp key machine learning topics in a course hosted by Birkar Academy and ICDS.ai

decision-trees iris-dataset knn machine-learning mlp-classifier scikit-learn

Last synced: 02 Jan 2026

https://github.com/chaman2003/parkinson-detection

AI-powered Parkinson's Disease Detection System leveraging smartphone sensors (voice and motion) for real-time analysis. Combines ensemble machine learning models (SVM, Random Forest, Gradient Boosting, XGBoost) with advanced feature extraction to provide accurate early detection, sub-second processing, and detailed reporting.

ai flask html-css-javascript librosa ml numpy pandas pydup python scikit-learn

Last synced: 08 Apr 2026

https://github.com/wuweiweiwu/zookeeper-bot

Bot for Facebook Messenger game Zookeeper using scikit-learn SVM 🐪

facebook-messenger scikit-learn svm zookeeper

Last synced: 20 May 2026

https://github.com/barbaraeguche/pyrocast

🚒 A proactive wildfire prediction and analysis tool built with React and Flask.

ai flask ml pandas react scikit-learn vite

Last synced: 08 Apr 2026

https://github.com/nfordumass/nbafinalsmvp_ml

A python machine learning analysis of who should've won NBA Finals MVP each year.

jupyter-notebook machine-learning python scikit-learn

Last synced: 28 Mar 2025

https://github.com/sridharyadav07/ai--powered-task-management-system

An intelligent Task Management System that integrates Sentiment Analysis, Task Optimization, and Forecasting to streamline project and task handling. This AI-powered tool is designed to assist teams and project managers in making data-driven decisions by understanding emotional context, forecasting productivity, and optimizing workload distribution

arima flask joblib jupyter-notebook naive-bayes-classifier nltk numpy pandas pickle-file python randomforestregressor scikit-learn stopwords-removal streamlit tfidf-vectorizer

Last synced: 08 Apr 2026

https://github.com/pramodyasahan/house-price-prediction

This repository contains the code for a machine learning model aimed at predicting housing prices. The model is based on the RandomForestRegressor algorithm from the scikit-learn library and utilizes feature selection, preprocessing, and pipeline techniques for improved performance.

machine-learning numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/vancenceho/college_projects

🎓 Home for school/college homework, problem sets, and mini-projects, consisting mainly of data structures and algorithms

data-structures matlab pandas python queue scikit-learn seaborn stack

Last synced: 08 Apr 2026

https://github.com/bsamseth/triangular-regressor

A scikit-learn compatible implementation of a 2D triangular regressor.

scikit-learn triangulation

Last synced: 20 May 2026

https://github.com/thekartikeyamishra/aipoweredmarketingassistant

AI-Powered Marketing Assistant, an advanced tool designed to enhance your digital marketing campaigns using the power of machine learning (ML) and large language models (LLMs). This project empowers small businesses and MSMEs to create compelling content, analyze campaigns, and strategize effectively.

artificial-intelligence llm matplotlib numpy openai pandas python scikit-learn streamlit

Last synced: 08 Apr 2026

https://github.com/esha-sm/forecastx

This is an interactive web application for forecasting sales data using the ARIMA model. Users can upload their own CSV files or use a default dataset to generate forecasts and visualizations.

arima-model flask-api jupyter-notebook matplotlib pandas plotly python scikit-learn seaborn

Last synced: 27 Feb 2026

https://github.com/freakwill/nb-combination

ensemble classifier with naive bayes combination

bayes-classifier python scikit-learn

Last synced: 20 May 2026

https://github.com/mohit1106/Fraud-Detection-In-Financial-Transactions

an anomaly detection system on 284,807 transactions, achieving an AUC of ~0.972 with CNNs and Autoencoders.

autoencoders cnn-model isolation-forest keras python scikit-learn tensorflow

Last synced: 17 Oct 2025

https://github.com/jihoonerd/restricted-discriminant-analysis

RDA implementation compatible with Scikit-learn API

discriminant-analysis rda scikit-learn

Last synced: 22 Apr 2026

https://github.com/pramodyasahan/titanic-survival

This repository contains a machine learning project focused on predicting the survival of passengers on the Titanic. The project uses a Support Vector Regression (SVR) model from the sklearn library and involves data preprocessing and prediction.

data-preprocessing matplotlib numpy pandas python scikit-learn support-vector-regression

Last synced: 08 Apr 2026

https://github.com/mk2345/fashionmnist-dl-ml

CNN and SVM image classifiers implemented in Keras and Scikit-Learn.

jupyter-notebook keras-tensorflow scikit-image scikit-learn

Last synced: 10 May 2026

https://github.com/kheriberto/logistic_regression_project

A project that analyzes dummy data from an advertising company using logistic regression

data-analysis logistic-regression pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/khaja-shaik-21/heart-disease-prediction-system

This form allows users to enter key health details like age, blood pressure, cholesterol levels, and exercise results to predict the likelihood of heart disease. The data is submitted to the backend for processing, where a machine learning model provides a prediction. The form is styled for a clean and responsive user experience.

css3 flask-application git html5 logestic-regression numpy pandas python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/amon20044/quantum-bayes-classifiers-and-their-application-in-image-classification

Implements Quantum Bayes Classifiers (QBCs) for image classification tasks using the MNIST and Fashion-MNIST datasets, based on the research by Ming-Ming Wang and Xiao-Ying Zhang. The project includes Naïve QBC, SPODE-QBC, TAN-QBC, and Symmetric-QBC, simulated on MindQuantum.

bayesian bayesian-inference classification computing gaussian mindquantum mindspore naive-bayes-classifier qml quantum quantum-computing quantum-machine-learning research-reproduction scikit-learn spode tan

Last synced: 18 May 2026

https://github.com/abidhasanrafi/pharma-sales-analytics

A Streamlit-powered web application for analyzing pharmaceutical sales performance across teams, products, and territories.

matplotlib numpy pandas plotly sales-analysis scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/dwija12903/password-strength-checker

A Model to evaluate the strength of passwords and enhance security for your applications.

matplotlib pandas scikit-learn seaborn xgboost

Last synced: 02 Jan 2026

https://github.com/jenil311/application-of-covid-19-spread-analysis

The objective of this project is to study the COVID-19 outbreak using basic statistical techniques and make short term predictions using ML regression methods.

covid19-tracker machine-learning regression-analysis regression-models ridge-regression scikit-learn

Last synced: 02 Jan 2026

https://github.com/zenklinov/regression_logistic_-_sentiment_analysis

This project demonstrates a sentiment analysis model built on movie review data using Logistic Regression. The model predicts whether a review expresses positive or negative sentiment based on the text provided. It is then applied to #PepGuardiola posts from Twitter (X).

natural-language-processing nlp nltk scikit-learn sentiment-analysis

Last synced: 03 May 2026

https://github.com/mhmudfzli/loan-approval-prediction

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

automl catboost numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/lorenzorottigni/ml-yelp-reviews

Machine Learning python bootcamp: natural language processing on Yelp reviews

machine-learning nlp numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/lren-chuv/sklearn_to_pfa

Convert Scikit Learn models to PFA

pfa-standard scikit-learn

Last synced: 21 May 2026

https://github.com/abhishekbagdiya01/movies-recommendation-system

This repository contains the code for a movie recommendation system built using Jupyter Notebook.

aiml jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/wlopezm-unal/titanic_ship-streamlit

A machine learning project that uses Titanic passenger data to predict whether a passenger survived or died. It applies several machine learning models (Random Forest, GaussianNB, and Logistic Regression). Streamlit, together with FastAPI, is used to display the prediction result.

docker machine-learning optuna scikit-learn streamlit supervised-machine-learning

Last synced: 25 Feb 2025

https://github.com/bjornmelin/ml-algorithm-playground

🧪 Core ML algorithm implementations with GPU acceleration. Featuring optimized implementations across various libraries with comprehensive analysis. 📈

algorithms cuda gpu-computing lightgbm machine-learning python scikit-learn xgboost

Last synced: 13 May 2026

https://github.com/itaxbox/loan-eligibility-prediction

Loan Eligibility Prediction using Decision Tree

decision-trees jupyter-notebook matplotlib pandas python scikit-learn

Last synced: 17 Jun 2025

https://github.com/akankshaaa013/practice-machine-learning-projects

An ML repository that contains several practice notebooks on different machine learning algorithms such as Multiple Linear Regression, KNN Classification, SVM, etc.

classification machine-learning-algorithms regression-algorithms scikit-learn

Last synced: 21 May 2026

https://github.com/sreekar0101/-movie-recommendation-system-using-python

The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits. It utilizes the Python libraries NumPy, pandas, and scikit-learn. The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice.

data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python

Last synced: 02 Jan 2026

https://github.com/douglaside/airlinedelay

[✍🏻Learn] Project aimed at analyzing flight delays, using Python algorithms and machine learning techniques to aid decision-making and identify patterns.

ai alura boxplot data-science graphics histogram machine-learning machine-learning-algorithms pandas python scikit-learn static

Last synced: 28 Jun 2025

https://github.com/achronus/data-exploration

A repository dedicated to interesting data exploration projects I've completed

data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 02 Jan 2026

https://github.com/miguellopezvirues/azure_keyword_cpc

Development and deployment of simple regression model in Azure Machine Learning.

azureml deplyment machine-learning mlflow pandas scikit-learn

Last synced: 09 May 2026

https://github.com/hawkharsh1/house-price-pridiction-model-using-ann

A deep learning-based regression model built using Artificial Neural Networks (ANN) in PyTorch to predict house prices from structured data. This project demonstrates the application of machine learning and deep learning techniques for solving real-world problems in the housing domain.

artificial-neural-networks deep-neural-networks machine-learning numpy pandas python3 pytorch scikit-learn

Last synced: 08 Apr 2026

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/giatraskon/machine_learning_assignments

Machine learning assignments covering regression, classification, neural networks, adversarial examples, and real-time emotion detection using Python. Includes theoretical insights and practical implementations.

adversarial-examples bayesian-inference bias-variance-tradeoff cifar10 classification deep-learning emotion-recognition iris-dataset k-nearest-neighbours keras machine-learning mnist neural-networks opencv pima-indians-diabetes python regression ridge-regression scikit-learn tensorflow

Last synced: 08 Apr 2026

https://github.com/roovedot/house-price-predictions-with-random-forest-regression-model

Applying Machine Learning Regression Model to Predict Housing Prices for Kaggle "Housing Prices Competition for Kaggle Learn Users".

machine-learning pandas random-forest-regression scikit-learn

Last synced: 03 May 2026

https://github.com/haydencordeiro/terafeed

Terafeed - Addressing Zero Hunger in Africa (Sustainability Goal SDG 2)

javscript numpy pandas powerbi python scikit-learn tableau vuejs

Last synced: 08 Apr 2026

https://github.com/ghulammuhayyudin1003/Email-Spam-Detector

ML-powered email spam detector with TF-IDF, Random Forest & OCR for image-based spam. Built on published research (EJASET 2025). Live on Streamlit Cloud.

email-classification email-spam-detection machine-learning nlp ocr scikit-learn streamlit streamlit-webapp tesseract tfidf

Last synced: 22 May 2026

https://github.com/analitico-771/machine_learning_trading_bot

This is an application that implements an algorithmic trading strategy that uses machine learning to automate trade decisions

financial-analysis hvplot logistic-regression machine-learning moving-average pandas-dataframe predictive-modeling python scikit-learn stock-price-prediction support-vector-machine

Last synced: 08 Apr 2026

https://github.com/thekartikeyamishra/predictive-sales-analytics

The Predictive Sales Analytics tool aims to help MSMEs forecast future sales using historical data. This advanced version leverages Machine Learning for accurate predictions and provides a dashboard to visualize sales trends, seasonality, and predictions.

joblib machine-learning matplotlib pandas python scikit-learn streamlit

Last synced: 08 Apr 2026

https://github.com/rakibhhridoy/visualmachinelearning-yellowbrick

Yellowbrick wraps scikit-learn and matplotlib to create publication-ready figures and interactive data explorations. It is a diagnostic visualization platform for machine learning that allows us to steer the model selection process by helping to evaluate the performance, stability, and predictive value of our models, and it further assists in diagnosing problems in our workflow.

classification hyperparameter-tuning machine-learning model-evaluation model-view-presenter model-visualization python random-forest random-forest-classifier scikit-learn visualization xgboost xgboost-algorithm yellowbrick

Last synced: 03 May 2026

https://github.com/rakibhhridoy/breastcanceranalysis-classificationclustering

Breast cancer prediction using both classification and clustering methods to better understand the data. Although clustering is different from classification, finding the key aspects of the data sometimes requires every possible way of capturing its behavior.

breast-cancer-prediction breastcancer-classification classification classification-algorithm clustering eda hyperparameter-optimization machine-learning python scikit-learn supervised-learning unsupervised-learning

Last synced: 14 Apr 2026

https://github.com/paragon279/cancercellclassificationbyscikitlearn

Classifies cancer cells using Scikit-learn

machine-learning python scikit-learn

Last synced: 22 May 2026

https://github.com/aysh2603/twitter-sentiment-analysis

The Twitter Sentiment Analysis project employs Natural Language Processing (NLP) techniques to classify tweets into positive or negative sentiments. By analyzing the tone of tweets, this project provides insights into public sentiment on various topics.

hyperparameter-tuning nlp-machine-learning numpy pandas python3 scikit-learn

Last synced: 13 Jul 2025

https://github.com/leosolar8/mental-health-tech-ai-survey

Mental Health in Tech Survey Analysis — Applied K-means clustering, PCA, and Chi-square tests on tech industry survey data to uncover patterns between remote work practices and mental health consequences, with visualizations of key trends.

clustering data-science kmeans machine-learning mental-health pca python-project scikit-learn seaborn survey-analysis tech-industry visualization

Last synced: 08 Apr 2026

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/daniel-furman/recfeatureselect

Feature selection functions (1) using the multi-collinearity matrix and recursively proceeding to a spearman threshold and (2) using Forward Stepwise Selection running on an ensemble sklearner (with options for HPO).

correlation-threshold machine-learning modeling multicollinearity recursion recursive-algorithm scikit-learn spearman-rho

Last synced: 03 Jan 2026

https://github.com/myself-aas/predict-influence-of-social-media-and-other-socio-demographic-factors-on-study-duration

'The Study Duration Prediction Web App' uses machine learning to predict student study time based on factors like GPA, family background, social media engagement, and personal influences. Built with Flask and scikit-learn, it offers personalized insights into how lifestyle choices affect academic performance and study habits.

flask-application machine-learning machine-learning-algorithms prediction-model python scikit-learn scikitlearn-machine-learning webapp

Last synced: 23 Jul 2025

https://github.com/tabotcharlesbessong/python-errors

This repository will contain all the Python errors I encounter in my life as a Python developer, plus their solutions

matplotlib-animation matplotlib-pyplot numpy pandas python-script python3 scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/tomas542/dl_examples

Examples of Machine Learning, Deep Learning, Natural Language Processing and so on

computer-vision cv deep-learning dl keras machine-learning ml natural-language-processing nlp numpy python pytorch scikit-learn

Last synced: 08 Apr 2026

https://github.com/notshrirang/m2connex

M2ConneX is an all-encompassing platform specifically crafted for MMCOE alumni, enabling seamless communication, networking, and collaboration. It provides tailored recommendations for connections, posts, and job opportunities based on each user's unique skills and experience.

django django-rest-framework scikit-learn

Last synced: 28 Jun 2025

https://github.com/colinwu0403/weatherpredictor

ML model that predicts future weather temperatures. Dataset taken from NOAA's Climate Data Online

pandas scikit-learn

Last synced: 02 May 2026

https://github.com/gmork2/covid-19

A mathematical analysis of the infection growth

coronavirus covid-19 jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/beolawork-art/novabank-churn-analysis

NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.

data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql

Last synced: 08 Apr 2026

https://github.com/lorenzorottigni/ml-lending-club

Machine Learning python bootcamp: random forest classifier on LendingClub dataset

ipynb machine-learning numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/shahbazshaddy/explainable-multimodal-ai-for-breast-cancer-and-pneumonia-prediction

A deep learning-based framework integrating explainable multimodal AI for accurate prediction and transparent diagnosis of breast cancer and pneumonia.

deep-learning explainable-ai grad-cam groq-api llm machine-learning matplotlib multimodal numpy pandas python pytorch scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/vladstudennikov/diabetes-prediction-app

ML-powered web app built with Laravel and Vue.js to predict diabetes risk based on users' daily habits and behavior

cypress data-analysis diabetes-prediction fastapi inertiajs laravel matplotlib medicine ml pandas php scikit-learn seaborn vuejs

Last synced: 08 Apr 2026

https://github.com/ashrw/handwritten_digit_recognizer

A handwritten digit recognition system using Python and Scikit-learn to preprocess images and classify digits with a trained SVM model.

ml python scikit-learn

Last synced: 03 Jan 2026

https://github.com/jhylin/ml1-1_small_mols_in_chembl

Polars dataframe library and logistic regression in scikit-learn (update)

logistic-regression machine-learning parquet-files polars-dataframe scikit-learn

Last synced: 03 Jan 2026

https://github.com/andrewsy1004/mask-detection

Mask detection system capable of identifying individuals with or without masks

kaggle keras python scikit-learn tensorflow

Last synced: 08 Apr 2026

https://github.com/hariprasath-v/machinehack_analytics_olympiad_2023

Create a machine learning model to determine the likelihood of a customer defaulting on a loan based on credit history, payment behavior, and account details.

binaryclassification catboost exploratory-data-analysis machine-learning numpy pandas python scikit-learn shap

Last synced: 08 Apr 2026

https://github.com/priyanshulathi/cancer-diagnosis-prediction-model

A Machine Learning project to predict cancer malignancy using K-Nearest Neighbor, Support Vector Machine, and Decision Tree algorithms.

machine-learning numpy pandas python scikit-learn

Last synced: 03 Jan 2026

https://github.com/rinuya/ml-cancer-diagnosis

Binary classification using MLP & Random Forest

ml mlp random-forest scikit-learn

Last synced: 03 Jan 2026

https://github.com/barraharrison/seoul-bike-sharing

Performing EDA on a kaggle dataset to look at the distribution of Seoul's bike-sharing system

jupyterlab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 23 Jul 2025

https://github.com/ledsouza/deep-learning-noticias

This project aims to build two Machine Learning models: one to classify news into different categories and another to perform text autocompletion, predicting the next word in a sentence. The provided dataset consists of news articles from a news website, already preprocessed and stored in a CSV file.

deep-learning keras machine-learning python scikit-learn tensorflow

Last synced: 08 Mar 2026

https://github.com/alphacrypto246/insurance-charges-prediction

The Predicting Insurance Charges project uses Decision Tree Regression to predict insurance charges based on features like age, sex, BMI, and smoking habits. It involves data preprocessing, feature scaling, and model evaluation with metrics like MAE and R².

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/radoslawregula/geo-music-classification

Jupyter notebook implementing a classification solution to the geographical origins of music problem.

classification jupyter-notebook machine-learning pandas python random-forest-classifier scikit-learn

Last synced: 17 Apr 2026