An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/nika2811/new-york-city-taxi-fare-prediction

About In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 8 features along with GPS coordinates of pickup and dropoff

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 06 Apr 2025

https://github.com/gangula-karthik/altitude-analytics

Developed a powerful model that predicts airline review sentiments—promoter, passive, or detractor—to help airlines sharpen their marketing strategies and boost customer loyalty 🚀 ✨

airlines data-science machine-learning python scikit-learn sentiment-analysis supervised-learning

Last synced: 15 Apr 2026

https://github.com/agrawalchaitany/product_recommendation

Smart Product Recommender is an AI-driven platform using web scraping and machine learning to deliver personalized shopping recommendations by analyzing product data and user preferences.

beautifulsoup cosine-similarity jupyter-notebook pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/somjit101/ds-logistic-regression

A simple implementation of the Logistic Regression Classifier on the Breast Cancer Dataset with L1 regularization and GridSearch for hyperparameter tuning.

breast-cancer-prediction breast-cancer-wisconsin grid-search grid-search-cross-validation hyperparameter-tuning logistic-regression machine-learning-algorithms regularization scikit-learn

Last synced: 19 May 2026

https://github.com/freakwill/dred

🔴 dred = dimension reducing for machine learning (suit to sklearn)

dimension-reduction scikit-learn sklearn sklearn-estimator

Last synced: 19 May 2026

https://github.com/lopez86/datascienceexamples

Examples of various data science & data analysis topics using various sources of data.

data-analysis data-science pandas scikit-learn tutorial visualization

Last synced: 13 Apr 2026

https://github.com/vibhore-sagar/stock-movement-analysis-based-on-social-media-sentiment

This project aims to develop a machine learning model that predicts stock movements by analyzing sentiment in social media discussions (Reddit posts).

joblib numpy pandas praw python scikit-learn textblob-sentiment-analysis yfinance

Last synced: 09 Apr 2026

https://github.com/shubhamgoyal575/credit-card-fraud-detection

📌 Credit Card Fraud Detection using Machine Learning This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.

adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost

Last synced: 08 Feb 2026

https://github.com/davidcgong/birddog.io

Real estate forecasting using Zillow Research data

forecasting pandas scikit-learn

Last synced: 19 May 2026

https://github.com/katiebristol/epsilon_fe2o3_controls

Exploratory Data Analysis using machine learning techniques as an exercise for GLY6932 (Data Science and Machine Learning Methods in the Geosciences) at the University of Florida.

biplot exploratory-data-analysis k-means-clustering machine-learning one-hot-encoding paleomagnetism principal-component-analysis random-forest rock-magnetism scikit-learn

Last synced: 19 May 2026

https://github.com/jazib-2004/face-mask-detection-using-cnns

Face mask detection can be very useful in environments like hospital emergency rooms or ICUs where wearing mask is mandatory. It can also help in pandemics like COVID where such models can detect if a person is wearing a mask or not. In this project, I used a Convolutional Nueral Network architecture to train a face mask detection algorithm based.

convolutional-neural-networks keras object-detection python scikit-learn tensorflow

Last synced: 08 Apr 2026

https://github.com/shamikaredkar/nextbinge-old

NextBinge is a personalized movie recommender system designed to suggest movies similar to the ones you've watched and liked. It leverages content-based filtering, recommending movies by analyzing genres, keywords, cast, and crew information.

machine-learning nlp-machine-learning python scikit-learn scikitlearn-machine-learning streamlit streamlit-webapp

Last synced: 14 Mar 2025

https://github.com/xprithvi/random-forest-regressor

This Jupyter notebook serves as a machine learning template to quickly make predictions and analyse feature importance in a dataset.

data-science feature-extraction machine-learning random-forest random-forest-regression scikit-learn

Last synced: 14 Mar 2025

https://github.com/v41bh4vr4jput/medical-diagnosis-assistant

Medical Diagnosis Assistant is a smart, multi-featured Python application that helps predict diseases based on symptoms and medical metrics using Machine Learning and Deep Learning techniques. It assists users in making informed decisions by analyzing health data and generating detailed PDF reports

logistic-regression pandas python random-forest requests scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/shendrew/coursescope

Machine learning for course classification by topics

beautifulsoup machine-learning nlp scikit-learn

Last synced: 09 Apr 2026

https://github.com/lourdilene/guess-the-number

Number guessing game played between two players: human and computer. Basic Python project for studying object-oriented programming and machine learning with the scikit-learn library.

machine-learning oops-in-python scikit-learn

Last synced: 20 May 2026

https://github.com/hazz-i/codexia-chatbot

Deskriminatif Chatbot

chatbot nlp scikit-learn

Last synced: 19 May 2026

https://github.com/myahninsi/housing-price-prediction-ml

Final project for Big Data Visualization for Business Communications 01 (DSMM Group 1). Analyzes housing data, identifies key price factors, and builds predictive models using machine learning. Includes Power BI dashboards for interactive visualizations and Flask for deployment.

elastic-net lasso-regression linear-regression matplotlib numpy pandas ridge-regression scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/elprofesoriqo/milab

The segmentation of lumbar spine MRI images to identify key anatomical structures related to lumbar spinal stenosis.

machine-learning machine-learning-images machine-learning-models python scikit-learn

Last synced: 02 Jan 2026

https://github.com/sudo-de/hull_tactical_market_prediction_using_hyperopt

This is based market timing model inspired by Hull Tactical’s approach, using Hyperopt for hyperparameter optimization and backtesting to enhance predictive performance.

algorithms catboost lightgbm optimization optuna python3 scikit-learn scipy xgboost

Last synced: 06 May 2026

https://github.com/alphacrypto246/zoo-animal-classifier

A project that uses machine learning to classify animals into categories like Mammals, Birds, and Reptiles based on their characteristics.

machine-learning machine-learning-algorithms random-forest scikit-learn

Last synced: 20 May 2026

https://github.com/emmarhoffmann/starsort

A multi-model machine learning project that achieves up to 100% accuracy in classifying star types using five different AI algorithms

decision-tree naive-bayes neural-network python random-forest scikit-learn svm

Last synced: 20 May 2026

https://github.com/rohit-2301/hiresense

HireSense is an AI-powered resume classifier that uses NLP and Machine Learning to predict the best-fit job role from a PDF resume. Built with Streamlit, it features a clean UI for uploading resumes and instantly suggests roles like Data Scientist, Full Stack Developer, and DevOps Engineer.

joblib ml nlp pymupdf python scikit-learn streamlit tfidfvectorizer

Last synced: 22 Jul 2025

https://github.com/davgiles/ML-Projects

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 20 Jan 2026

https://github.com/zahediparsa/ml_birkaracademy

Developed exercises and practical tasks to help students grasp key machine learning topics in a course hosted by Birkar Academy and ICDS.ai

decision-trees iris-dataset knn machine-learning mlp-classifier scikit-learn

Last synced: 02 Jan 2026

https://github.com/chaman2003/parkinson-detection

Al-powered Parkinson's Disease Detection System leveraging smartphone sensors (voice and motion) for real-time analysis. Combines ensemble machine learning models (SVM, Random Forest, Gradient Boosting, XGBoost) with advanced feature extraction to provide accurate early detection, sub-second processing, and detailed reporting.

ai flask html-css-javascript librosa ml numpy pandas pydup python scikit-learn

Last synced: 08 Apr 2026

https://github.com/wuweiweiwu/zookeeper-bot

Bot for Facebook Messenger game Zookeeper using scikit-learn SVM :dromedary_camel:

facebook-messenger scikit-learn svm zookeeper

Last synced: 20 May 2026

https://github.com/barbaraeguche/pyrocast

🚒 a proactive wildfire prediction & analysis built with react & flask.

ai flask ml pandas react scikit-learn vite

Last synced: 08 Apr 2026

https://github.com/nfordumass/nbafinalsmvp_ml

A python machine learning analysis of who should've won NBA Finals MVP each year.

jupyter-notebook machine-learning python scikit-learn

Last synced: 28 Mar 2025

https://github.com/sridharyadav07/ai--powered-task-management-system

An intelligent Task Management System that integrates Sentiment Analysis, Task Optimization, and Forecasting to streamline project and task handling. This AI-powered tool is designed to assist teams and project managers in making data-driven decisions by understanding emotional context, forecasting productivity, and optimizing workload distribution

arima flask joblib jupyter-notebook naive-bayes-classifier nltk numpy pandas pickle-file python randomforestregressor scikit-learn stopwords-removal streamlit tfidf-vectorizer

Last synced: 08 Apr 2026

https://github.com/pramodyasahan/house-price-prediction

This repository contains the code for a machine learning model aimed at predicting housing prices. The model is based on the RandomForestRegressor algorithm from the scikit-learn library and utilizes feature selection, preprocessing, and pipeline techniques for improved performance.

machine-learning numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/vancenceho/college_projects

🎓 House of school/college homework, problem sets & mini-projects which consists mainly of data structures & algorithms

data-structures matlab pandas python queue scikit-learn seaborn stack

Last synced: 08 Apr 2026

https://github.com/bsamseth/triangular-regressor

A scikit-learn compatible implementation of a 2D triangular regressor.

scikit-learn triangulation

Last synced: 20 May 2026

https://github.com/thekartikeyamishra/aipoweredmarketingassistant

AI-Powered Marketing Assistant, an advanced tool designed to enhance your digital marketing campaigns using the power of machine learning (ML) and large language models (LLMs). This project empowers small businesses and MSMEs to create compelling content, analyze campaigns, and strategize effectively.

artificial-intelligence llm matplotlib numpy openai pandas python scikit-learn streamlit

Last synced: 08 Apr 2026

https://github.com/esha-sm/forecastx

This is an interactive web application for forecasting sales data using the ARIMA model. Users can upload their own CSV files or use a default dataset to generate forecasts and visualizations.

arima-model flask-api jupyter-notebook matplotlib pandas plotly python scikit-learn seaborn

Last synced: 27 Feb 2026

https://github.com/freakwill/nb-combination

ensemble classifier with naive bayes combination

bayes-classifier python scikit-learn

Last synced: 20 May 2026

https://github.com/mohit1106/Fraud-Detection-In-Financial-Transactions

an anomaly detection system on 284,807 transactions, achieving an AUC of ~0.972 with CNNs and Autoencoders.

autoencoders cnn-model isolation-forest keras python scikit-learn tensorflow

Last synced: 17 Oct 2025

https://github.com/jihoonerd/restricted-discriminant-analysis

RDA implementation compatible with Scikit-learn API

discriminant-analysis rda scikit-learn

Last synced: 22 Apr 2026

https://github.com/pramodyasahan/titanic-survival

This repository contains a machine learning project focused on predicting the survival of passengers on the Titanic. The project uses a Support Vector Regression (SVR) model from the sklearn library and involves data preprocessing and prediction.

data-preprocessing matplotlib numpy pandas python scikit-learn support-vector-regression

Last synced: 08 Apr 2026

https://github.com/mk2345/fashionmnist-dl-ml

CNN and SVM image classifiers implemented in Keras and Scikit-Learn.

jupyter-notebook keras-tensorflow scikit-image scikit-learn

Last synced: 10 May 2026

https://github.com/kheriberto/logistic_regression_project

A project that analyses dummie data from an advertising company using logistic regression

data-analysis logistic-regression pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/khaja-shaik-21/heart-disease-prediction-system

This form allows users to enter key health details like age, blood pressure, cholesterol levels, and exercise results to predict the likelihood of heart disease. The data is submitted to the backend for processing, where a machine learning model provides a prediction. The form is styled for a clean and responsive user experience.

css3 flask-application git html5 logestic-regression numpy pandas python3 scikit-learn

Last synced: 12 Apr 2026

https://github.com/amon20044/quantum-bayes-classifiers-and-their-application-in-image-classification

implementing Quantum Bayes Classifiers (QBCs) for image classification tasks using MNIST and Fashion-MNIST datasets, based on the research by Ming-Ming Wang and Xiao-Ying Zhang. The project includes Naïve QBC, SPODE-QBC, TAN-QBC, and Symmetric-QBC, simulated on MindQuantum.

bayesian bayesian-inference classification computing gaussian mindquantum mindspore naive-bayes-classifier qml quantum quantum-computing quantum-machine-learning research-reproduction scikit-learn spode tan

Last synced: 18 May 2026

https://github.com/abidhasanrafi/pharma-sales-analytics

A Streamlit-powered web application for analyzing pharmaceutical sales performance across teams, products, and territories.

matplotlib numpy pandas plotly sales-analysis scikit-learn seaborn streamlit

Last synced: 08 Apr 2026

https://github.com/dwija12903/password-strength-checker

A Model to evaluate the strength of passwords and enhance security for your applications.

matplotlib pandas scikit-learn seaborn xgboost

Last synced: 02 Jan 2026

https://github.com/jenil311/application-of-covid-19-spread-analysis

The objective of this project is to study the COVID-19 outbreak using basic statistical techniques and make short term predictions using ML regression methods.

covid19-tracker machine-learning regression-analysis regression-models ridge-regression scikit-learn

Last synced: 02 Jan 2026

https://github.com/zenklinov/regression_logistic_-_sentiment_analysis

This project demonstrates sentiment analysis, model based on movie review data using Logistic Regression. The model predicts whether a review expresses positive or negative sentiment based on the text provided. The application on #PepGuardiola from Twitter (X).

natural-language-processing nlp nltk scikit-learn sentiment-analysis

Last synced: 03 May 2026

https://github.com/mhmudfzli/loan-approval-prediction

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

automl catboost numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/lorenzorottigni/ml-yelp-reviews

Machine Learning python bootcamp: natural language processing on Yelp reviews

machine-learning nlp numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/lren-chuv/sklearn_to_pfa

Convert Scikit Learn models to PFA

pfa-standard scikit-learn

Last synced: 21 May 2026

https://github.com/abhishekbagdiya01/movies-recommendation-system

This repository contains the code for a movie recommendation system built using Jupyter Notebook.

aiml jupyter-notebook numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/wlopezm-unal/titanic_ship-streamlit

Machine Learning model, where using titanic ship data and see if is be able to predict if a passager was salved or died. This apply use Machien learning (Random Forest, gassianNB and Logistic Regressión) . Further, using streamlit together to FastApi be able to see the predict result

docker machine-learning optuna scikit-learn streamlit supervised-machine-learning

Last synced: 25 Feb 2025

https://github.com/bjornmelin/ml-algorithm-playground

🧪 Core ML algorithm implementations with GPU acceleration. Featuring optimized implementations across various libraries with comprehensive analysis. 📈

algorithms cuda gpu-computing lightgbm machine-learning python scikit-learn xgboost

Last synced: 13 May 2026

https://github.com/itaxbox/loan-eligibility-prediction

Loan Eligibility Prediction using Decision Tree

decision-trees jupyter-notebook matplotlib pandas python scikit-learn

Last synced: 17 Jun 2025

https://github.com/akankshaaa013/practice-machine-learning-projects

A ML repository that contains several practice notebooks of different Machine Learning Algorithm such as Multiple Linear Regression, KNN Classification, SVM, etc.

classification machine-learning-algorithms regression-algorithms scikit-learn

Last synced: 21 May 2026

https://github.com/sreekar0101/-movie-recommendation-system-using-python

The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits.ultilizes python libraries numpy pandas and scikit learn.The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice

data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python

Last synced: 02 Jan 2026

https://github.com/douglaside/airlinedelay

[✍🏻Learn] Project aimed at analyzing flight delays, using Python algorithms and machine learning techniques to aid decision-making and identify patterns.

ai alura boxplot data-science graphics histogram machine-learning machine-learning-algorithms pandas python scikit-learn static

Last synced: 28 Jun 2025

https://github.com/achronus/data-exploration

A repository dedicated to interesting data exploration projects I've completed

data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 02 Jan 2026

https://github.com/miguellopezvirues/azure_keyword_cpc

Development and deployment of simple regression model in Azure Machine Learning.

azureml deplyment machine-learning mlflow pandas scikit-learn

Last synced: 09 May 2026

https://github.com/hawkharsh1/house-price-pridiction-model-using-ann

A deep learning-based regression model built using Artificial Neural Networks (ANN) in PyTorch to predict house prices from structured data. This project demonstrates the application of machine learning and deep learning techniques for solving real-world problems in the housing domain.

artificial-neural-networks deep-neural-networks machine-learning numpy pandas python3 pytorch scikit-learn

Last synced: 08 Apr 2026

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/chawthinn/car-price-prediction-regression-ml

This project predicts used car prices using regression models including Linear, Ridge, Random Forest, XGBoost, and LightGBM. It covers preprocessing, EDA, model evaluation, hyperparameter tuning, and model persistence using Scikit-learn and related libraries.

car-price-prediction lgbmregressor linearregression numpy pandas python scikit-learn xgbregressor

Last synced: 08 Apr 2026

https://github.com/giatraskon/machine_learning_assignments

Machine learning assignments covering regression, classification, neural networks, adversarial examples, and real-time emotion detection using Python. Includes theoretical insights and practical implementations.

adversarial-examples bayesian-inference bias-variance-tradeoff cifar10 classification deep-learning emotion-recognition iris-dataset k-nearest-neighbours keras machine-learning mnist neural-networks opencv pima-indians-diabetes python regression ridge-regression scikit-learn tensorflow

Last synced: 08 Apr 2026

https://github.com/roovedot/house-price-predictions-with-random-forest-regression-model

Applying Machine Learning Regression Model to Predict Housing Prices for Kaggle "Housing Prices Competition for Kaggle Learn Users".

machine-learning pandas random-forest-regression scikit-learn

Last synced: 03 May 2026

https://github.com/haydencordeiro/terafeed

Terafeed - Addressing Zero Hunger in Africa (Sustainability Goal SDG 2)

javscript numpy pandas powerbi python scikit-learn tableau vuejs

Last synced: 08 Apr 2026

https://github.com/ghulammuhayyudin1003/Email-Spam-Detector

ML-powered email spam detector with TF-IDF, Random Forest & OCR for image-based spam. Built on published research (EJASET 2025). Live on Streamlit Cloud.

email-classification email-spam-detection machine-learning nlp ocr scikit-learn streamlit streamlit-webapp tesseract tfidf

Last synced: 22 May 2026

https://github.com/analitico-771/machine_learning_trading_bot

This is an Application that implements an algorithmic trading strategy that uses machine learning to automate the trade decisions

financial-analysis hvplot logistic-regression machine-learning moving-average pandas-dataframe predictive-modeling python scikit-learn stock-price-prediction support-vector-machine

Last synced: 08 Apr 2026

https://github.com/thekartikeyamishra/predictive-sales-analytics

The Predictive Sales Analytics tool aims to help MSMEs forecast future sales using historical data. This advanced version leverages Machine Learning for accurate predictions and provides a dashboard to visualize sales trends, seasonality, and predictions.

joblib machine-learning matplotlib pandas python scikit-learn streamlit

Last synced: 08 Apr 2026

https://github.com/rakibhhridoy/visualmachinelearning-yellowbrick

Yellowbrick wraps the scikit-learn and matplotlib to create publication-ready figures and interactive data explorations. It is a diagnostic visualization platform for machine learning that allows us to steer the model selection process by helping to evaluate the performance, stability, and predictive value of our models and further assist in diagnosing the problems in our workflow.

classification hyperparameter-tuning machine-learning model-evaluation model-view-presenter model-visualization python random-forest random-forest-classifier scikit-learn visualization xgboost xgboost-algorithm yellowbrick

Last synced: 03 May 2026

https://github.com/rakibhhridoy/breastcanceranalysis-classificationclustering

Breast cancer prediction both in classification and clustering method for better understanding the data. Though clustering is different from classification,to finding the key aspect the data have,sometimes we need every possible way to catch behavior of the data.

breast-cancer-prediction breastcancer-classification classification classification-algorithm clustering eda hyperparameter-optimization machine-learning python scikit-learn supervised-learning unsupervised-learning

Last synced: 14 Apr 2026

https://github.com/paragon279/cancercellclassificationbyscikitlearn

Classifies cancer cell using Scikit-learn

machine-learning python scikit-learn

Last synced: 22 May 2026

https://github.com/aysh2603/twitter-sentiment-analysis

The Twitter Sentiment Analysis project employs Natural Language Processing (NLP) techniques to classify tweets into positive or negative sentiments. By analyzing the tone of tweets, this project provides insights into public sentiment on various topics.

hyperparameter-tuning nlp-machine-learning numpy pandas python3 scikit-learn

Last synced: 13 Jul 2025

https://github.com/leosolar8/mental-health-tech-ai-survey

Mental Health in Tech Survey Analysis — Applied K-means clustering, PCA, and Chi-square tests on tech industry survey data to uncover patterns between remote work practices and mental health consequences, with visualizations of key trends.

clustering data-science kmeans machine-learning mental-health pca python-project scikit-learn seaborn survey-analysis tech-industry visualization

Last synced: 08 Apr 2026

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026