An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/aryansingla45/cab-fare-prediction

This repository contains a Cab Fare Prediction Web App that uses machine learning to predict fares based on distance, time, and location, with Google Maps API integration for route and fare calculations. It features a user-friendly interface and secure authentication, powered by a Kaggle dataset.

flask-application google-maps-api html-css-javascript machine-learning machine-learning-algorithms pandas-numpy python scikit-learn weather-api

Last synced: 11 May 2026

https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model

This project implements a spam detection system for SMS messages using machine learning techniques.

mathplotlib nltk numpy panda python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/victorkiosh/fake-news-detection

Detecting fake news using NLP and machine learning (Logistic Regression, Random Forest, XGBoost)

data-science fake-news-detection machine-learning nlp scikit-learn xgboost

Last synced: 18 May 2026

https://github.com/alam025/ai-email-guardian

🛡️ AI-Powered Email Guardian: 99.2% accurate spam detection using machine learning. Open-source, privacy-focused email security. ⚡ 50ms detection time.

artificial-intelligence email-filter email-security hishing-detection machine-learning-cybersecurity nlp open-source privacy python scikit-learn security-tools spam-detection tensorflow text-classification

Last synced: 10 Mar 2026

https://github.com/md-emon-hasan/ml-project-car-price-prediction

🚗 End-to-end ML project for predicting car prices based on various features. Includes data preprocessing, model training, and a Flask web app for predictions.

car-price-prediction car-price-predictor data-science feature-engineering ml predictive-modeling scikit-learn

Last synced: 10 Mar 2026

https://github.com/lasithaamarasinghe/stock-market-price-prediction

This ML model predicts the price of the S&P500 Stock Market Index using RandomForestClassifier

jupyter-notebook machine-learning pandas python random-forest-classifier scikit-learn sp500 stock-market-price-prediction yfinance

Last synced: 10 Apr 2026

https://github.com/samarpan-rai/serveitlearn

Creates an extremely thin layer around the FastAPI library that lets you set up an endpoint very quickly.

fastapi inference ml pypi scikit-learn

Last synced: 12 Apr 2026

https://github.com/catlikeflyer/rsp-recognition

A computer vision project to recognize thumbs up

machine-learning mediapipe-hands python scikit-learn

Last synced: 16 May 2026

https://github.com/singhkunwardeep/twitter_sentiment_analysis

A machine learning project that classifies Twitter sentiment into positive or negative categories using Logistic Regression and TF-IDF vectorization. It covers data preprocessing, feature extraction, model training, and evaluation of tweet sentiment. Built with Python, NLTK, and scikit-learn.

logistic-regression nltk-python pandas-dataframe python3 scikit-learn tfidf-vectorizer

Last synced: 05 May 2026

https://github.com/rahimizadeh/prediction-api-with-flask-and-mlflow

An end-to-end machine learning project demonstrating model lifecycle management with MLflow and production deployment using Flask.

flask machine-learning mlflow mlops-workflow python random-forest-regression rest-api scikit-learn

Last synced: 13 Apr 2026

https://github.com/chitralputhran/tutorial-sklearn-columntransformer

ColumnTransformer was introduced in scikit-learn version 0.20. The notebook contains a quick and easy tutorial on ColumnTransformer to get you started.

scikit-learn

Last synced: 17 May 2026

https://github.com/evanmarshall-dev/evanmarshall-tech

Professional IT services platform featuring serverless AWS infrastructure, ML-powered service recommendations, and automated CI/CD deployment. Built to showcase full-stack development, cloud architecture, and machine learning engineering skills.

api-gateway aws ci-cd cloud-computing cloudfront devops full-stack github-actions infrastructure-as-code lambda machine-learning mlops nextjs portfolio python react s3 scikit-learn serverless terraform

Last synced: 13 Apr 2026

https://github.com/karimosman89/legal-document-nlp

Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements. Use NLP techniques for named entity recognition and text classification. Streamline the review process for legal teams by automating information extraction.

nltk python scikit-learn spacy

Last synced: 11 Apr 2026

https://github.com/lfenzo/ml-solar-sao-paulo

Implementation of a scientific project on the use of Machine Learning in solar radiation prediction

forecasting machine-learning python scikit-learn

Last synced: 11 Apr 2026

https://github.com/tasninanika/coded_data_prediction-knn

K-Nearest Neighbors (KNN) is a supervised machine learning algorithm

knn pandas python3 scikit-learn

Last synced: 07 Apr 2026
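
To make the KNN idea in the entry above concrete, here is a minimal from-scratch sketch of the classification variant: a query point takes the majority label of its k nearest training points. This is not code from the listed repository; the function name `knn_predict` and the toy data are hypothetical, chosen only for illustration. scikit-learn provides the same technique as `KNeighborsClassifier`.

```python
from collections import Counter
import math


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Euclidean distance from the query to each training point, paired with its label.
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    # Keep the k closest neighbours.
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    # Majority vote among the neighbours' labels.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy data: two well-separated clusters labelled "a" and "b".
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
labels = ["a", "a", "a", "b", "b", "b"]

prediction = knn_predict(points, labels, (1.1, 1.0), k=3)
```

The sketch recomputes every distance on each call, which is fine for small data; library implementations typically use spatial indexes to avoid that cost.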

https://github.com/williyam-m/movie-recommendation-system

Developed a web app with a cosine similarity machine learning model for personalized recommendations based on user history, likes, bookmarks, and activity. Implemented user auth and CRUD operations for movies.

django machine-learning numpy pandas prediction-model python scikit-learn

Last synced: 10 Apr 2026

https://github.com/aarryasutar/hate_speech_detection

This project aims to detect hate speech on Twitter using advanced NLP and machine learning techniques, exploring feature extraction methods like TF-IDF and sentiment analysis, and evaluating models such as Logistic Regression and SVM.

confusion-matrix doc2vec gensim logistic-regression matplotlib naive-bayes nltk numpy pandas python random-forest scikit-learn seaborn stemming stopwords-removal svm tf-idf-vectorizer tokenization vader word-cloud

Last synced: 09 Apr 2026

https://github.com/srilaasya/handwriting-recognition-using-k-means

Used K-means clustering and scikit-learn to cluster images of handwritten digits.

handwriting-recognition k-means python scikit-learn

Last synced: 13 Apr 2026

https://github.com/aarryasutar/logistic_regression_on_age_prediction

This code evaluates a logistic regression model on age prediction, using various features to predict a binary target variable and calculating metrics to assess performance. It compares results, identifies favorable features, and visualizes the ROC-AUC curve to determine the best-performing model.

accuracy-score confusion-matrix f1-score feature-selection logistic-regression model-training numpy pandas precision recall rmse roc-auc-curve scikit-learn visualization

Last synced: 20 Jan 2026

https://github.com/francescopaolol/titaniccompetition

My first Kaggle competition: predicting survival on the Titanic and getting familiar with ML basics.

jupyter-notebook kaggle-competition machine-learning ml pandas scikit-learn

Last synced: 17 Apr 2026

https://github.com/amirjahantab/iris_classification

This project analyzes the famous Iris dataset using various machine learning techniques. The goal is to classify the iris flowers into three species: Setosa, Versicolor, and Virginica based on the features provided in the dataset.

classification data-science machine-learning scikit-learn

Last synced: 16 May 2026

https://github.com/ksatrajit0/heart-disease-prediction-ml

Predicts the risk of heart attack in a patient using their medical record

heart-disease-prediction machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 19 Apr 2026

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project uses machine learning and regression methods, a statistical technique for predicting the outcome of an event, to assess users' admission eligibility for the universities they have chosen. The prediction is produced by the implemented algorithms once the user supplies the application with the required information.

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 13 Apr 2026

https://github.com/lasithaamarasinghe/movie-recommender-system

This ML model recommends movies that may align with the user's preferences based on a TF-IDF matrix.

jupyter-notebook machine-learning movie-recommendation movielens-dataset numpy pandas python regex scikit-learn tf-idf-vectorizer

Last synced: 12 Apr 2026

https://github.com/bhimrazy/iris-species-prediction-using-decision-tree-algorithm-grip

Iris Species Intelligence: Classifying Iris Species with Confidence using Decision Trees | The Sparks Foundation: GRIP

decision-tree-classifier fastapi gripjan23 machine-learning python scikit-learn sparkfoundation

Last synced: 10 Apr 2026

https://github.com/viveksapkal2793/advertisement-response-analysis

This project analyzes advertisement responses using a Django backend and a Vite+React frontend. It includes scripts to load, clean, and transform data, which are executed within Docker containers. Data is stored in a MongoDB database, and the project can be run with or without Docker by adjusting the MongoDB connection strings.

advertisement advertisement-analysis container-image containerization django docker machine-learning mongodb react scikit-learn vite

Last synced: 23 Sep 2025

https://github.com/samarthmule/chatbot

This project implements a generic chatbot using Natural Language Processing (NLP) and Machine Learning techniques. The chatbot is designed to classify user input into predefined intents and provide context-aware responses. The solution is scalable, interactive, and suitable for various domains.

chatbot internship machine-learning machine-learning-algorithms nlp nltk project-repository python python3 scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/sizzlins/kalkulator-ai

A simple command-line symbolic regression engine and Computer Algebra System (CAS) capable of discovering the laws of the universe and solving calculus, algebra, and trigonometry problems.

calculator calculus cli computer-algebra-system curve-fitting machine-learning mathematics numpy physics python scientific-computing scikit-learn sparse-regression symbolic-regression sympy

Last synced: 13 Jan 2026

https://github.com/vicperal/ai-genai_projects

Python projects covering LLM and ML use cases. They use modules such as Pandas, NumPy, Plotly, scikit-learn, Transformers, Flask, and JSON to analyze data, make predictions, generate insights, and create text with models such as LLMs, linear regression, and ensemble methods. Server and front end are built with Flask.

assembly clinical-trials flask json linear-regression llm ml numpy pandas plotly price-prediction python rag random-forest scikit-learn sentimental-analysis sql text-summarization tokens-counter transformers

Last synced: 02 Apr 2026

https://github.com/bahar15984/obesity-classification

Machine Learning Pipeline for Obesity Classification using Azure ML & Python

azure azure-ml classification data-science healthcare machine-learning mlops obesity pandas pipeline python scikit-learn

Last synced: 03 Nov 2025

https://github.com/aahnik/gdsc-ml-ds-bootcamp-2023

This repo contains files provided by my seniors, as well as the assignments and final project I completed during the bootcamp.

data-science machine-learning ml numpy pandas python3 scikit-learn

Last synced: 28 Oct 2025

https://github.com/tasninanika/callifornia-housing-price-prediction-svr

Support Vector Regression (SVR) is a type of Support Vector Machine used for predicting continuous values.

matplotlib numpy pandas python3 scikit-learn seaborn svm-regression

Last synced: 11 Apr 2026

https://github.com/luceldasilva/covid_19_italia

A government entity responsible for health management in Italy faces the challenge of understanding and analyzing the spread of COVID-19 in order to make informed and effective decisions in managing the pandemic. As data scientists, our task is to present insights that address the entity's concerns.

covid-19 deepnote google-colab jupyterlab pearson-correlation python random-forest scikit-learn

Last synced: 31 Jan 2026

https://github.com/soroush-04/incrementalsvm-road-accident-prediction

Enhance SVM and incremental SVM machine learning models for road accident severity prediction

incremental-learning machine-learning python scikit-learn svm

Last synced: 09 Apr 2026

https://github.com/bestmahdi2/uni__decisiontreesinpythonpredictingdiabetes

A university project implementing a decision tree in Python for diabetes diagnosis

decision-tree diabetes-prediction graphviz pandas prediction pydotplus python scikit-learn

Last synced: 11 Apr 2026

https://github.com/virajbhutada/article-recommendation-system

This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.

anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy

Last synced: 06 Jan 2026

https://github.com/rixiiz/knn-model-to-predict-the-obp-of-mlb-players

A machine learning model I built that can predict the On Base Percentage (OBP) of Major League Baseball (MLB) players at the end of the season

artificial-intelligence dataset f1-score jupyter-notebook knn-regression machine-learning matplotlib mse numpy pandas python scikit-learn supervised-learning

Last synced: 01 Jul 2025

https://github.com/rizz1406/spam-email-detector

Spam Email Classifier using Python and Streamlit. A simple machine learning project that classifies emails as **spam** or **ham** using the **Naive Bayes algorithm** and **TF-IDF** for text feature extraction. The project includes a user-friendly web app built with Streamlit.

nlp pandas pytho3 scikit-learn streamlit

Last synced: 09 Apr 2026

https://github.com/alessandrosocc/machine-learning-project-2022

Final project for the Machine Learning course at the University of Cagliari in 2022. Analysis of a dataset, use of Machine Learning techniques with Oversampling and Undersampling techniques. Final report with the results obtained.

imblearn machine-learning matplotlib-pyplot oversampling pandas scikit-learn spambase-dataset undersampling

Last synced: 18 Jan 2026

https://github.com/paulj1989/bulgarian-constitutional-court-decisions

Developing NLP models for text and sentence classification using legal texts from the Bulgarian constitutional court.

keras neural-network nlp scikit-learn tensorflow tesseract

Last synced: 04 May 2026

https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.

matplotlib numpy pandas python3 scikit-learn seaborn titanic-dataset titanic-kaggle titanic-survival-prediction

Last synced: 10 Apr 2026

https://github.com/nekruzash/regression-correlation

From the CS2023 AI/DS/ML class: trained a model on different categories of data and used linear regression to identify the feature with the greatest effect on housing prices.

jupyter-notebook python scikit-learn

Last synced: 04 May 2026

https://github.com/grachale/predict_titanik

Predicting the survival of Titanic passengers (binary classification) using a decision tree and KNN from scikit-learn.

classification decision-tree-classifier knn-classifier matplotlib pandas python scikit-learn titanic-survival-prediction

Last synced: 12 Apr 2026

https://github.com/mhmudfzli/exploring-mental-health-data

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

catboost lightgbm matplotlib numpy pandas scikit-learn seaborn xgboost

Last synced: 09 Apr 2026

https://github.com/lordmitrii/win-prediction-django

A web application built on the Django framework. It predicts the winning team from given sets of Dota 2 heroes.

django dota2 jupyter-notebook machine-learning python scikit-learn web

Last synced: 13 Apr 2026

https://github.com/filiplangiewicz/automltunability

📈 Analyzing the impact of hyperparameter optimization

automl machine-learning scikit-learn

Last synced: 18 Feb 2026

https://github.com/gokularaman-c/ev-charging-log-anomaly-detection

EV charging log anomaly detection using Isolation Forest, engineered telemetry features, and a CLI inference pipeline.

anomaly-detection ev-charging feature-engineering isolation-forest machine-learning mlops python scikit-learn time-series

Last synced: 23 May 2026

https://github.com/shanmukhsrisaivedullapalli/automatic-ticket-classification

This project processes customer complaint data using pandas for data manipulation and applies text preprocessing techniques, including lemmatization, to clean and normalize complaint text. The `tqdm` library provides progress bars for efficient tracking of text processing tasks.

matplotlib neural-networks nlp numpy pandas python3 scikit-learn seaborn tensorflow tqdm wordcloud

Last synced: 11 Apr 2026

https://github.com/jersongb22/datascience_mlops_movierecommendations_project

Simulating a Data Scientist's role in a startup aggregating streaming platforms. Building movie queries and ML-based recommendation system with MLOps focus. ML model web app deployed with Render.

data-science fastapi machine-learning matplotlib pandas python render scikit-learn stopwords

Last synced: 10 Apr 2026

https://github.com/ayberkyavuz/ml_model_server_docker_deployment

This repository contains the source code for deploying a machine learning model server.

deployment docker flask machine-learning model python random-forest scikit-learn

Last synced: 08 Apr 2026

https://github.com/medyessinkhlif/medclaimml

An AI-powered machine learning application designed to process healthcare reimbursement claims. It analyzes medical documents, client information, insurance policies, and legal regulations to predict accurate reimbursement amounts, ensuring efficiency, compliance, and fraud detection.

healthcare jest-tests mern-stack mongodb nodejs nosql numpy pytorch react scikit-learn tailwindcss

Last synced: 13 May 2025

https://github.com/jesly-joji/house-price-prediction

House Price Prediction using Linear Regression with Scikit-learn and Flask

flask regression scikit-learn

Last synced: 03 Jan 2026

https://github.com/salmandeveloperz/ml_house_prediction

Project for house price prediction using classification and regression models. Includes Docker setup for easy deployment.

classification-model clustering deep-learning machine-learning matplotlib numpy pandas python3 regression-models scikit-learn

Last synced: 10 Apr 2026

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 09 Apr 2026

https://github.com/its-maneeshk/fake-product-detection-system

The Fake Product Review Detection System is a machine learning-powered web application designed to analyze and detect fake reviews on eCommerce platforms. It helps users identify whether a product has genuine or manipulated reviews by leveraging Natural Language Processing (NLP) and supervised learning models.

api beautifulsoup4 fetch-api flask html-css-javascript joblib nlp-machine-learning numpy pandas python reactjs requests scikit-learn

Last synced: 05 Mar 2025

https://github.com/imswappy/brain-tumor-detection

🧠 Deep learning project for brain tumor classification using MRI images. Built with transfer learning (VGG16 + fine-tuning), TensorFlow/Keras, and deployed via Streamlit. Dataset & model loaded dynamically from KaggleHub. Includes training notebook, evaluation, and interactive web app.

kagglehub keras numpy pandas scikit-learn streamlit tensorflow vgg16-model

Last synced: 13 Apr 2026

https://github.com/ksasi/boston_housing

Predicting Boston Housing Prices - Udacity

machine-learning numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/hopquangdo/minisklearn

✨ A lightweight Python-from-scratch implementation of core Machine Learning algorithms inspired by scikit-learn.

machine-learning scikit-learn

Last synced: 09 Jul 2025

https://github.com/3rd-son/knn-breast-cancer-prediction-model-

A breast cancer prediction model using KNN with an accuracy of 96%

jupyter-notebook knn-classifier matplotlib numpy pandas python scikit-learn scipy search

Last synced: 11 Apr 2026

https://github.com/takkii/rock_ptarmigan

[PyPi] Approximate value evaluation 📊

approximate keras pca rock-ptarmigan scikit-learn tensorflow

Last synced: 10 May 2026

https://github.com/hvignolo87/marketing-campaign-classification

Real case of classification with machine learning. Analysis of real data from telemarketing campaigns of a Portuguese bank.

binary-classification data-science pandas python scikit-learn xgbclassifier xgboost

Last synced: 12 Apr 2026

https://github.com/theanujsinha01/rainfall-prediction-using-machine-learning

This project predicts whether it will rain or not based on weather features like pressure, humidity, dew point, cloud cover, sunshine, wind direction, and wind speed. We use a Random Forest Classifier, a popular ML algorithm, trained on historical weather data. The model learns patterns and helps us forecast rain chances.

classification data-analysis eda machine-learning-algorithms matplotlib numpy pandas python scikit-learn seaborn supervised-learning

Last synced: 11 Apr 2026

https://github.com/shreeyas-48/creditcardfrauddetection

Project for detecting credit card fraud using neural networks and logistic regression

autoencoder keras logistic-regression matplotlib neural-networks numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/kostasereksonas/ids_test

Code for an intrusion detection system based on the "Intrusion Detection System Using Machine Learning Algorithms" tutorial on GeeksforGeeks and the Intrusion Detection on NSL KDD GitHub repository.

ids intrusion-detection intrusion-detection-system nsl-kdd-dataset numpy pandas python scikit-learn tensorflow

Last synced: 08 Apr 2026

https://github.com/mgckaled/ignite-devia-supervised_algorithms

Repository bringing together modules 7 through 13 of the Formação Desenvolvimento IA 2023-2024 program, developed by Rocketseat Education.

gradio joblib pandas python scikit-learn statsmodels uvicorn

Last synced: 12 Apr 2026

https://github.com/machinelearningprodigy/covid-19-detection-system

It is a Flask-based web application that predicts the likelihood of COVID-19 infection based on user symptoms. The app utilizes a K-Nearest Neighbors (KNN) model trained on relevant medical features to assess COVID-19 risk.

algorithms feature-engineering flask machine-learning numpy pandas scikit-learn

Last synced: 20 Apr 2026

https://github.com/tasninanika/heart-disease-analysis

The Heart Disease Analysis project is a comprehensive machine learning study aimed at predicting the presence of heart disease using the Heart Disease UCI Dataset.

knn logistic-regression matplotlib numpy pandas python3 random-forest scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/priboy313/pandasflow

A set of custom Python modules for a friendlier pandas workflow

catboost data-analysis data-science pandas phik python scikit-learn shap

Last synced: 20 Jan 2026

https://github.com/hayatoy/gcpml-notebook

Dockerfile with Jupyter Machine Learning environment plus Google Cloud SDK

dockerfile google-cloud-platform jupyter scikit-learn tensorflow

Last synced: 12 Apr 2026