An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/impesud/ai-finops-platform

AI FinOps is an AI-powered platform for cloud cost optimization and forecasting. Built with FastAPI, Python, and modern MLOps tools, it allows teams to track multi-cloud usage, detect anomalies, and predict future expenses using real-time data and machine learning.

aws docker fastapi jupyter mlflow python react scikit-learn statsmodels tailwindcss terraform xgboost

Last synced: 09 Apr 2026

https://github.com/parag000/content-based-movie-recommender

This project builds a content-based movie recommendation system using the TMDB dataset. By combining metadata features like cast, genres, and directors into a "metadata soup," it calculates movie similarity with vectorizers (Count) and cosine similarity. Ideal for learning content-based filtering and text vectorization techniques.

cosine-similarity countvectorizer recommendation-system scikit-learn tfidf-vectorizer vectorization

Last synced: 18 Apr 2026

https://github.com/svetlanam/pycon-workshop

Pycon CZ workshop: Better data analyses and product recommendations with Instagram data

data-analysis data-science martinus matplotlib pandas pycon2016 pyconcz python scikit-learn workshop

Last synced: 09 Apr 2026

https://github.com/ayan6943/employee-attrition-prediction-with-machine-learning

Employee Attrition Prediction with Machine Learning | Analyzing HR data to predict employee turnover using Random Forest. Includes EDA, feature engineering, model training, and evaluation. Achieved 90% accuracy.

attrition employee machine-learning matplotlib numpy pandas python randomforestclassifier scikit-learn seaborn smote

Last synced: 09 Apr 2026

https://github.com/al-shafi-github/deephatedetect-explainable-bengali-abusive-comments-classification-using-transformers-and-llm

This Project aims to train different models that can detect Bengali hate speech on different social media platforms and do a comparative analysis of the models

bangla-nlp nlp nlp-machine-learning python3 regex scikit-learn scikitlearn-machine-learning tabular-data

Last synced: 01 May 2026

https://github.com/jalijuhola/amazon-textual-reviews-recommender-

predicting score and recommending using amazon textual reviews

numpy pandas python scikit-learn typescript

Last synced: 09 Apr 2026

https://github.com/chengetanaim/customerpersonalityanalysis

Customer Personality Analysis involves a thorough examination of a company's optimal customer profiles. This analysis facilitates a deeper understanding of customers, enabling businesses to tailor products to meet the distinct needs, behaviors, and concerns of various customer types

kmeans-clustering pandas scikit-learn

Last synced: 21 Apr 2026

https://github.com/dragonscypher/feastfinderai

Discover the best dining spots with FeastFinderAI!

folium pandas python scikit-learn sql

Last synced: 09 Apr 2026

https://github.com/ifigeneiatsiflidou/applied-statistics-project

Project for an Applied Statistics course, involving exploratory data analysis and predictive modeling of movie revenue using engineered features and multiple linear regression.

correlation-analysis data-analysis linear-regression python scikit-learn visualization

Last synced: 29 Apr 2026

https://github.com/ravi0529/e-commerce-annual-spend-model

A basic Linear Regression model for predicting annual customer's spending

jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn scipy

Last synced: 09 Apr 2026

https://github.com/bkaracali/crime-data-analysis

Repository for Final Project

machine-learning python scikit-learn

Last synced: 21 Apr 2026

https://github.com/sk-g/mnist_beginners

Model search in traditional machine learning algorithms (non DL) and DL starter codes on MNIST dataset. This is a good starter code for beginners trying to learn about curse of dimensionality, overfitting and other concepts in general

keras machine-learning machine-learning-algorithms mnist mnist-beginners mnist-classification mnist-dataset numpy overfitting python pytorch pytorch-implmention resnet resnet-50 scikit-learn scikitlearn-machine-learning sklearn tensorflow

Last synced: 09 Apr 2026

https://github.com/nazmul-1117/100-days-of-machine-learning

I'm Nazmul so exited to start a new journey to learn 100 Days of Machine Learning. It's February 8, 2025. I'm so exited, let's see what happened insha'Allah

data-science machine-learning numpy pandas-dataframe python3 scikit-learn statistics

Last synced: 11 Aug 2025

https://github.com/hariprasath-v/hackerearth-amazon-business-research-analyst-hiring-challenge

Build a machine learning model that can calculate the time the delivery person takes to deliver the order.

exploratory-data-analysis hackerearth machine-learning pandas pycaret python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/abdellatif-laghjaj/salary-scope-predictor

SalaryScope: Job Salary Predictor is a machine learning solution designed to estimate salaries from job listings. It employs a full ML pipeline from exploratory data analysis, data cleaning, and NLP on job descriptions to regression model training (Linear Regression, Random Forest, etc.) and hyperparameter tuning

data-science developer-survey feature-engineering machine-learning predictive-modeling regression salary-calculator salary-prediction scikit-learn streamlit

Last synced: 08 May 2026

https://github.com/bhuvan-s-prasad/-alzheimer-diagnosis

This project predicts Alzheimer’s disease using machine learning with basic MLOps integration for better organization and reproducibility. It includes data processing, model training, evaluation, and deployment, incorporating version control, automation, and experiment tracking as a first step into MLOps.

alzheimers-disease classification eda explainable-ai exploratory-data-analysis machine-learning mlops pandas python random-forest random-forest-classifier regression scikit-learn supervised-learning

Last synced: 09 Apr 2026

https://github.com/ezeparziale/tweet-clasification

:bird: Tweet sentiment analysis

bootstrap flask nltk python scikit-learn

Last synced: 09 Apr 2026

https://github.com/prakashjha1/customer-segmentation

This repository contains a customer segmentation project implemented in a Jupyter Notebook using Python. Customer segmentation is a crucial strategy for businesses aiming to understand their customer base better, enabling targeted marketing strategies and personalized customer experiences.

clustering-algorithm customer-segmentation kmeans-clustering matplotlib python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/eusha425/housing-market-analysis

Implementation of supervised learning algorithms for real estate price prediction, featuring Ridge Regression optimization, IQR-based outlier detection, and extensive feature engineering. Includes detailed visualizations, statistical analysis, and model performance comparisons using various evaluation metrics.

data-preprocessing data-science exploratory-data-analysis house-price-prediction machine-learning python scikit-learn supervised-learning

Last synced: 09 Apr 2026

https://github.com/amandeep-gupta19/chatbot

Created a custom chatbot using Langchain. Here's a summary of what I did: Data Extraction: I gathered data about technical courses from the Brainlox website using Langchain’s URL loaders. Embedding Creation & Storage: I converted this data into embeddings and stored it in a vector store for efficient searching. API Development: I built a Flask

data-extraction faiss-vector-database flask-restful langchain numpy scikit-learn vector-database webbaseloader

Last synced: 09 Apr 2026

https://github.com/moritzkoerber/text_analysis_app

A web app that classifies the content of messages that are usually sent during disasters such as earthquakes.

flask machine-learning nltk python scikit-learn

Last synced: 09 Apr 2026

https://github.com/nicolascoiado/mulheres-ti

Este repositório contém um código em Python para analisar a evolução do número de mulheres na área de Tecnologia da Informação (TI) ao longo dos anos. Utilizando pandas para manipulação de dados e scikit-learn para criar um modelo de regressão linear, o objetivo é prever quantas mulheres estarão na TI em 2024 com base em dados históricos.

linear-regression matplotlib pandas python python3 scikit-learn

Last synced: 09 Apr 2026

https://github.com/alphacrypto246/old-car-price-prediction

The Old Car Price Prediction project predicts used car prices using features like age, mileage, and fuel type. It includes data preprocessing, model training, and visualization of trends, with easy customization for additional features or models.

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 09 Apr 2026

https://github.com/tasninanika/australian-credit-approval-analysis-svm

This project uses a Support Vector Machine (SVM) Classifier to predict whether a credit application is approved (1) or denied (0) based on applicant features.

numpy pandas python3 scikit-learn svm-classifier

Last synced: 10 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_03

Модуль 3. Класичне машинне навчання. Перенавчання. Лінійна регресія. LaTeX формули.

latex linear-regression matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/abdullahashfaqvirk/SMS-Spam-Detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 16 Aug 2025

https://github.com/djdhairya/football-match-prediction

In this project, we'll predict the winner of football matches in the English Premier League (EPL).

jupyter-notebook machine-learning pandas python3 requests scikit-learn vscode

Last synced: 09 Apr 2026

https://github.com/abidhasanrafi/bioadaptive-eyeml-diagnosis

A state-of-the-art ocular diagnosis tool leveraging biomimetic machine learning to analyze eye movement patterns and predict ocular conditions with clinical-grade accuracy.

eye-tracking ocular-disease-recognition scikit-learn streamlit

Last synced: 16 Aug 2025

https://github.com/martingit2/aiportal-ml-service

ML-mikrotjeneste for Aracanix. En Python/Flask-app som trener og serverer XGBoost-modeller for prediktiv analyse. Se README for lenker til frontend og backend.

flask fullstack machine-learning microservices pandas python scikit-learn xgboost

Last synced: 09 Apr 2026

https://github.com/1587causalai/causal-sklearn

Scikit-learn Compatible Causal Machine Learning Library - Based on CausalEngine™

cauchy-distribution causal-inference causal-machine-learning machine-learning python pytorch scikit-learn

Last synced: 17 Aug 2025

https://github.com/bruno-moura24/hand-gesture-ai

Projeto em Python que utiliza OpenCV, MediaPipe e scikit-learn para detectar gestos de mão via webcam e classificá-los como números de 0 a 5 em tempo real.

computer-vision hand-gesture-recognition machine-learning mediapipe opencv python real-time-ai scikit-learn

Last synced: 28 Apr 2026

https://github.com/balajig-24/titanic_data_analysics-

Project Title: Titanic Survival Prediction Project Overview The Titanic Survival Prediction project is a classic machine learning problem that aims to predict whether a passenger survived the Titanic disaster based on various features such as age, gender, passenger class, and more. This project demonstrates my ability to clean, analyze, and model.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/kylehperez/mushroomnet

This API is for research purposes ONLY and is NOT to be used for food-safety or medical advice. Mushroomnet is a machine learning neural network for classifying mushrooms as poisonous or edible. The model was trained on data from uni.edu: over 5,000 instances of mushrooms, with 22 qualitative features and a determination of poisonous or edible.

artificial-intelligence botany flask-api machine-learning mycology neural-network numpy python3 pytorch scikit-learn

Last synced: 09 Apr 2026

https://github.com/elam-parithi/singapore_flatprice_predicting

Flat price prediction with Machine learning tools and python.

matplotlib numpy pandas scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/ledsouza/machine-learning-client-satisfaction

Este projeto tem como objetivo construir um modelo de Machine Learning capaz de prever a satisfação de clientes de uma companhia aérea.

cross-validation hyperparameter-optimization machine-learning machine-learning-algorithms mlxtend one-hot-encoding pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/aravind-selvam/student_exam_performance_predictor

Sample Machine learning project, The project uses Sklearn’s regression techniques such as XGboost and Random forests to train and test the model on student data. Deployed on Heroku with Flask application

flask-application heroku machine-learning python scikit-learn

Last synced: 10 Apr 2026

https://github.com/ankitsharma-tech/classification-of-arrhythmia-using-ecg-data

A machine learning project to detect and classify arrhythmias from ECG signals using Python, scikit-learn, and TensorFlow. Includes data preprocessing, model training, and evaluation.

arrhythmia biomedical-signal-processing cardiology classification deep-learning ecg ecg-classification healthcare machine-learning mit-bih-dataset numpy python scikit-learn scipy signal-processing tensorflow time-series-analysis

Last synced: 07 Apr 2026

https://github.com/oceanuz/house-sales-price-prediction

This project focuses on predicting house sale prices in King County, USA using regression-based machine learning models. The dataset is cleaned, explored, and analyzed to understand key factors affecting housing prices. Multiple regression techniques are applied and evaluated using R² score to compare model performance and interpret results.

data-science eda house-price-prediction machine-learning python real-estate regression scikit-learn

Last synced: 10 Apr 2026

https://github.com/programmergnome/ann-modeling

Artifical neurons and neural networks modeling repo. Only for learning!

chatgpt deep-learning machine-learning natural-language-processing python pytorch scikit-learn tensorflow

Last synced: 10 Apr 2026

https://github.com/amithjoseph777/gourmethaven-case-competition

Predictive analytics case competition project from the Master of Science in Business Analytics course at the College of Business, University of Louisville. Our team analyzed customer response trends for Gourmet Haven, developed a predictive model, and ranked in the Top 5 out of 25 teams.

google-colab jupyter-notebook numpy pandas r scikit-learn

Last synced: 10 Apr 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/davipythonweb/price_api

API de Previsão de Preço de casa com python/Machine-Learn

flask machine-learning pickle python python-dotenv scikit-learn venv

Last synced: 10 Apr 2026

https://github.com/rtmigo/skifts_py

Search for the most relevant documents containing words from a query. Uses Scikit-learn and Numpy

cosine-similarity information-retrieval numpy python scikit-learn text-mining tf-idf

Last synced: 19 Aug 2025

https://github.com/lorenzorottigni/ml-houses

Machine Learning python bootcamp: linear regression on houses model

ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/donmaruko/sentiment-analysis-api

Flask-based API for sentiment analysis using deep learning models and includes endpoints for text and file input, database storage, and integrated Swagger documentation.

api deep-learning deep-neural-networks flask keras lstm machine-learning neural-network rnn scikit-learn scikitlearn-machine-learning sklearn sqlite3 swagger swagger-ui tensorflow

Last synced: 10 Apr 2026

https://github.com/nickklos10/compressive-strenght-prediction

This project predicts concrete compressive strength using a neural network regression model built with Keras.

jupyter-notebook keras matplotlib numpy pandas python scikit-learn

Last synced: 10 Apr 2026

https://github.com/marknature/machine-learning-intern

Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset

data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn

Last synced: 10 Apr 2026

https://github.com/libra33a/retinopathy-ai

👁️ Detect Diabetic Retinopathy from retinal images using a CNN model in PyTorch, ensuring early intervention and reducing vision loss risks.

ai cnn-classification deep-learning healthcare-ai heidisql kaggle-competition machine-learning medical-imaging neural-network onnxruntime python pytorch resnet-152 retinal-fundus-images retinopathy scikit-learn tensorflow tkinter-gui

Last synced: 10 Apr 2026

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 10 Apr 2026

https://github.com/yasolg/ml-bootcamp

⚡ Master machine learning in 10 days with this interactive, open-source bootcamp. Learn Python basics to production ML at no cost.

chatgpt crewai data-science deep-learning jose-portilla langchain langgraph large-language-models llama matplotlib-pyplot mlops ollama python satellite-imagery scikit-learn tensorflow tensorflow2 udemy-course-project

Last synced: 15 Apr 2026

https://github.com/jeniljani-4444/end-to-end-car-price-prediction-model

Predict car prices effortlessly using this machine learning model. Built with Python and Scikit-learn it analyzes features like mileage age brand and more to estimate accurate prices. Perfect for buyers sellers and dealerships.

machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit

Last synced: 10 Apr 2026

https://github.com/audy21/data-exploratory-portfolio

An advanced visualizations from my recent practices.

matplotlib nltk pandas plotly scikit-learn seaborn tensorflow

Last synced: 27 Aug 2025

https://github.com/arrhythmia-detection/authorfeatureextracteddecisiontreeoptimizedesp32s3

Deploys an optimized Decision Tree for Arrhythmia classification using Chapman ECG dataset on ESP32-S3 dev kit

arrhythmia-classification decision-tree-classifier decision-trees eloquent esp32-arduino esp32-s3 scikit-learn

Last synced: 27 Aug 2025

https://github.com/rohitdusane/spam-classification-model

A Python-based machine learning model for spam detection leveraging TF-IDF vectorization and multiple classifiers, including Naive Bayes, Logistic Regression, and Random Forest. This project demonstrates preprocessing techniques, model training, and performance evaluation for classifying SMS messages as spam or ham.

data-science flask mlflow natural-language-processing scikit-learn spam-detection text-classification

Last synced: 19 Apr 2026

https://github.com/lakshitalearning/codsoft

Machine Learning Projects - CODSOFT Internship: This repository showcases my machine learning projects completed during my internship at Codsoft. It demonstrates my skills in developing innovative solutions using various ML techniques and tools.

churn-prediction codsoft codsoftinternship deep-learning handwritten-text-recognition internship-project keras machine-learning python rnn-tensorflow scikit-learn spam-detection

Last synced: 11 Feb 2026

https://github.com/davidyslu/PokemonRecognition

Recognize Pokemon's image using scikit-learn in Python

knn-model python scikit-learn svm-model

Last synced: 29 Aug 2025

https://github.com/murshidazher/recommendation-system

🎥 Building a recommendation system using python

python recommendation-engine scikit-learn suprise

Last synced: 08 May 2026

https://github.com/gana36/credit-card-fraud-detection

Production MLOps pipeline for fraud detection with automated testing, monitoring, and zero-downtime deployments

docker evidently fastapi fraud-detection grafana machine-learning mlflow mlops postgresql prometheus scikit-learn

Last synced: 10 Apr 2026

https://github.com/mohammedhaq/safestream

SafeStream is a machine learning project that utilizes machine learning to predict the potability of water. By analyzing various water quality parameters, SafeStream helps in determining whether a water source is safe for consumption. This project leverages Python, PyTorch, and scikit-learn.

logistic-regression machine-learning neural-network python pytorch scikit-learn

Last synced: 23 Jul 2025

https://github.com/csakig/bike-sharing-demand-analytics

Advanced analytics on Bike Sharing data using Random Forest, Gradient Boosting, and SARIMAX forecasting.

data-science machine-learning portfolio python scikit-learn time-series

Last synced: 10 Apr 2026

https://github.com/thiagohrcosta/machinelearning-temperature

A Small Machine Learning application leveraging Scikit-Learn and statistical learning to extract knowledge from data without explicit programming.

machine-learning numpy pandas python3 scikit-learn

Last synced: 08 Apr 2026

https://github.com/thatguychandan/adoptimization

This project implements an ad optimization system using a hybrid approach combining Thompson Sampling and Upper Confidence Bound (UCB) algorithms. The system learns to select the most effective ads based on user context and historical performance.

numpy pandas plotly python pytorch reinforcement-learning scikit-learn streamlit thompson-sampling upper-confidence-bound

Last synced: 10 Apr 2026

https://github.com/thammami01/simple-recruitment-ml

Simple recruitment app that allows job posting/application, and viewing regression/classification figures based on entries.

flask matplot-lib mongodb python scikit-learn

Last synced: 12 Apr 2026

https://github.com/itsmandrew/diabetes-cs178

Final project for CS178, predicting whether and when will patient with diabetes be readmitted in hospital after the treatment.

knn logistic-regression neural-network python scikit-learn

Last synced: 13 Apr 2026

https://github.com/solanovisitor/moodpredictor

An application that uses Machine Learning to predict one's risk of having mood disorders (currently in Portuguese)

pandas python scikit-learn streamlit xgboost

Last synced: 09 Apr 2026

https://github.com/sobhan-m/comp472-project1

A program that builds various machine learning models on a dataset composed of Reddit posts, their emotions, and their sentiments.

ai jupyter-notebook machine-learning python scikit-learn

Last synced: 14 Apr 2025

https://github.com/s0fft/learning-lab

Code Notes & Test-Learn // Micro Pet-Projects: Python / Asynchrony / FastAPI / Django-Tastypie / DRF / Parsing / Telegram-Bot / SQL / Docker / DS / ML / etc.

asynchrony data-science django-rest-framework docker fastapi jupyter-lab jupyter-notebook mashine-learning matplotlib notes numpy pandas parsing python3 scikit-learn seaborn sql sqlalchemy tastypie telegram-bot

Last synced: 10 Apr 2026

https://github.com/shreeparab1890/duplicate-question-predictor

The ipython notebook is working to build a model which will detect duplicate questions if two questions pair are given.

bag-of-words nlp nlp-machine-learning nltk numpy pandas python random-forest scikit-learn sklearn streamlit

Last synced: 10 Apr 2026