An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/vijaykumarr1452/ipl-first-innings-score-prediction

Developed a machine learning model to predict probable IPL match score ranges using advanced data analytics and historical cricket performance data.

adaboost ann decision-trees deployment machine-learning ml-regression pandas python random-forest scikit-learn

Last synced: 12 Apr 2026

https://github.com/stefagnone/-air-france-sponsored-search-campaign-optimization

Optimized Air France's online advertising campaigns using advanced regression modeling and data analysis, uncovering actionable insights to maximize ROI and enhance ad performance.

advertising-analytics air-france data-science digital-marketing marketing-campaigns predictive-modeling python r regression-analysis roi-optimization scikit-learn sponsored-search

Last synced: 12 Apr 2026

https://github.com/musty-ess/shopping-prediction-ai-using-machine-learning

This project is an AI model designed to predict whether customers on an online shopping website will complete a purchase. It uses machine learning, specifically a k-nearest neighbors (KNN) classifier, to analyze user behavior and predict their purchasing intent.

ai-prediction classification-model k-nearest-neighbors knn-classifier machine-learning online-shopping python scikit-learn sensitivity-specificity user-behavior-prediction

Last synced: 12 Apr 2026

https://github.com/velinovjovan/turbo-fpl

A modern Fantasy Premier League ⚽ analytics and prediction platform built with Next.js, Supabase, and machine learning — featuring live stats, price predictions, and AI-powered 🤖 point forecasts.

fpl fpl-analysis fpl-api nextjs14 scikit-learn scikitlearn-machine-learning tailwindcss

Last synced: 12 Apr 2026

https://github.com/id-andyyy/alfahack

📈💰 Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/enricobolzonello/ml_homeworks

Homeworks for the Machine Learning Course 2022/23 @ Unipd

linear-regression machine-learning neural-network scikit-learn svm

Last synced: 11 Jun 2025

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 12 Apr 2026

https://github.com/vishnu-vamshii/fraud-detection-using-machine-learning

Developed a machine learning pipeline to detect fraudulent credit card transactions, handling imbalanced data with SMOTE and scaling. Trained models like Logistic Regression and Random Forest. Conducted EDA to identify fraud patterns.

pandas python scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/mindlessmuse666/train-test-splitter

Analysis of Titanic passenger data and splitting it into training and test sets. A practical assignment for the course "Fundamentals of Applying Artificial Intelligence Methods in Programming".

data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split

Last synced: 12 Apr 2026

https://github.com/mark1708/neurointerfaces-of-information-systems

Laboratory work on the discipline "Neurointerfaces of information systems"

numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/nicolas-giacomelli/modelo-polinomial-api-streamlit

Code focused on training a model that predicts salary from time at the company and level within the company, served through an API with a Streamlit front end. To calculate, enter the time at the company and the level, and the app returns the matching salary.

api fastapi joblib machine-learning matplotlib numpy pandas pingouin pydantic scikit-learn seaborn streamlit uvicorn

Last synced: 12 Apr 2026

https://github.com/jaspreetsingh-exe/sign-language-recognition-system

Sign Language Recognition System is an AI-powered application that enables real-time sign language recognition using MediaPipe and an MLP model. It captures hand gestures, extracts landmark features, and predicts sign language letters dynamically. The project also explores MobileNetV2 and aims to expand into Text-to-Sign Language generation.

deep-learning machine-learning mediapipe mobilenetv2 neural-networks scikit-learn sign signlanguage signlanguagedetection signlanguagerecognition tensorflow

Last synced: 01 May 2026

https://github.com/aishwaryagm1999/insurance-workflow-management

This project is an Insurance Workflow Management System designed to streamline policy management, claims processing, and fraud detection. It includes user account management, customer feedback analysis via NLP, alert notifications through SMS, and a fraud detection model, providing a secure, efficient solution for insurance operations.

css fraud-detection html json labelimg machine-learning natural-language-processing nlp opencv python qr-code-generator random-forest-classifier scikit-learn sms-notification tensorflow textblob twilio user-interface

Last synced: 26 Dec 2025

https://github.com/urvee1810/eda-time-series

A comprehensive time series analysis of French retail quarterly sales data from 2012 to 2017. The project focuses on analyzing sales patterns, seasonal decomposition, and trend analysis using various statistical techniques and visualizations.

arima-modeling data-visualization exploratory-data-analysis matplotlib numpy pandas pmdarima python scikit-learn seaborn statsmodels time-series-analysis trend-analysis

Last synced: 12 Apr 2026

https://github.com/manojkp08/student-performance-analysis

The Student Performance Analyzer is your go-to solution for understanding and improving student performance. By blending the power of machine learning with interactive visualizations, this tool provides educators and learners with personalized insights into learning styles, performance gaps, and actionable improvements.

machine-learning numpy pandas python requests scikit-learn streamlit

Last synced: 12 Apr 2026

https://github.com/brianlesko/maze-runner

Developed a Python-based maze-crawling application using a PS5 controller interface. This project highlights skills in software-hardware integration and low-code UI design, demonstrating expertise ideal for advanced software engineering.

communication dualsense engineer engineering hacking hardware hardware-hacking interface low-code-ui mechanical-engineer mechanical-engineering protocol ps5 python robotics-engineer scikit-learn software sony streamlit ui

Last synced: 12 Apr 2026

https://github.com/raduldev/ml-projects

End To End Machine Learning Project guided by Krish Naik from Ineuron.

catboost dill flask-application numpy pandas python scikit-learn xgboost

Last synced: 12 Apr 2026

https://github.com/dakohhh/politicians-face-recognition

A machine learning model that classifies famous Nigerian politicians. Classification is restricted to only 4 people.

gridsearchcv jupyter-notebook machine-learning opencv python pywavelets scikit-learn

Last synced: 16 Apr 2026

https://github.com/arrhythmia-detection/arrhythmiadetectionmodels

This repository contains the ML codebase developed during the CSE713 group project.

arrhythmia-detection deep-neural-nets esp32-s3 scikit-learn tensorflow tensorflow-lite tinyml

Last synced: 12 Apr 2026

https://github.com/shubhamsoni98/project_using_knn

This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.

anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau

Last synced: 03 Jan 2026

https://github.com/thaisgarcia/scikit-learn

I used supervised learning, specifically linear regression, to predict salaries based on the time devoted to monthly study. The trained model established a mathematical relationship between salary and study hours, adjusting its parameters during training.

pandas scikit-learn seaborn

Last synced: 08 May 2026

https://github.com/sankoktas/bhi360-fall-detection

Fall detection system using Bosch BHI360 sensor data with time-series labeling, feature extraction, and machine learning (LOSO CV + Gradient Boosting).

accelerometer bhi360 bosch-sensors data-augmentation fall-detection feature-extraction gradient-boosting gyroscope human-activity-recognition label-studio loso-cross-validation machine-learning python scikit-learn sensor-data smote time-series

Last synced: 07 May 2026

https://github.com/sayan520/titanic-data-insights

Conducting data analysis on Kaggle's Titanic: Machine Learning from Disaster dataset using essential data wrangling, exploratory data analysis (EDA), and visualization techniques to uncover insights, identify patterns, and explore factors influencing passenger survival.

jupyter-notebook kaggle matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/tnleite/credit-card-customer-clustering

This repository presents a credit card customer segmentation and prediction project. Using EDA, clustering (K-Means), and machine learning, the goal is to predict the group of new customers, supporting personalized marketing strategies.

classification-algorithm clustering-algorithm clustering-analysis data-science exploratory-data-analysis kmeans-clustering logistic-regression machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn seaborn

Last synced: 07 May 2026

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and its effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/lucasfranklinsilva/rnn-lstm

Failure Prevention Model for Simulated Turbines Using Recurrent Neural Networks

data-visualization deep-learning jupyter-notebook keras machine-learning neural-networks python scikit-learn

Last synced: 12 Apr 2026

https://github.com/theartificialdev/movie-recommendation-system

The primary goal of this project is to provide personalized movie recommendations to users based on their preferences and the characteristics of the movies. This is achieved through a multi-step process involving data preprocessing, text vectorization, and recommendation generation.

anaconda-environment data-science jupyter-notebook machine-learning movie-recommendation movies pandas python3 recommendation-system recommender-system scikit-learn scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/sohang3112/stock-prediction-mlops

Stock Prediction MLOps group project for IIT Madras MTech (AI).

mlops python scikit-learn stock-price-prediction

Last synced: 20 Jun 2026

https://github.com/pders01/telarantula

📜 I made this for Uni. Was pretty fun. It scrapes telegram channels of known German tinfoil-hats and tries to detect the telegram channel based on the emojis that are used.

assignment python research scikit-learn scrapy

Last synced: 04 Aug 2025

https://github.com/thariniselvakumar/kidney-disease-prediction

This project is about kidney disease prediction using machine learning algorithms.

machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/jpcano/boston_housing

Predicting Boston Housing Prices using supervised Machine Learning algorithms

cross-validation machine-learning numpy pandas python regression-models scikit-learn

Last synced: 12 Apr 2026

https://github.com/ccastleberry/sk-autobots

Custom data transformers using the scikit-learn API.

scikit-learn sklearn sklearn-api

Last synced: 08 Feb 2026

https://github.com/themihirmathur/soiligator

Soiligator is an advanced machine learning project designed to optimize irrigation management by predicting whether irrigation is necessary based on environmental and soil-related data.

auc-score logistic-regression machine-learning matplotlib numpy pandas python random-forest-classifier roc-curve scikit-learn seaborn standardscaler support-vector-machine

Last synced: 12 Apr 2026

https://github.com/touhoue/oilpumpvibration

The project employs signal processing techniques like Hilbert transforms to extract amplitude envelopes and instantaneous frequencies, facilitating insights into the mechanical health and performance of the system.

python scikit-learn

Last synced: 07 May 2026

https://github.com/gititsid/visaverdict

An ML project to predict the possibility of US visa approval.

classification python3 random-forest-classifier scikit-learn

Last synced: 03 Feb 2026

https://github.com/lorenzorottigni/ml-universities

Machine Learning Python bootcamp: K-means clustering with a public/private universities dataset

k-mean-clustering machine-learning numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/jprmaulion/bayesopt-gb-seismic-liquefaction-liq7

Bayesian-optimized gradient boosting for seismic liquefaction prediction with geographic stratified CV on the LIQ/7/2833 global database.

bayesian-optimization binary-classification gradient-boosting lightgbm liquefaction machine-learning python scikit-learn shap shear-wave-velocity soil-mechanics xgboost

Last synced: 29 May 2026

https://github.com/massimilianoviola/entity-matching-dblp-acm

Entity matching on the DBLP-ACM dataset

scikit-learn sentence-transformers

Last synced: 13 Jun 2026

https://github.com/mpoojithavigneswari/bangalore-house-price-prediction

This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.

data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/aleksandarbuk/machine-learning

The Machine Learning Library repository provides a collection of scripts and tools leveraging Scikit-Learn, Pandas, and NumPy for various machine learning tasks and data analysis.

matplotlib numpy python scikit-learn tensorflow

Last synced: 16 Apr 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Sharing all the data pipelines and processing code, statistical models, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air). Project milestone: 2017 - 2020. Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/namratha2301/bangalorehousepricepredictor

Predicting house price in Bangalore based on the key features of the house like number of rooms, size in square feet etc.

azure bashscript docker flake8 flask github-actions scikit-learn

Last synced: 12 Apr 2026

https://github.com/charlescro/reddit-classification-nlp

Analyzing subreddit language via Reddit API and NLP techniques.

data-analysis data-science data-visualization nlp-machine-learning reddit-api scikit-learn

Last synced: 03 Apr 2025

https://github.com/filsan95/project-iot_malware_identification

This repository contains the code and data for a project that detects malware from IoT devices using a publish-subscribe model with Confluent and Databricks. The project streams IoT device data to Kafka, analyzes it, and detects malware using machine learning models such as Random Forest and Gradient Boosted Trees.

apache-kafka classification confluent databricks machine-learning-algorithms scikit-learn sql

Last synced: 16 Mar 2025

https://github.com/karimosman89/health-risk-assessment

Predict health risks based on patient data. Create a machine learning model that predicts health risks (like diabetes or heart disease) based on patient data. Help healthcare providers identify at-risk patients for early intervention.

ehr-data pandas python scikit-learn

Last synced: 06 May 2026

https://github.com/gangula-karthik/bank-transaction-classification

Classifying bank transactions with precision—your first step towards smarter finance management 💳🤖📊

finance machine-learning nlp scikit-learn

Last synced: 09 Apr 2025

https://github.com/otuemre/housepricingml

A machine learning project predicting house prices using regression models. Covers data preprocessing, feature engineering, and model comparison to achieve accurate results. Developed for a Kaggle competition, focusing on effective ML workflows and model interpretability.

eda encoding evaluation-metrics kaggle-competition lightgbm-regressor machine-learning matplotlib-pyplot neural-networks numpy pandas preprocessing python ridge-regression scikit-learn seaborn tensorflow xgboost-regression

Last synced: 13 Apr 2026

https://github.com/the-developer-306/fake-review-detector

This project is a machine learning-based review classification system that predicts whether a product review is GENUINE or FAKE. It preprocesses review text, analyzes sentiment, and uses numerical features like ratings and helpfulness to make predictions. The model is deployed via a Flask web application for user interaction.

classification flask logistic-regression machine-learning numpy pandas python renderdeploy scikit-learn sentiment-analysis

Last synced: 12 Apr 2026

https://github.com/m-rishab/job-recruitment-prediction-and-hr-dashboard-using-plotly

This project's features make it ideal for dynamic HR dashboards, offering insights into candidate profiles and recruitment processes.

correlation-analysis flask kmeans-clustering numpy pandas plotly python scikit-learn seaborn standardscaler

Last synced: 12 Apr 2026

https://github.com/hvalfangst/azure-functions-pandas

Azure Functions for ETL operations using Pandas. Uploaded CSV files trigger data processing, calculating correlations and storing results in a JSON file. Automated deployment via GitHub Actions and Terraform.

az-204 azure azure-functions azure-functions-python pandas python scikit-learn terraform

Last synced: 12 Apr 2026

https://github.com/raghavendra317/brain-tumor-detection

A deep learning-based project for classifying brain MRI images into Glioma, Meningioma, No Tumor, and Pituitary Tumor using Convolutional Neural Networks (CNNs). This project aims to assist in early tumor detection, improving diagnostic accuracy and healthcare outcomes.

cnn-classification keras numpy opencv scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/arijit-7612/sms-spam-detection

A deep learning–based SMS Spam Detector built with BiLSTM and Keras TextVectorization. The model classifies messages as Spam or Ham with high accuracy and is deployed on Streamlit for real-time text classification with a clean and interactive user interface.

pandas python scikit-learn seaborn streamlit tensorflow

Last synced: 12 Apr 2026

https://github.com/supriya811106/twitter-sentiment-analysis

Analyzing the mood of tweets! We sort tweets on popular topics into positive, negative, or neutral categories to gauge public opinion. See what Twitter really thinks!

bernoulli-naive-bayes jupyter-notebook matplotlib nlp-machine-learning nltk numpy pandas python scikit-learn seaborn sentiment-analysis text-classification tfidf-vectorizer wordcloud

Last synced: 05 Apr 2026

https://github.com/hrolive/recommendation-systems-ibm

Analyze the interactions that users have with articles on the IBM Watson Studio platform and make recommendations to them about new articles, using various recommendation engines.

machine-learning natural-language-processing pandas python recomendation-system scikit-learn

Last synced: 12 Apr 2026

https://github.com/rubada/machine-learning-with-ruba-dabbas

Advance your skills and start your career here, by taking the online courses on Intuidemy.

course learning machine machine-learning matplotlib matplotlib-pyplot models numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/shreyazh/respire

To deal with Respiratory Challenges

machine-learning matplotlib numpy pandas pillow python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/abdelrahman-amen/housing-price

Predicting housing prices with machine learning regression models. This project implements Linear Regression, Random Forest, and Decision Tree models for accurate predictions.

decision-tree housing-price-prediction linear-regression machine-learning python random-forest regression-analysis scikit-learn

Last synced: 07 May 2026

https://github.com/adam-maz/virtual_screening

Within this repository I present scripts that can be helpful during virtual screening in drug design & development.

clusterization jupyter-notebook k-means-clustering maestro-schrodinger medicinal-chemistry molecular-fingerprints pandas python rdkit scikit-learn scoring-functions virtual-screening

Last synced: 04 May 2026

https://github.com/murugavl/crop-prediction

This Crop Prediction System utilizes machine learning to recommend suitable crops based on environmental data. It helps farmers make informed decisions by analyzing factors like soil type and climate. The system aims to enhance agricultural efficiency and productivity.

flask machine-learning python scikit-learn

Last synced: 12 Jun 2025

https://github.com/satyavardhan2k4/medical-insurance-predictor

A linear regression model that predicts medical insurance cost based on features such as age, sex, and BMI. The dataset values are based in the US.

machine-learning pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/santiago-giordano/datascienceproject

Data Science Course Project: Causes of death around the world

apis jupyter-notebook matplotlib pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/mianmharoon/sentimentanalysis_coreml_emotionclassifier

Emotion classification iOS app using CoreML and SwiftUI – demo for sentiment and emotion analysis, with the model converted from Scikit-learn using coremltools.

ai coreml coreml-models emotionclassification ios machinelearning nlp python3 scikit-learn sentimentanalysis swift swiftui

Last synced: 12 Apr 2026

https://github.com/purcellcjp/credit-risk-classification

This project utilized Python and scikit-learn libraries to train and evaluate a machine learning model based on loan risk.

machine-learning numpy pandas-dataframe python scikit-learn

Last synced: 12 Apr 2026

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural networks to uncover key factors that contribute to successful protein structure discovery using Python and R.

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 02 Feb 2026

https://github.com/mpolinowski/scikit-wine-quality

Predicting Wine Quality with Several Classification Techniques using SciKit Learn.

feature-classifiers python scikit-learn

Last synced: 18 May 2026

https://github.com/bhaveshbhakta/diabetes-prediction

Note* The hosted website link might take some time to load. Please be patient while the application initializes.

diabetes-prediction flask machine-learning python scikit-learn svm web-development

Last synced: 12 Apr 2026

https://github.com/francislauriano/chatsoftex

Platform developed in Python that aims to automate and speed up the evaluation of technological innovation projects, using artificial intelligence and standardized criteria based on the Lei do Bem.

cryptography fernet firebase flask flask-jwt-extended hugging-face-transformers numpy openai pdfplumber postgresql pyjwt pymupdf-fitz pypdf2 python pytorch scikit-learn scipy spacy sqlalchemy tensorflow

Last synced: 03 Apr 2026

https://github.com/mpoojithavigneswari/sentiment-analysis

The primary goal of this project is to build a sentiment analysis model that can predict the sentiment of a given review (positive or negative).

deep-learning keras machine-learning nlp python rnn-lstm scikit-learn tensorflow

Last synced: 04 Feb 2026

https://github.com/michal-wrzosek/real-estate-price-prediction

WREPPA - Warsaw Real Estate Price Prediction App (Machine Learning)

flask jupyter-notebook machine-learning neural-network node-js real-estate scikit-learn

Last synced: 12 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_06

Module 6. Unsupervised learning. Clustering. KMeans. Principal Component Analysis

dbscan-clustering hdbscan-clustering kmeans kmeans-clustering opentsne optics-clustering pca python scikit-learn tsne

Last synced: 18 Apr 2026

https://github.com/markdouthwaite/py-lingo

Utilities for helping you deploy Scikit-Learn models in Go (with lingo!)

hdf5 linear-models scikit-learn

Last synced: 25 Feb 2026

https://github.com/sravyatogarla/movie-recommendation-system

A complete Movie Recommendation System project implementing Popularity-Based, Content-Based, and Collaborative Filtering models using the MovieLens dataset. Built with Python, Pandas, and Plotly, featuring interactive inputs and visualizations.

capstone-project collaborative-filtering content-based-filtering data-science data-visualization edureka jupyter-notebook machine-learning movie-recomendation-system movielens pandas popularity-based-filtering python recommender-system scikit-learn sql

Last synced: 13 Apr 2026