An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/suvasish114/house-price-estimation

A machine learning model that estimate housing prices in California using the California census data

jupyter-notebook machine-learning python scikit-learn

Last synced: 09 May 2026

https://github.com/bhoomikaniranjan/pulmotrainer

A Deep Learning-based Lung Cancer Detection application using a 3D CNN model with TensorFlow and OpenCV, featuring an interactive Tkinter GUI for easy data processing and training.

matplotlib numpy-pandas opencv python scikit-learn seaborn tensorflow-keras

Last synced: 09 May 2026

https://github.com/mpolinowski/fisher-discriminant-analysis

LDA is a widely used dimensionality reduction technique built on Fisher’s linear discriminant.

linear-discriminant-analysis matplotlib-pyplot python scikit-learn

Last synced: 10 May 2026

https://github.com/hassanislam463/nyc_airbnb_eda

This project is a comprehensive data analysis of Airbnb listings in New York City, exploring pricing trends, seasonality effects, host market dynamics, rental preferences, and revenue estimation. It provides valuable insights for hosts, investors, and policymakers to optimize Airbnb operations and understand the short-term rental landscape in NYC.

exploratory-data-analysis matplotlib python scikit-learn seaborn

Last synced: 10 May 2026

https://github.com/vijaykumarr1452/ipl-first-innings-score-prediction-deployment

Deployment of IPL Score Prediction Analyser Model. https://github.com/vijaykumarr1452/IPL-First-Innings-Score-Prediction)

css deployment gunicorn html machine-learning ml predictive-analytics python scikit-learn

Last synced: 11 May 2026

https://github.com/anras5/criteo-search-data

EDA and statistical tests on CriteoSearchData dataset

data-science pandas scikit-learn statistics

Last synced: 11 May 2026

https://github.com/ananyagubba/bike-sharing-demand-prediction

Using machine learning techniques, the model learns from features such as weather conditions, time of day, season, and holiday information to forecast hourly or daily demand.

machine-learning python scikit-learn seaborn

Last synced: 11 May 2026

https://github.com/shubhamkarampure/asl-streamlit-signlingo

streamlit based web-app for teaching sign language through real-time hand gesture recognition.

learning-exercise mediapipe opencv-python python scikit-learn sign-language streamlit-webapp

Last synced: 12 May 2026

https://github.com/xunchiasg/nyc_property_sales

Exploratory Data Analysis of rolling property sales data in NYC from March 2023-2025

matplotlib-pyplot plotly python scikit-learn

Last synced: 12 May 2026

https://github.com/arjunan-k/medical_insurance

Project to analyze and forecast medical insurance costs of patients using data science framework.

medical-insurance scikit-learn tableau

Last synced: 12 Jun 2026

https://github.com/royxlead/production-drift-detection

Production ML monitoring library - KL, PSI, MMD, and ADWIN drift detectors with empirical benchmarks, confidence tracking, and a 6-page FastAPI dashboard.

data-drift drift-detection fastapi kl-divergence mlops mmd model-monitoring production-ml psi pytorch scikit-learn uncertainty-quantification

Last synced: 23 Jun 2026

https://github.com/adadalshabab/human-stress-analysis-greadsearch-classifier

The project leverages data from physiological signals, self-reported surveys, behavioral observations, or other relevant sources to infer and analyze stress levels.

classification knn-classification machine-learning machine-learning-algorithms matplotlib pandas scikit-learn

Last synced: 09 May 2026

https://github.com/z-fran/walmart-store-sales-forecasting

Data analysis and machine learning solution in Python for the Kaggle competition Walmart Recruiting - Store Sales Forecasting.

machine-learning sales-analysis sales-forecasting sales-prediction scikit-learn walmart-sales-forecasting

Last synced: 07 May 2026

https://github.com/saswatamcode/datascienceapi

This is a RESTful API built using Flask and Scikit-Learn. It provides a host of Classification and Regression algorithms that can be used readily and returns results in the form of predictions, confusion matrices, accuracy scores and more.

api flask ml python3 scikit-learn

Last synced: 07 May 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains solution for the 2022 Women in Data Science Kaggle competition that I participated in, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/andrewsy1004/linear-regression-model-for-house-price-prediction

A linear regression model to predict house prices based on features like size, location, and number of rooms. This project demonstrates the application of machine learning in real estate price estimation

linear-regression python scikit-learn xgbregressor

Last synced: 07 May 2026

https://github.com/henrytseng/example_docker_scikit-learn

A quick example of using Scikit-Learn from a Docker container

docker scikit-learn

Last synced: 08 May 2026

https://github.com/anusha-me/disease-x-detection-ml-project

A machine learning classification system for early detection of Disease X based on patient symptoms using Python, Scikit-learn, and Streamlit.

classification data-science disease-prediction healthcare-ai machine-learning medicaldata scikit-learn streamlit

Last synced: 08 May 2026

https://github.com/samjoesilvano/password_strength_prediction_using_nlp

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf

Last synced: 08 May 2026

https://github.com/jatin-mehra119/churn_modeling

This repository is dedicated to predicting customer churn using machine learning techniques. It includes comprehensive scripts for data preprocessing, model training, and evaluation, along with detailed visualizations and insights.

classification-model datavisualization pandas scikit-learn

Last synced: 08 May 2026