An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/tom-uchida/introduction_to_machine_learning

Machine learning private lesson.

machine-learning scikit-learn

Last synced: 28 Apr 2026

https://github.com/belsabbagh/employee-turnover-and-customer-churn-classification

A data science project that tests mutliple models on an employee tunronver and customer churn problem

machine-learning pandas python scikit-learn

Last synced: 28 Apr 2026

https://github.com/skypse/santander-coders-data_science-course

Curso de Data Science, proposto pelo Satander, utilizando Python!

jupyter-notebook numpy pandas-python python scikit-learn

Last synced: 29 Apr 2026

https://github.com/shibin08/sentiment-analysis-movie-reviews

A sentiment analysis project on IMDb movie reviews using Natural Language Processing (NLP) techniques. Text data is cleaned, vectorized using TF-IDF, and classified using machine learning models like Logistic Regression and Random Forest. Achieved high accuracy in distinguishing positive and negative reviews.

logistic-regression machine-learning movie-reviews natural-language-processing random-forest scikit-learn sentiment-analysis text-classification tf-idf

Last synced: 29 Apr 2026

https://github.com/vaishnavijain25/pca-based-digit-classification

A machine learning project that uses Principal Component Analysis (PCA) for dimensionality reduction and Logistic Regression for classifying handwritten digit images from the scikit-learn digits dataset.

digit-recognition dimensionality-reduction image-classification logistic-regression machine-learning pca-analysis scikit-learn

Last synced: 29 Apr 2026

https://github.com/fx31337/predict_zigzag

Prototype code to predict zigzag pattern prices.

machine-learning ml scikit-learn

Last synced: 29 Apr 2026

https://github.com/adnanrahin/sentiment_classification_logistic_regeression

Sentiment Analysis extracts subjective information in the source material. It's widely used in modern business, to understand the business module, product quality and consumer point of view regarding the products or the business.

logistic-regression machine-learning natural-language-processing preprocessing python3 scikit-learn

Last synced: 29 Apr 2026

https://github.com/mateluky/covid19-patient-status-prediction

Machine learning model to predict COVID-19 patient status from clinical data, using Python and scikit-learn for healthcare decision support.

classification clinical-decision-support covid19 data-science disease-prediction healthcare jupyter-notebook machine-learning medical-data open-source python scikit-learn

Last synced: 29 Apr 2026

https://github.com/saikumar787/car_price_prediction_using_linear-regression

A machine learning project to predict the selling price of used cars using regression techniques. Includes data preprocessing, model training, evaluation, and testing on new data.

car-price-prediction-with-machine-learning data-analysis joblib jupiter-notebook linear-regression-models model-deployment python scikit-learn standardscaler

Last synced: 29 Apr 2026

https://github.com/inclinedadarsh/regression-metrics

A simple jupyter notebook demonstrating how to use different metrics from 'scikit-learn' library.

jupyter-notebook machine-learning notebook scikit-learn

Last synced: 29 Apr 2026

https://github.com/andreaschatzopoulos/face-landmark-detector

Facial landmark detection using HOG features and Ridge Regression. Simple, effective, and fast – no deep learning required.

computer-vision face-detection hog image-processing landmark-detection python ridge-regression scikit-learn

Last synced: 29 Apr 2026

https://github.com/matheusvazdata/retail-sales-forecast-linreg-sklearn

Minimal project for retail sales forecasting using linear regression (scikit-learn).

forecasting linear-regression machine-learning matplotlib numpy pandas scikit-learn

Last synced: 29 Apr 2026

https://github.com/mertafacan/fertilizer-prediction-kaggle-playground-s05e06

Top 9% in Kaggle Playground Series - Predicting Optimal Fertilizers - Season 5, Episode 6

catboost kaggle kaggle-competition machine-learning optuna scikit-learn xgboost

Last synced: 29 Apr 2026

https://github.com/mukeshthenraj/fraud-detection-model

Logistic Regression, Grid Search, and ROC-PR curve evaluation on fraud detection dataset

classification fraud-detection machine-learning numpy pandas scikit-learn

Last synced: 29 Apr 2026

https://github.com/xbants/recommendation-api

🎬 Intelligent movie recommendation system with FastAPI backend, Streamlit frontend, and collaborative filtering ML. Rate movies, get personalized suggestions, and enjoy automatic model retraining.

fastapi machine-learning movie-recommedation python3 scikit-learn streamlit

Last synced: 29 Apr 2026

https://github.com/tasnimtalha09/la-crime-analysis-from-2020-to-2025

As part of an academic project, this analysis dig into the crime statistics of Los Angeles Police Department (LAPD) from the year 2020 till 2025.

jupyter jupyter-notebook jupyter-notebooks machine-learning matplotlib pandas python python-3 python3 scikit-learn seaborn sklearn

Last synced: 29 Apr 2026

https://github.com/tbarlow12/learn-it-your-way

Using Python Flask, I wanted to create a simple web API that allows users to upload a dataset, choose one or more models, store them server side, and then hit an endpoint to get a prediction.

flask machine-learning python scikit-learn tensorflow

Last synced: 29 Apr 2026

https://github.com/rishi-sutar/healwise-ai-your-way-to-wellness

Healwise-AI is a health diagnostic tool that uses a Support Vector Classifier (SVC) model to predict diseases based on user-reported symptoms. After predicting, it offers detailed health advice, including descriptions, diets, medications, and workouts related to the diagnosis.

machine-learning scikit-learn support-vector-machine

Last synced: 30 Apr 2026

https://github.com/jarif87/tune-popularity-app

Flask web app to predict song popularity using CatBoost. Enter five song features for instant predictions. Modern, responsive UI, no CSRF for development.

catboost-classifier eda flask-application matplotlib-python music-classification python scikit-learn seaborn

Last synced: 30 Apr 2026

https://github.com/sjain2580/simple-linear-regression-model

This project demonstrates a simple, yet robust, multiple linear regression model built with Python and scikit-learn to predict median house values in California.

joblib linear-regression matplotlib matplotlib-pyplot numpy python scikit-learn

Last synced: 30 Apr 2026

https://github.com/andrewjmack/cryptoclustering

The purpose of this project is to utilize knowledge of Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes. Methods for analysis include K-Means clustering and dimensional reduction through Principal Component Analysis ("PCA").

jupyter-notebook pandas python scikit-learn

Last synced: 30 Apr 2026

https://github.com/maguids/supervised-learning---video-games

This project consists on exploratory data analysis and the application of supervised learning models for classification using a Video Games dataset. Second Semester of the First Year of the Bachelor's Degree in Artificial Intelligence and Data Science.

jupyter-notebook machine-learning matplotlib numpy pandas scikit-learn seaborn supervised-learning

Last synced: 30 Apr 2026

https://github.com/pramodyasahan/grade-predictor

This project aims to predict student performance based on various features such as job, study time, failures, absences, and first and second period grades. The project utilizes a linear regression model from the scikit-learn library in Python.

machine-learning matplotlib numpy pandas python regression scikit-learn

Last synced: 30 Apr 2026

https://github.com/smakde/learning-resource-recommender

A lightweight recommender that helps you discover your next learning resource. It blends patterns from similar users with content keywords, and explains each suggestion in the UI.

als content-based-filtering evaluation-metrics explainable-ai hybrid-recommender implicit-feedback implicit-lib lightfm logistic-matrix-factorization mapk matrix-factorization ndcg pandas precision-at-k python recommender-system scikit-learn streamlit tf-idf top-n-recommendations

Last synced: 30 Apr 2026

https://github.com/fikri-rouzan/student-stress-levels-classification

Proyek pemodelan machine learning untuk mengklasifikasikan tingkat stres mahasiswa berdasarkan parameter input akademik dan psikologis.

joblib jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn streamlit

Last synced: 08 Jun 2026

https://github.com/fikri-rouzan/burnaway-capstone-data-science

Dashboard analitik interaktif untuk memetakan faktor fisik dan pola kerja pemicu burnout pada software developer.

jupyter-notebook matplotlib pandas pillow plotly python scikit-learn seaborn statsmodels streamlit

Last synced: 08 Jun 2026

https://github.com/abhivur/connections-ai

Contributors: Meet Gamdha, Gaurav Nimmagadda

bert python scikit-learn word2vec

Last synced: 30 Apr 2026

https://github.com/fbarffmann/credit-risk-classification

Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.

classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn

Last synced: 30 Apr 2026

https://github.com/boladjivinny/fire-prediction

Notebook for the Fire fighting using data on Zindi. Ranked number 5 on the public leaderboard and 8 on the private leaderboard. https://zindi.africa/hackathons/cmu-africa-fighting-fire-with-data

feature-engineering hackhathon machine-learning regression scikit-learn stacking

Last synced: 30 Apr 2026