An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.

matplotlib numpy pandas python3 scikit-learn seaborn titanic-dataset titanic-kaggle titanic-survival-prediction

Last synced: 10 Apr 2026

https://github.com/jersongb22/datascience_mlops_movierecommendations_project

Simulating a Data Scientist's role in a startup aggregating streaming platforms. Building movie queries and ML-based recommendation system with MLOps focus. ML model web app deployed with Render.

data-science fastapi machine-learning matplotlib pandas python render scikit-learn stopwords

Last synced: 10 Apr 2026

https://github.com/salmandeveloperz/ml_house_prediction

project for house price prediction using Classification & Regression models. Includes Docker setup for easy deployment.

classification-model clustering deep-learning machine-learning matplotlib numpy pandas python3 regression-models scikit-learn

Last synced: 10 Apr 2026

https://github.com/karimosman89/legal-document-nlp

Create a tool that uses NLP to extract key information from legal documents, contracts, or agreements.Use NLP techniques for named entity recognition and text classification.Streamline the review process for legal teams by automating information extraction.

nltk python scikit-learn spacy

Last synced: 11 Apr 2026

https://github.com/soumyagautam/sign-sense

Deep Learning and Neural Network based Sign Sense or 'Sign Language' to Speech converter is an desktop app which can detect hand signs in a frame and can convert them to Speech, according to their respective meaning. Opposite to this, it can also recognise your voice and can convert it to sign language.

ai cv2 dataprocessing deep-learning keras machine-learning mediapipe moviepy-library neural-network openai-whisper scikit-learn tensorflow tkinter-python

Last synced: 10 Apr 2026

https://github.com/camilajaviera91/prediction-of-housing-prices-using-linear-regression

This project provides tools to search for datasets on Kaggle, download and preprocess them, and perform predictions using a Linear Regression model. It includes interactive text-based user interfaces built with `curses`.

curses kaggle linear-regression matplotlib-pyplot mean-absolute-error mean-square-error numpy pandas pathlib python scikit-learn train-test-split

Last synced: 10 Apr 2026

https://github.com/alessandromonolo/descriptive-texts-classification-by-usage-purposes-of-estate-properties

The project aims to identify the best model for the classification of texts derived from descriptions of assets subject to Italian judicial auctions. The employed models include both conventional models, such as Logistic Regression, Naive Bayes, SVM, and XGBoost, and neural network models, such as Fasttext and XLM-Roberta.

fasttext logistic-regression naive-bayes nlp python pytorch scikit-learn seaborn spacy svm text-classification tfidf tokenizer xgboost xlm-roberta

Last synced: 08 Apr 2026

https://github.com/jibbs1703/classic-ml-models

This repository contains scripts for developing, training and evaluating machine learning models using several python frameworks.

aws data-preprocessing data-science deep-learning feature-engineering machine-learning multiclass-classification neural-networks predictive-modeling pyspark-mllib pytest scikit-learn xgboost-classifier

Last synced: 10 Apr 2026

https://github.com/aryansk/fake-news-detection

A sophisticated machine learning solution to detect fake news using multiple classification algorithms. Identify the credibility of news articles with advanced text analysis techniques!

fake-news-detection machine-learning machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning

This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.

decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc

Last synced: 18 Jan 2026

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal of predicting the sale price of bulldozers. How well can we predict the future sale price of a bulldozer, given its characteristics previous examples of how much similar bulldozers have been sold for? (Archive kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/g-eoj/cv-tl-keras

Use the cross validation functions from scikit-learn to evaluate image classification transfer learning with Keras models.

cross-validation keras numpy scikit-learn transfer-learning

Last synced: 10 Apr 2026

https://github.com/aditya172926/text_summarization

Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.

glove machine-learning nlp python scikit-learn spacy

Last synced: 05 May 2026

https://github.com/machinelearningprodigy/covid-19-detection-system

It is a Flask-based web application that predicts the likelihood of COVID-19 infection based on user symptoms. The app utilizes a K-Nearest Neighbors (KNN) model trained on relevant medical features to assess COVID-19 risk.

algorithms feature-engineering flask machine-learning numpy pandas scikit-learn

Last synced: 20 Apr 2026

https://github.com/mayankmittal29/stockvision

Stock price predictor LSTM Sequential Model with Dropout Regularization by which we can analyse any stock tickers, do its fundamental analysis using fundamental ratios and charts visualisations of 100MA and 200MA and can also predict stock price for next 10 days with its trend. Can also view candle stick charts for stock trading and latest news.

keras lstm-neural-networks matplotlib-pyplot mplfinance numpy pandas python scikit-learn streamlit yfinance-api

Last synced: 07 Apr 2026

https://github.com/takkii/rock_ptarmigan

[PyPi] Approximate value evaluation 📊

approximate keras pca rock-ptarmigan scikit-learn tensorflow

Last synced: 10 May 2026

https://github.com/hopquangdo/minisklearn

✨ A lightweight Python-from-scratch implementation of core Machine Learning algorithms inspired by scikit-learn.

machine-learning scikit-learn

Last synced: 09 Jul 2025

https://github.com/yessasvini23/deepfake_immunization_toolkit

🛡️ AI-powered toolkit to detect deepfakes, educate users, and verify content authenticity using federated learning and blockchain. Built for election security, media integrity, and digital literacy.

blockchain matplotlib numpy opencv python pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/mdalamin5/machine-learning-2.0

Machine-Learning-2.0: A comprehensive repository documenting my journey to master ML from scratch. It includes core algorithms, advanced techniques, data preprocessing, feature engineering, and real-world projects. Follow my structured approach, inspired by "100 Days of ML," featuring Python implementations, tools, and insightful resources.

data-fetching-from-api datapreprocessing end-to-end-project feature-engineering gradient-descent-optimizers machine-learning-algorithms scikit-learn webscraping-data

Last synced: 21 Apr 2026

https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon

An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.

matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud

Last synced: 11 Apr 2026

https://github.com/pockerman/tech3python

Collection of Python based algorithms on numerics, statistics, control etc

algorithms control estimation kalman-filter machine-learning numerical-methods particle-filter python3 scikit-learn statistics

Last synced: 18 May 2026

https://github.com/ksatrajit0/heart-disease-prediction-ml

Predicts the risk of heart attack in a patient using their medical record

heart-disease-prediction machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 19 Apr 2026

https://github.com/sizzlins/kalkulator-ai

A Simple Command Line Input Symbolic Regression Engine and Computer Algebra System (CAS) capable of discovering the laws of the universe, solving calculus, algebra, and trigonometrics.

calculator calculus cli computer-algebra-system curve-fitting machine-learning mathematics numpy physics python scientific-computing scikit-learn sparse-regression symbolic-regression sympy

Last synced: 13 Jan 2026

https://github.com/tasninanika/callifornia-housing-price-prediction-svr

Support Vector Regression (SVR) is a type of Support Vector Machine used for predicting continuous values.

matplotlib numpy pandas python3 scikit-learn seaborn svm-regression

Last synced: 11 Apr 2026

https://github.com/offchan42/thai-thesis-classification

Classify each document inside the corpus using Python machine learning module: scikit-learn

nlp python python2 scikit-learn segment thai thai-language thai-thesis-classification

Last synced: 13 Aug 2025

https://github.com/shanmukhsrisaivedullapalli/automatic-ticket-classification

This project processes customer complaint data using pandas for data manipulation and applies text preprocessing techniques, including lemmatization, to clean and normalize complaint text. The `tqdm` library provides progress bars for efficient tracking of text processing tasks.

matplotlib neural-networks nlp numpy pandas python3 scikit-learn seaborn tensorflow tqdm wordcloud

Last synced: 11 Apr 2026

https://github.com/haloapping/ml-workflow

Template alur kerja machine learning.

mahine-learning numpy pandas python3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/tasninanika/k-means-clustering

An interactive and insightful customer segmentation project using K-Means Clustering.

matplotlib numpy pandas plotly python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/vatshayan/hospital-discharge-analysis

Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc

data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn

Last synced: 04 Mar 2025

https://github.com/tasninanika/mammographic-masses-analysis-dt

This project uses a Decision Tree Classifier to predict whether a detected mammographic mass is benign (0) or malignant (1) based on input features.

decision-tree-classifier numpy pandas pyhton3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/selcia25/sleep-disorder-detection

💤This project aims to develop an automated method for detecting sleep disorders from heart rate signals.

cnn-classification kmeans-clustering machine-learning matplotlib scikit-learn scipy sleep-disorders tensorflow

Last synced: 05 Jan 2026

https://github.com/vicperal/ai-genai_projects

Python projects about LLM and ML use cases. I am using modules such as Pandas, Numpy, Plotly, scikit-learn, Transformers, Flask, JSON, etc. to analyze data, predict, generate insights and create text from models such as LLMs, linear regression, assembly methods, etc. Server- Front-End using Flask

assembly clinical-trials flask json linear-regression llm ml numpy pandas plotly price-prediction python rag random-forest scikit-learn sentimental-analysis sql text-summarization tokens-counter transformers

Last synced: 02 Apr 2026

https://github.com/pkini2002/hpe_cty

Repository to maintain the learnings of the technologies used for the CTY'23 Project Work provided by HPE

computer-networks docker docker-container linux python scikit-learn swarm-learning ubuntu

Last synced: 07 Apr 2026

https://github.com/amirjahantab/iris_classification

This project analyzes the famous Iris dataset using various machine learning techniques. The goal is to classify the iris flowers into three species: Setosa, Versicolor, and Virginica based on the features provided in the dataset.

classification data-science machine-learning scikit-learn

Last synced: 16 May 2026

https://github.com/ishutak/disease_prediction

An AI-powered disease prediction system that uses machine learning to predict diseases based on symptoms. The system employs an ensemble of models including Random Forest and Neural Networks to provide accurate predictions with confidence levels.

css3 htlm5 javascript jquery numpy pandas pytorch scikit-learn select2

Last synced: 11 Apr 2026

https://github.com/aarryasutar/logistic_regression_on_age_prediction

This code evaluates the performance of a logistic regression model on age prediction using various features to predict a binary target variable, calculating metrics to determine the performance. It evaluates the comparison, identifies favorable features, and visualizes the ROC-AUC curve to determine the best model performance.

accuracy-score confusion-matrix f1-score feature-selection logistic-regression model-training numpy pandas precision recall rmse roc-auc-curve scikit-learn visualization

Last synced: 20 Jan 2026

https://github.com/elazzouzihassan/si-fraud-detection-prototype

Système de Détection des Fraudes avec Python (Prototype).

googlecolab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/gaurangdave/house_price_predictions

Machine Learning Application to predict House Prices

hands-on learning-by-doing machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 05 Jan 2026

https://github.com/victorkiosh/fake-news-detection

Detecting fake news using NLP and machine learning (Logistic Regression, Random Forest, XGBoost)

data-science fake-news-detection machine-learning nlp scikit-learn xgboost

Last synced: 18 May 2026

https://github.com/saro0307/pre-doctor-ai-model

Pre-Doctor is an AI-driven health advisor using sci-kit-learn, offering quick medical advice based on user-input symptoms, making healthcare accessible and user-friendly. Utilizing Flask and pyttsx3, it seamlessly integrates machine learning for informed well-being.

artificial-intelligence css flask generative-ai generative-model html machine-learning python reinforcement-learning scikit-learn

Last synced: 07 Apr 2026

https://github.com/nordszamora/predictive_lung_cancer

The lung cancer predictive ML project is use to predict a cancer based on the data of smoking intake and common symptoms with low cost.

bootstrap django django-rest-framework python reactjs rest-api scikit-learn vite

Last synced: 11 Apr 2026

https://github.com/farrajota/kaggle_titanic

My solutions to the "Titanic: Machine Learning from Disaster" kaggle competition

docker docker-compose kaggle kaggle-competition kaggle-titanic notebook pyspark python scikit-learn

Last synced: 11 Apr 2026

https://github.com/dinhanhx/determination

Scripts to set global random for some machine learning framework

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 12 Apr 2026

https://github.com/rs2416/Detecting_Social_Anxiety

This repository contains the full dataset and code needed to recreate the classification models and reproduce the results within this paper: https://formative.jmir.org/2021/10/e32656/

jupyter-notebook machine-learning python scikit-learn social-anxiety

Last synced: 07 May 2025

https://github.com/rusiruchapana/blood-group-prediction

Creating a machine learning project to predict blood groups from fingerprint patterns

asp-net-web-api keras matplotlib numpy opencv-python pandas pillow scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/aryansingla45/cab-fare-prediction

This repository contains a Cab Fare Prediction Web App that uses machine learning to predict fares based on distance, time, and location, with Google Maps API integration for route and fare calculations. It features a user-friendly interface and secure authentication, powered by a Kaggle dataset.

flask-application google-maps-api html-css-javascript machine-learning machine-learning-algorithms pandas-numpy python scikit-learn weather-api

Last synced: 11 May 2026

https://github.com/ki3mono/naive_bayes_classifier

This project implements Naive Bayes Classifiers for two data types: Multinomial Naive Bayes Classifier and Gaussian Naive Bayes Classifier. Developed as part of the Probability Theory and Statistics course in the second year of Computer Science at AGH University of Krakow.

iris-dataset mushroom-dataset naive-bayes-classifier numpy python scikit-learn

Last synced: 06 May 2026

https://github.com/tharindanimnajith/deep-learning-spam-detection

Deep Learning classifiers to detect spam SMS messages - LSTM Model, DenseNet CNN Models - NLP, Python, Jupyter Notebook, Tensorflow, Keras, Numpy, Pandas, Matplotlib, Scikit-Learn

deep-learning densenet keras lstm nlp python3 scikit-learn tensorflow

Last synced: 05 Apr 2026

https://github.com/nordszamora/sentiment

This ML project is used to predict the text sentiment positive, negative or neutral. Its used a Reddit dataset in kaggle to train the model and make a prediction on new/unseen data.

django joblib kaggle naive-bayes-classifier pandas python redditdata scikit-learn

Last synced: 11 Apr 2026

https://github.com/rayyan9477/data-driven-house-price-prediction-and-property-recommendation-app

The app leverages algorithms to accurately predict house prices and recommend similar properties based on a saved dataset through content-based filtering. It is tailored for homebuyers seeking their dream house and real estate investors looking for profitable opportunities, providing powerful insights and data-driven decision-making support.

data-science eda html machine-learning numpy pandas python scikit-learn

Last synced: 06 Apr 2026

https://github.com/palak-463/tablataalrecognitionsystem

Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument. 🎛️

cnn deep-learning flask fnn librosa numpy os pickle python scikit-learn

Last synced: 11 Apr 2026

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 11 Apr 2026

https://github.com/michael-insights/portfolio

This repository showcases my projects and skills in Data Analytics, Data Science, and Machine Learning. It includes hands-on work in data analysis, predictive modeling, and machine learning algorithms, aimed at solving real-world problems.

data-analytics data-science data-visualization datapreprocessing jupyter-notebooks machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn sql

Last synced: 11 Apr 2026

https://github.com/troublem1/mle

MultiLabel-Transformer(MLE) is an extended version of a LabelEncoder, such that, it encodes multiple categorical columns to numeric in any workflow or pipeline

packages python3 scikit-learn sklearn

Last synced: 05 Jan 2026

https://github.com/djleamen/music-mood-matcher-demo

An intelligent music recommendation system that analyzes your mood through natural language and suggests personalized playlists using advanced AI and machine learning techniques.

demo machine-learning matplotlib ml natural-language-processing nlp nltk nltk-python numpy openai pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model

This project implements a spam detection system for SMS messages using machine learning techniques.

mathplotlib nltk numpy panda python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/alam025/ai-email-guardian

🛡️ AI-Powered Email Guardian: 99.2% accurate spam detection using machine learning. Open-source, privacy-focused email security. ⚡ 50ms detection time.

artificial-intelligence email-filter email-security hishing-detection machine-learning-cybersecurity nlp open-source privacy python scikit-learn security-tools spam-detection tensorflow text-classification

Last synced: 10 Mar 2026

https://github.com/md-emon-hasan/ml-project-car-price-prediction

🚗 End-to-end ML project for predicting car prices based on various features. Includes data preprocessing, model training, and a Flask web for predictions.

car-price-prediction car-price-predictor data-science feature-engineering ml predictive-modeling scikit-learn

Last synced: 10 Mar 2026

https://github.com/fadlyhts/klasifikasistuntingbalita

Sistem Klasifikasi Status Stunting menggunakan Machine Learning untuk mendeteksi status gizi balita berdasarkan BB/U, TB/U, dan BB/TB. Dibangun dengan Python, Flask, dan Scikit-learn.

child-health data-science flask healthcare indonesia-healthcare machine-learning medical-informatics python scikit-learn stunting-classification

Last synced: 20 Mar 2025

https://github.com/lfenzo/ml-solar-sao-paulo

Implementation of scientific project regarding the use of Machine Learning in Solar Radiation Prediction

forecasting machine-learning python scikit-learn

Last synced: 11 Apr 2026

https://github.com/aarryasutar/hate_speech_detection

This project aims to detect hate speech on Twitter using advanced NLP and machine learning techniques, exploring feature extraction methods like TF-IDF and sentiment analysis, and evaluating models such as Logistic Regression and SVM.

confusion-matrix doc2vec gensim logistic-regression matplotlib naive-bayes nltk numpy pandas python random-forest scikit-learn seaborn stemming stopwords-removal svm tf-idf-vectorizer tokenization vader word-cloud

Last synced: 09 Apr 2026

https://github.com/daniel1kp/diamond-price-predictor

💎Diamond Price Predictor is an user-friendly web application that simplifies the analysis and prediction of diamond prices.

altair pandas python scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/bestmahdi2/uni__decisiontreesinpythonpredictingdiabetes

A university project designed in Python decision tree for diabetes diagnosis

decision-tree diabetes-prediction graphviz pandas prediction pydotplus python scikit-learn

Last synced: 11 Apr 2026

https://github.com/rixiiz/knn-model-to-predict-the-obp-of-mlb-players

A machine learning model I built that can predict the On Base Percentage (OBP) of Major League Baseball (MLB) players at the end of the season

artificial-intelligence dataset f1-score jupyter-notebook knn-regression machine-learning matplotlib mse numpy pandas python scikit-learn supervised-learning

Last synced: 01 Jul 2025