scikit-learn
scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2026-06-23 00:27:46 UTC
https://github.com/bhuvan-s-prasad/streamlit-regression
A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.
california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning
Last synced: 11 Apr 2026
https://github.com/sonnguyen25/hackbeanpot-2025
EarthBeats - An Eco-friendly Pocket Road Trip Companion
css flask googlemaps-api html humeai knn-model mongodb nextjs nodejs numpy pandas python reactjs recommender-system scikit-learn spotify tailwindcss
Last synced: 11 Apr 2026
https://github.com/parbhat-cpp/suicidal-ml
A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.
bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/pramodyasahan/model-selection
This repository explores and compares different regression models for predicting continuous outcomes. It includes implementations and evaluations of five key regression models. The primary goal is to demonstrate how each model works, evaluate its performance using R-squared values, and guide users in selecting the best model.
machine-learning modelselection numpy pandas python regression scikit-learn
Last synced: 08 Mar 2025
https://github.com/djdhairya/pneumonia-detection
https://youtu.be/1SQIrxhMuUs?si=lF2cg8eTnETf-5Qx
cnn cv deep-learning flask gunicorn keras matplotlib opencv pandas pillow scikit-learn seaborn tensorflow vgg19
Last synced: 11 Apr 2026
https://github.com/gurpreet0022/nlp_exploration
This repository explores various Natural Language Processing (NLP) techniques using the NLTK library in Python. It demonstrates these techniques on a sample dataset and performs sentiment analysis on movie reviews.
beginner-friendly nlp nlp-machine-learning nltk scikit-learn
Last synced: 30 Apr 2026
https://github.com/coder5omkar/logistic-regression-customer-churn-prediction
This project uses Logistic Regression to predict customer churn in the telecom industry. To run, clone the repository, install dependencies, and run the Jupyter notebook for full analysis and predictions.
logistic-regression ml pandas scikit-learn seaborn statistics
Last synced: 20 Apr 2026
https://github.com/abdiasarsene/customer_segmentation_for_a_marketing_campaign
Uses unsupervised learning techniques to segment a company’s customers into distinct groups in order to personalize marketing campaigns, with the ultimate aim of proposing specific marketing strategies for each customer segment based on the insights obtained.
acp kmeans-clustering matplotlib pandas plotly python scikit-learn seaborn
Last synced: 08 Mar 2025
https://github.com/paulinhok14/property-insight-sample
Property Insight is an app that helps you identify real estate opportunities, using AI models to estimate a property's fair value and compare it to current prices.
ai docker fastapi python pytorch real-estate scikit-learn streamlit
Last synced: 11 Apr 2026
https://github.com/jo-minseok/global-warming-100year
🌡️ ML predictions of global temperature, sea level, Arctic ice, and carbon through 2100 [complete]
arima-model global-warming machine-learning matplotlib numpy pandas scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/amandeep-gupta19/salaries-prediction-linear-regression-
Salary prediction using the linear regression algorithm
google-colab linear-regression machine-learning matplotlib-pyplot pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/bishopce16/credit_risk_analysis
An analysis to build and evaluate a dataset from LendingClub to discover credit risk using Supervised Machine Learning Algorithms.
imbalanced-learning jupyter-notebook machine-learning machine-learning-algorithms pandas python scikit-learn visual-studio-code
Last synced: 11 Apr 2026
https://github.com/anibalalpizar/python-machine-learning-example
This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.
machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 11 Apr 2026
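The three-way split and scaling described in the entry above can be sketched roughly as follows. This is not the repository's code: the synthetic data, the 60/20/20 ratio, and all variable names are assumptions made for illustration, and the optional oversampling step is left out.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Hypothetical stand-in data: 100 samples, 4 features, balanced binary labels.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 4))
y = np.array([0, 1] * 50)

# Split off 40% of the data, then halve that part into validation and test
# sets, giving an assumed 60/20/20 split. stratify keeps the class ratio.
X_train, X_rest, y_train, y_rest = train_test_split(
    X, y, test_size=0.4, stratify=y, random_state=0
)
X_valid, X_test, y_valid, y_test = train_test_split(
    X_rest, y_rest, test_size=0.5, stratify=y_rest, random_state=0
)

# Fit the scaler on the training set only, then reuse it for the other parts
# so that no statistics leak from validation or test data into training.
scaler = StandardScaler().fit(X_train)
X_train_s = scaler.transform(X_train)
X_valid_s = scaler.transform(X_valid)
X_test_s = scaler.transform(X_test)

print(X_train_s.shape, X_valid_s.shape, X_test_s.shape)
```

Fitting the scaler on the training part alone is a common convention; whether the repository does the same is not stated in its description.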
https://github.com/jofaval/game-of-thrones
Data analysis and prediction of Game of Thrones characters' survival, from 2016
classification data-analysis data-science data-visualization deep-learning game-of-thrones google-colab kaggle keras machine-learning matplotlib python scikit-learn seaborn tensorflow xgboost
Last synced: 11 Apr 2026
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone project by Skill Academy: exploratory analysis, visualization, and prediction of used car prices. Deploys the highest-scoring model with a Streamlit web app.
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/nfordumass/hot-seat
Machine Learning Dashboard and Engine for Predicting NFL Coach Firings
astro machine-learning react scikit-learn supabase typescript
Last synced: 09 Mar 2025
https://github.com/nikhilchaudhary1/commodity-price-prediction
A Python application for predicting commodity prices (e.g., Pulses, Bread) based on state, city, year, and month using a Linear Regression model. Trained on over 1 million government dataset entries, featuring efficient data processing and prediction capabilities.
commodity-price-prediction data-processing linear-regression machine-learning pandas python scikit-learn
Last synced: 20 Apr 2026
https://github.com/arnoldchrisoduor1/machines
Testing the limits of machines
pytorch scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/ayushtiwari134/machine_learning_models
A repo where I upload all the models I train during my journey of learning machine learning from scratch
linear-regression logistic-regression machinelearning matplotlib numpy pandas python random-forest scikit-learn
Last synced: 11 Apr 2026
https://github.com/g4brielvs/data-science-playground
Collection of data science and visualization exercises
boston-housing-dataset introduction-to-data-science plotly scikit-learn tensorflow
Last synced: 08 Apr 2026
https://github.com/broodhoney/titanic-ml-from-disaster
This repository contains my analysis and solutions for the Titanic: Machine Learning from Disaster competition on Kaggle. The notebook explores the dataset, performs extensive Exploratory Data Analysis (EDA), applies feature engineering techniques, and builds predictive models to determine survival outcomes based on passenger data.
machine-learning numpy pandas python scikit-learn scikitlearn-machine-learning
Last synced: 11 Apr 2026
https://github.com/shru924/ecommerce_customer_behavior_analysis
A machine learning project that analyzes and segments e-commerce customers based on behavior patterns using Python, Random Forest, and data visualization.
customer-segmentation data-analysis jupyter-notebook machine-learning matplotlib pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/jgavinb/customer-churn-ml
Customer Churn prediction using various ML models. Interactive predictions via Streamlit webapp.
joblib machine-learning pkl python scikit-learn streamlit streamlit-application streamlit-webapp
Last synced: 11 Apr 2026
https://github.com/ansh-info/industrial-scale-penicillin-simulation
Optimizing industrial-scale penicillin production using machine learning and data analysis.
jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/ahmedshahriar/restaurant-menu-pricing
Predict menu prices from 5M+ UberEats menus with an end-to-end MLOps pipeline: crawl → DWH → curate → train → deploy on Azure ML (MLflow) via APIM & CLIs.
azure azureml bert-embeddings docker fastapi github-actions huggingface machine-learning mlflow mlops optuna python restaurant-menu scikit-learn scrapy tensorflow transformers uber-eats web-crawler
Last synced: 03 Feb 2026
https://github.com/rachelnk/simple-machine-learning-model
A simple machine learning model built using scikit-learn library.
data-science linear-regression machine-learning-algorithms machine-learning-models matplotlib numpy pandas python random-forest random-forest-regressor scikit-learn
Last synced: 11 Apr 2026
https://github.com/sshBuilder/Movie-recommendation-system
The primary goal of this project is to provide personalized movie recommendations to users based on their preferences and the characteristics of the movies. This is achieved through a multi-step process involving data preprocessing, text vectorization, and recommendation generation.
anaconda-environment data-science jupyter-notebook machine-learning movie-recommendation movies pandas python3 recommendation-system recommender-system scikit-learn scikitlearn-machine-learning
Last synced: 28 Apr 2025
https://github.com/rakibhhridoy/differentprojects
Some of the learning projects I practiced on to get started in data science. Not all of them, just a few that were stored in my local repository. They may be useful for beginner data science enthusiasts. Explore and learn!
data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics
Last synced: 11 Apr 2026
https://github.com/alejandrolara11/machinelearningcourse
Machine Learning Basics: From Setup to Clustering
data-analysis data-science machine-learning numpy pandas plotly preprocessing-data python scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/amnydv17/landmark-detection
This project aims to leverage the power of deep learning models to automatically detect and pinpoint landmarks such as famous monuments, buildings, natural landmarks, and other recognizable structures within images.
machine-learning matplotlib numpy pandas python3 scikit-learn seaborn tensorflow
Last synced: 11 Apr 2026
https://github.com/dllllb/ds-pipeline
Data Science model pipeline based on SciKit-Learn Estimator API
data-science machine-learning python scikit-learn
Last synced: 16 Apr 2026
https://github.com/amiriiw/text_classification
Welcome to the Text Classification Project! This project is designed to train a model for classifying texts based on their emotional content and then using it to categorize new texts into corresponding emotional categories.
keras numpy pandas pickle scikit-learn tensorflow text-classification
Last synced: 20 Jan 2026
https://github.com/samuele-lolli/data-analytics-techniques
A practical approach to data analytics pipeline.
numpy pandas pytorch scikit-learn
Last synced: 11 Apr 2026
https://github.com/tjpavii0809/classification-regression-models
Simple Predictive Model of Machine Learning
data-science numpy pandas scikit-learn supervised-learning
Last synced: 20 Jan 2026
https://github.com/dinuka-rp/python-machine-learning
This repository contains the projects that I followed to learn Machine Learning with Python
machine-learning python scikit-learn
Last synced: 11 Apr 2026
https://github.com/vyjayanthipolapragada/fraud_detection_creditcard
Detecting fraudulent credit card transactions by training a Decision Tree model using Scikit-learn and SnapML
classification-model data-preprocessing decision-tree-classifier kaggle-dataset machine-learning numpy pandas python scikit-learn snapml time tree-model
Last synced: 11 Apr 2026
https://github.com/divs-spec/skysync
SkySyncSwarm is a unified drone swarm simulation and control platform that merges the best of UAV simulators, swarm coordination libraries, deep learning models, and autonomous mission planning systems into one cohesive project.
ai-agents flask matlab python3 rrt scikit-learn scipy tcp
Last synced: 11 Apr 2026
https://github.com/infinitode/scikit-learn-decisiontreeclassifier-updater
An open-source tool to convert older Scikit-learn DecisionTreeClassifier models to the newer version.
ai classifier cli converter decisiontree python scikit-learn sklearn tools
Last synced: 31 Mar 2025
https://github.com/dharma-acha/resnet18_imageclassification_cnn
In this part of the project, we implement ResNet-18 from scratch using PyTorch and train it on an image dataset to achieve over 75% accuracy. We apply techniques to prevent overfitting and optimize performance, aiming for an accuracy of 80% or higher.
matplotlib numpy python3 pytorch scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/aicorsair/python-case-study-365-data-science-subscription-purchase-prediction
This repository contains a comprehensive case study on predicting 365 Data Science customer subscriptions using real-world student engagement data.
data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization decision-tree feature-engineering feature-selection hyperparameter-optimization hyperparameter-tuning k-nearest-neighbors logistic-regression machine-learning purchase-prediction python random-forest scikit-learn statsmodels svc
Last synced: 08 May 2026
https://github.com/mborrillo/ranking-ciudades-espana
End-to-end multi-criteria analysis system that evaluates the quality of life in 50 Spanish cities using official data
business-intelligence data-analysis multi-criteria-decision-analysis pandas python3 quality-of-life ranking-system scikit-learn scoring-models
Last synced: 13 Jan 2026
https://github.com/agnivchtj/us-census-classifier
Find the optimal classification algorithm that can predict salaries above $50k, based on US Census data.
census-data decision-tree-classifier jupyter-notebooks knn-classifier logistic-regression naive-bayes-classifier python scikit-learn svm-classifier
Last synced: 07 May 2026
https://github.com/snghrsw/kikagaku-ml-learning
Simple and multiple regression analysis in Python, plus regression and classification with deep learning
liner-regestion multiple-regression numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/epomatti/python-machine-learning
Simple examples of ML using Python
machine-learning python scikit-learn
Last synced: 11 Apr 2026
https://github.com/rohan3122k/predicting-energy-consumption-using-ann-with-pca
This project leverages Artificial Neural Networks (ANNs) with Principal Component Analysis (PCA) to predict energy demand efficiently. By reducing dimensionality while retaining 95% variance, the model achieves an R² score of 0.9815 and MAE of 523.71 MW. Deployed via Streamlit & GitHub.
artificial-neural-networks electricity git keras machine-learning pca predictive-modeling python scikit-learn streamlit-webapp tensorflow
Last synced: 10 Apr 2026
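As a rough illustration of the "retain 95% of the variance" step mentioned in the entry above, scikit-learn's PCA accepts a fraction as `n_components`. The sketch below uses synthetic data and hypothetical variable names, is not taken from the repository, and omits the neural network that would consume the reduced features.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Hypothetical data: 10 correlated features driven by 3 latent factors.
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 3))
mixing = rng.normal(size=(3, 10))
X = latent @ mixing + 0.05 * rng.normal(size=(200, 10))

# Standardize first so that no single feature dominates the variance.
X_scaled = StandardScaler().fit_transform(X)

# A float in (0, 1) asks PCA to keep the smallest number of components
# whose cumulative explained variance reaches that fraction.
pca = PCA(n_components=0.95, svd_solver="full")
X_reduced = pca.fit_transform(X_scaled)

print(X.shape[1], "->", pca.n_components_, "features")
print("variance retained:", pca.explained_variance_ratio_.sum())
```

The reduced matrix `X_reduced` would then be passed to the downstream model in place of the original features.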
https://github.com/joemathew2004/cancer_prediction
This project implements a machine learning model (Logistic Regression) trained on the Breast Cancer dataset to predict if a tumor is benign or malignant. It includes a Python script for training the model, a terminal-based prediction tool, and a web application built with Streamlit for interactive predictions.
cancer-prediction classification csv joblib logistic-regression machine-learning python scikit-learn streamlit web-application
Last synced: 07 May 2026
https://github.com/nirmaldeepponnada/codeclauseinternshipproject1
This project involves Customer Segmentation using K-Means clustering to group customers based on Recency, Frequency, and Monetary (RFM) analysis from the Online Retail dataset. It also performs Sentiment Analysis on Amazon Product Reviews using Natural Language Processing techniques & Logistic Regression to classify reviews as positive or negative.
kmeans logistic-regression numpy pandas python3 regular-expressions scikit-learn tf-idf-vectorizer
Last synced: 11 Apr 2026
https://github.com/richardbmk/datascience_machinelearning
Projects related to data science and machine learning.
data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn
Last synced: 11 Apr 2026
https://github.com/agnivchtj/ann
Develop an Artificial Neural Network that can classify inputs based on a number of features
backpropagation-algorithm jupyter-notebooks python scikit-learn
Last synced: 07 May 2026
https://github.com/hariprasath-v/av-dataverse-hack---insurance-claim-prediction
Create a machine learning model to predict if the policyholder will file a claim in the next 6 months or not based on the set of car and policy features.
analyticsvidhya classification exploratory-data-analysis f1-score matplotlib numpy pandas python randomforest-classification scikit-learn seaborn shap
Last synced: 11 Apr 2026
https://github.com/lorenzorottigni/dl-lending-club
Deep Learning python bootcamp: deep learning on Lending Club dataset
deep-learning ipynb keras machine-learning numpy pandas python scikit-learn seaborn tensorflow
Last synced: 11 Apr 2026
https://github.com/pb319/california_house-price-prediction
This is my first end-to-end ML project implementation, covering all required stages, with guidance from the book "Hands On Machine Learning".
evaluation-metrics hyperparameter-tuning jupyter-notebook kfold-cross-validation machine-learning matplotlib numpy pandas python scikit-learn seaborn train-test-split
Last synced: 11 Apr 2026
https://github.com/atharvapathak/size_estimator_project
This project in Python aims to provide a tool for estimating the size of objects in images or videos. Using computer vision techniques, the project analyzes the input media, detects objects of interest, and provides an estimation of their size based on known reference points or objects.
cicd cnn opencv python pytorch rnn scikit-learn sql tensorflow
Last synced: 11 Apr 2026
https://github.com/nicolas-giacomelli/modelo-previsao-colesterol-com-gradio
Cholesterol prediction model based on information. The model includes pipelines for data processing. Made available via Gradio.
gradio machine-learning matplotlib pandas pingouin python3 saude scikit-learn scipy seaborn
Last synced: 11 Apr 2026
https://github.com/devinw03/movie-genre-nlp
🎬 Classify movie genres from plot summaries using various models, including Transformers, with clear EDA and MLflow tracking for reproducible results.
cosine-similarity countvectorizer datascience distilroberta gru huggingface imdb machine-learning multi-label-classification nlp python pytorch recommendation-engine scikit-learn slack tabulate text-classification word2vec
Last synced: 11 Apr 2026
https://github.com/gayathri2200/car-price-prediction---machine-learning
Car price prediction with machine learning: predicts the price of used cars based on their features.
data-science machine-learning modeldeployment pandas price-prediction python regression scikit-learn streamlit visual-studio visualization
Last synced: 11 Apr 2026
https://github.com/ritwika-das-gupta/flight_price_prediction
Flight price prediction using a decision tree model and machine learning concepts
cross-validation data-science data-visualization desiciontree flask gridsearchcv html-css-javascript hyperparameter-tuning json machine-learning matplotlib numpy pandas pickle python scikit-learn sklearn statistics supervised-learning
Last synced: 11 Apr 2026
https://github.com/ejw-data/ml-classification-exoplanet
Classification of planets identified by Kepler telescope using multiple models and tuning using GridSearchCV
classification python scikit-learn
Last synced: 09 May 2026
https://github.com/ejw-data/ml-clustering-personality
Analysis of the Big Five personality test survey results with clustering techniques.
clustering machine-learning python scikit-learn unsupervised-learning
Last synced: 04 May 2026
https://github.com/ejw-data/ml-classification-grants
Compares several machine learning classification models including a neural network to determine whether to approve or reject a grant applicant
classification neural-network python scikit-learn
Last synced: 10 May 2026
https://github.com/jorgevelezgarcia/ibm-data-science-professional-certificate-jorgevelez
IBM Data Science Professional Certificate
analysis data-science databases jupyter-notebook machine-learning matplotlib pandas plotly python scikit-learn scipy seaborn sql visualization
Last synced: 11 Apr 2026
https://github.com/kaguya163/marketing_campaigns
Analysis of marketing effectiveness in sports retail.
ab-testing machine-learning matplotlib numpy pandas python scikit-learn scipy sqlite3
Last synced: 11 Apr 2026
https://github.com/nauxqouh/python-for-data-science-labs
This repo contains weekly practical codes for my Python for Data Science course at university.
data-science jupyter-notebook numpy pandas python pytorch scikit-learn
Last synced: 11 Apr 2026
https://github.com/dyarleniber/hands-on-machine-learning
This repository contains code examples, exercises, and projects related to the concepts covered in the book "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition" by Aurélien Géron.
ai artificial-intelligence keras machine-learning matpolotlib numpy pandas scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/sudarsann27/basic_machine_learning_algorithms
Basic Machine learning algorithms using scikit-learn and other fundamental libraries
data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning
Last synced: 20 Jan 2026
https://github.com/stella4444/linear-regression
learning about linear regression (currently a work in progress) ~ working with data
linear-regression machine-learning numpy scikit-learn
Last synced: 20 Jan 2026
https://github.com/abdiasarsene/healthpredict-api-smart-medical-diagnosis-system
This project provides an intelligent API built with FastAPI for predicting diseases from patients' medical data. The application relies on a machine learning model (Logistic Regression) managed via MLflow and can easily be deployed with Docker.
bentoml docker-compose dockerfiles jenkinsfiles mlflow pandas ray-serve scikit-learn taskfile
Last synced: 11 Apr 2026
https://github.com/vickshan001/tweet-sentiment-classifier-nlp-svm-project
NLP coursework project using SVM to classify tweet sentiments. Features custom preprocessing, error analysis, and cross-validation.
natural-language-processing nlp python scikit-learn sentiment-analysis svm text-classification tweets
Last synced: 31 Mar 2025
https://github.com/adi3042/sensor-fault-detection
🔍⚙️ Ensure Reliable Operations! Detect anomalies and prevent disruptions with our Sensor Fault Detection system. Explore advanced classification and regression techniques to identify and address sensor faults effectively. Your path to robust and accurate sensor data begins here! 🚨🔧 SensorFaultTech
classification css datetime fault-detection flask functools html ipykernel jupternotebook machine-learning numpy pandas python3 readme regression scikit-learn sensor setuptools venv
Last synced: 11 Apr 2026
https://github.com/kudzaiprichard/passport-verification-api
Passport document verification using machine learning with Python and sklearn
flask flask-sqlalchemy image-forgery-detection image-processing image-recognition juypter-notebook machine-learning python rest-api scikit-learn sklearn sql sqlite
Last synced: 11 Apr 2026
https://github.com/lorenzorottigni/ml-spam-filter
Machine Learning python bootcamp: natural language processing with spam filter purposes
machine-learning npl numpy pandas python scikit-learn seaborn spam-filtering
Last synced: 11 Apr 2026
https://github.com/csengupta1101/housing-price-prediction
This project predicts house prices based on the given features. Everything from data extraction to deployment is detailed here.
jupyter-notebook matplotlib numpy pandas python python3 scikit-learn vscode
Last synced: 11 Apr 2026
https://github.com/cmdecker95/learning-ml
My first foray into machine learning, comprised of my early undergrad work (circa 2021-2022) 🤓
jupyter-notebook keras machine-learning scikit-learn tensorflow2
Last synced: 08 Apr 2025
https://github.com/prakashjha1/sensor-based-human-intervention-detection
Objective: Develop an on-device machine-learning model for Android that classifies sensor sequences into four scenarios: walking, handheld still, on-desk with interaction, and on-desk idle.
classification feature-engineering feature-extraction lightgbm machine-learning-algorithms matplotlib neural-networks pandas python3 scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/christram/covid-prediction-ml
covid-19 machine-learning pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/tayyabmughal676/machinelearningusingscikitlearn
Machine learning using scikit-learn
machine-learning machine-learning-algorithms machine-learning-library machinelearning-python python scikit-learn scikit-learn-python scikitlearn-machine-learning
Last synced: 13 Apr 2026
https://github.com/frauvate/cheatsheet
This project provides ready-to-use templates for machine learning base models. This repo, which contains templates of common models in Jupyter notebook format, aims to help users quickly implement these models and easily adapt them to their own projects.
machine-learning machinelearning matplotlib numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/karimosman89/customer-churn-prediction
Build a model to predict which customers are likely to stop using a service or product. This can help companies take proactive measures to retain customers. Provide actionable insights and retention strategies based on the model’s predictions.
matplotlib-pyplot pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/saurabhjalendra/hrf-retinal-edge-retrieval
🔬 Content-Based Image Retrieval system for retinal fundus images using HRF dataset. Implements handcrafted feature engineering (HOG, LBP, Edge Detection, Gabor, GLCM) with ML models for medical image classification and analysis.
computer-vision content-based-image-retrieval edge-detection feature-engineering gabor-filters hog-features jupyter-notebook machine-learning medical-imaging opencv retinal-analysis scikit-image scikit-learn
Last synced: 12 May 2026
https://github.com/aadrianleo/book-recommendation-system
Book Recommender System using the Book-Crossing dataset. Compares content-based (TF-IDF + cosine similarity) and collaborative filtering (SVD) methods for book recommendations. Includes data cleaning, EDA, and model evaluation (Precision@5, RMSE) in Python.
artificial-intelligence collaborative-filtering content-based-filtering cosine-similarity data-preprocessing jupyter jupyter-notebook machine-learning matrix-factorization numpy pandas predictive-modeling python3 recommendation-system recommender-system scikit-learn sckit-surprise
Last synced: 10 Apr 2026
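The content-based approach named in the entry above (TF-IDF plus cosine similarity) can be sketched in a few lines of scikit-learn. The three book blurbs below are invented for illustration and do not come from the Book-Crossing dataset, and the repository's own pipeline may differ.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical book descriptions.
docs = [
    "a wizard school adventure with magic and dragons",
    "a young wizard learns magic at a hidden school",
    "a detective investigates a murder in london",
]

# Turn each description into a TF-IDF weighted term vector.
tfidf = TfidfVectorizer().fit_transform(docs)

# Pairwise cosine similarity between all descriptions.
sim = cosine_similarity(tfidf)

# Recommend the description most similar to book 0, skipping book 0 itself.
ranked = np.argsort(-sim[0])
best = [int(i) for i in ranked if i != 0][0]
print("most similar to book 0:", best)
```

With these inputs, books 0 and 1 share several terms while books 0 and 2 share none, so book 1 is ranked first.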
https://github.com/nickklos10/league-of-legends-match-predictor
Predicting League of Legends Match Outcomes Using Logistic Regression and PyTorch.
jupyter-notebook matplotlib numpy pandas pytorch scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/yahiazakaria445/sentiment-analysis-using-lstm
A Sentiment analysis NLP project using LSTM model
matplotlib nltk numpy pandas scikit-learn seaborn tensorflow
Last synced: 11 Apr 2026
https://github.com/alaazameldev/text-based-search-engine
Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval
chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer
Last synced: 20 Jan 2026
https://github.com/dmarks84/coursework_project_ml-classification
Project for IBM Data Science course on Machine Learning -- Trained ML models for classification, evaluating based on a variety of metrics
classification communication data-modeling dataframes numpy pandas python scikit-learn supervised-ml
Last synced: 11 Apr 2026
https://github.com/ojhaharsh/rockvsmine-prediction
Streamlit App for Predicting Rock vs. Mine using SONAR Data. The app allows users to input SONAR readings and returns predictions based on trained data.
machine-learning python scikit-learn
Last synced: 03 Jan 2026
https://github.com/sdpdas/sm_sentiment_analysis
Sentiment classification using Natural Language Processing (NLP) with pandas, NumPy, scikit-learn, and NLTK. Logistic regression, a supervised model, is used as the classifier. The pickle library is used to save the model so it can be run anywhere.
logistic-regression machine-learning nlp scikit-learn sentiment-analysis stemming vectorizer
Last synced: 03 Jan 2026
https://github.com/emms21/thoughtful_ai
Technical Screening: an agent that responds to semantically similar questions with pre-defined answers, falling back to generic LLM responses otherwise
openai python scikit-learn streamlit
Last synced: 02 Jan 2026
https://github.com/tszon/data-science-projects
Includes all the noteworthy Data Science projects from my learning journey with DataCamp.
data-analysis data-science exploratory-data-analysis feature-engineering machine-learning modelling preprocessing-data scikit-learn supervised-learning
Last synced: 15 Mar 2025
https://github.com/thbaylson/datascience
All of my past data science assignments collected in a single notebook. Most of this comes from my Machine Learning course.
data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn
Last synced: 09 May 2026
https://github.com/shahzaibdev355/house-recommendation-backend
House Recommendation Backend is a Flask-based API that provides house recommendations based on user input. It includes endpoints to check the backend's functionality, fetch state data, and get personalized house recommendations based on criteria like price, property size, number of beds, number of baths, and state.
flask flask-api flask-cors numpy pandas scikit-learn tensorflow
Last synced: 05 Jan 2026
https://github.com/diegogallegos4/boston_housing
Udacity Machine Learning Nanodegree Project
machine-learning python scikit-learn
Last synced: 11 Apr 2026
https://github.com/maheshsunuwar/diabetes_mlops
coolify docker dvc github-actions minio mlflow mlops python s3-bucket scikit-learn streamlit
Last synced: 10 Jun 2025