scikit-learn
scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2026-06-23 00:27:46 UTC
https://github.com/ravi-aratchige/spacefarer
What type of astronaut would you be?
astronaut astronauts flask jupyter jupyter-notebook machine-learning numpy pandas python scikit-learn space tailwind tailwindcss
Last synced: 10 Apr 2026
https://github.com/sayamalt/airline-passenger-satisfaction-classification
Successfully developed a machine learning model to predict Airline Passenger Satisfaction by building an end-to-end MLOps pipeline. It integrates DVC for data versioning, a Dockerfile for containerization, and CI/CD using GitHub Actions for automated deployment.
azure-web-app-service ci-cd-pipeline classification docker-container dvc-pipeline experiment-tracking exploratory-data-analysis feature-engineering github-actions hyperparameter-tuning machine-learning mlflow mlflow-tracking mlops-workflow model-registry model-training-and-evaluation model-versioning optuna scikit-learn
Last synced: 16 Apr 2026
https://github.com/mabjq/fastapi_iris_ml_project
Simple FastAPI ML application project
fastapi jinja2-templates macine-learning scikit-learn
Last synced: 30 Mar 2025
https://github.com/keremm26/pca-analysis
Principal Component Analysis and Clustering on Survey Data using Python and scikit-learn
k-means-clustering machine-learning pca-analysis scikit-learn unsupervised-learning
Last synced: 04 May 2026
https://github.com/vickshan001/diabetes-prediction-using-machine-learning-ci512-project-
Machine learning project (2021) predicting diabetes using the Pima Indians dataset. Compared KNN, Decision Tree, MLP, and more for accuracy.
classification diabetes-prediction machine-learning mlp pima-indians-dataset python scikit-learn
Last synced: 10 May 2026
https://github.com/dd-se/ml-app
Predict unseen numbers with ML models trained on MNIST dataset.
opencv python scikit-learn streamlit
Last synced: 30 Mar 2025
https://github.com/akashprak/socialnetworkads
Predicting customer purchase behavior from the Social Network Ads dataset.
data-analysis machine-learning mlflow pandas python scikit-learn seaborn xgboost
Last synced: 30 Mar 2025
https://github.com/igormteixeira/tcc-deteccao-ataques-dos
This repository contains the source code and the scientific article for my undergraduate thesis (TCC) on detecting DoS attacks using machine learning. The goal of the work is to analyze and implement a machine learning model trained on a public dataset to identify real malicious activity in computer systems.
cybersecurity dos-attack machine-learning python scikit-learn tcc
Last synced: 03 May 2026
https://github.com/lorenzorottigni/ml-iris-svm
Machine Learning python bootcamp: Support Vector Machines on iris flower dataset
ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines
Last synced: 10 Apr 2026
https://github.com/oneapi-src/customer-churn-prediction
AI Starter Kit for customer churn prediction using Intel® Extension for Scikit-learn*
Last synced: 04 Apr 2025
https://github.com/sanggusti/mentoring-skilvul-sic
A repository for teaching and mentoring as instructor of Skilvul Samsung Innovation Campus 2024
computer-vision flask machine-learning pymongo scikit-learn sql
Last synced: 20 Jan 2026
https://github.com/sarincr/training-on-artificial-intelligence
Entree Academy's free 10-day training on Artificial Intelligence. The course is delivered in a blended format, with a daily one-hour online session and three hours of project-based training.
artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn
Last synced: 10 Apr 2026
https://github.com/adi3042/credit-card-fault-detection
🔍💳 Secure Your Finances! Detect anomalies and safeguard transactions with our Credit Card Fault Detection system. Dive into cutting-edge classification techniques to identify fraud and protect financial data. Your journey to secure payments starts here! 🚨🔒 FraudDetectionTech
classification credit-card css datetime fault-detection flask functools html ipykernel jupyternotebooks machine-learning numpy pandas python3 readme scikit-learn setuptools venv
Last synced: 03 Apr 2026
https://github.com/korpog/br_cancer
Binary classifier for Breast Cancer Wisconsin Data Set created with scikit-learn and xgboost.
classification data-science machine-learning pandas python scikit-learn xgboost
Last synced: 10 Apr 2026
https://github.com/pramodyasahan/car-safe-predictor
This repository contains a machine learning project that applies the K-Nearest Neighbors (KNN) classification algorithm to predict car safety ratings. The project uses a dataset of cars, with features such as buying price, maintenance cost, number of doors, persons, lug boot size, and safety.
classification k-nearest-neighbours machine-learning numpy pandas scikit-learn
Last synced: 10 Apr 2026
https://github.com/anav5704/honeywell-aog-zero
Data-driven, proactive maintenance scheduling system for APUs
docker fastapi nextjs postgresql scikit-learn
Last synced: 20 Jan 2026
https://github.com/alyssonmach/machine-learning-com-python
Machine learning applications using the Python programming language.
ia keras-tensorflow machine-learning matplotlib numpy pandas programming python scikit-learn scipy
Last synced: 10 Apr 2026
https://github.com/devash2/ayur-scan
Indian Medicinal Leaf detection application using ML and DL
flask flutter google-firebase opencv python scikit-learn tensorflow
Last synced: 10 Apr 2026
https://github.com/lorenzorottigni/ml-ecommerce
Machine Learning python bootcamp: linear regression on ecommerce dataset
ipynb linear-regression machine-learning numpy pandas python scikit-learn seaborn
Last synced: 07 Apr 2026
https://github.com/lorenzorottigni/dl-tensorboard
Deep Learning python bootcamp: tensorboard with cancer dataset
deep-learning ipynb machine-learning python scikit-learn tensorboard tensorflow
Last synced: 05 May 2026
https://github.com/sahilk12nayak/hyperspectral-corn-don-prediction-project
This project contains a machine learning pipeline for predicting DON (vomitoxin) concentration in corn samples using hyperspectral imaging data.
matplotlib numpy pandas python scikit-learn seaborn tensorflow
Last synced: 10 Apr 2026
https://github.com/dipto9999/ml_introduction
An introduction to machine learning, primarily using Python's scikit-learn library.
data-science decision-trees jupyter-notebook k-means-clustering k-nearest-neighbors linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib numpy pandas principal-component-analysis python random-forest scikit-learn seaborn support-vector-machines
Last synced: 10 Apr 2026
https://github.com/1egoman/machine-deep-learning-notes
Notes from my exploration of machine learning and deep learning
deep-learning machine-learning neural-networks notes python scikit-learn tensorflow udacity
Last synced: 10 May 2026
https://github.com/arasoul/face-recognition-streamlit
🎯 Neural Face Recognition Matrix - Professional AI-powered biometric identification system with real-time face detection, recognition, and cyberpunk-styled interfaces. Features both web (Streamlit) and desktop (Tkinter) applications with comprehensive training pipeline, Docker deployment, and CI/CD automation.
ai bioinformatics computer-vision deep-learning face-recognition facenet gui machine-learning mtcnn neural-network open-source opencv pytorch real-time scikit-learn streamlit svm
Last synced: 02 Apr 2026
https://github.com/ksasi/customer_segments
machine-learning numpy pandas python scikit-learn
Last synced: 10 Apr 2026
https://github.com/adityakumarda/kmeans-web-analytics
Built with Python, Pandas, and Scikit-learn, this machine learning project uses K-Means to cluster website users by behavior. It reveals patterns in engagement and bounce, helping drive data-informed decisions.
cluster-analysis elbow-curves elbow-method elbow-plot jupyter-notebook kmeans-clustering machine-learning matplotlib numpy pandas python python3 relationship scikit-learn seaborn sklearn
Last synced: 10 Apr 2026
https://github.com/dineshdhamodharan24/amazon-reviews-sentiment-analysis
This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.
matplotlib numpy pandas python scikit-learn
Last synced: 10 Apr 2026
https://github.com/sunnyrao07/stroke-risk-prediction
Predicting stroke risk using machine learning models based on healthcare and demographic data.
data-cleaning data-visualization decision-trees feature-engineering label-encoding matplotlib model-evaluation numpy outlier-detection pandas python random-forest scikit-learn seaborn standard-scaler
Last synced: 10 Apr 2026
https://github.com/jelhamm/principle-component-analysis-data-mining
This repository contains an implementation of the Principal Component Analysis (PCA) algorithm, which is one of the key techniques used for dimensionality reduction in data mining and machine learning.
data-mining data-science jupyter-notebook machine-learning machine-learning-algorithms pca principal-component-analysis python pytorch scikit-learn scipy-library tensorflow
Last synced: 10 Apr 2026
https://github.com/sisolieri/prova_ds_saloocupacio2024
Admission challenge for the Hackató Saló Ocupació by Barcelona Activa
arima barcelona catboost data-analysis data-visualizations forecasting machine-learning pandas public-funding python scikit-learn time-series xgboost
Last synced: 10 Apr 2026
https://github.com/broodhoney/heart-disease-prediction
This is a machine learning project which has a trained model that classifies whether a patient has a heart-disease or not.
kaggle-dataset matplotlib numpy pandas python scikit-learn scikitlearn-machine-learning uci
Last synced: 10 Apr 2026
https://github.com/thiagohrcosta/movieapp-ml
The Movie APP is a project created to apply some of the concepts learned throughout the post-graduation degree at XP Educação in Artificial Intelligence with an emphasis on Machine Learning. While this project is not integrated into the curriculum of the course, some of the concepts used were learned during the program.
docker flask-api machine-learning mysql-database postgresql python scikit-learn
Last synced: 10 Apr 2026
https://github.com/ameykasbe/credit-card-fraud-detection-on-imbalanced-dataset
Examined data preprocessing techniques and the performance of six predictive models in Python on a credit card fraud detection problem with an imbalanced dataset. Algorithms implemented: Logistic Regression, K Nearest Neighbours, Support Vector Classification, Naïve Bayes Classifier, Decision Tree Classifier, and Random Forest Classifier.
classification machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 10 Apr 2026
https://github.com/vijaykumarr1452/startup_success_predictor
This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on their investment in R&D, Administration, and Marketing, using the 50_Startups.csv dataset.
machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn
Last synced: 10 Apr 2026
https://github.com/jol79/python_exercises
Solving interesting python exercises on different topics
matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn
Last synced: 10 Apr 2026
https://github.com/snigdho8869/clustering-projects
Repository for various clustering projects including mall customer segmentation and more. Explore data analysis and clustering techniques
cluster-analysis clustering clustering-algorithm customer-segmentation deep-learning gaussian-mixture-models hierarchical-clustering hierarchical-models kmeans-algorithm kmeans-clustering machine-learning mall-customer-segmentation mall-customers numpy pandas scikit-learn segmentation spectral-clustering tensorflow
Last synced: 10 Apr 2026
https://github.com/gamowy/music-classification
Music genre classification using a k-nearest neighbors classifier on the GTZAN dataset
machinelearning python scikit-learn university-assignment
Last synced: 10 Apr 2026
https://github.com/filsan-musa/project-iot_malware_identification
This repository contains the code and data for a project that detects malware from IoT devices using a publish-subscribe model with Confluent and Databricks. The project streams IoT device data to Kafka, analyzes it, and detects malware using machine learning models such as Random Forest and Gradient Boosted Trees.
apache-kafka classification confluent databricks machine-learning-algorithms scikit-learn sql
Last synced: 31 Aug 2025
https://github.com/arssite/dirty-cleanflooringimageprocessingusingyolov5
Uses YOLOv5 to classify floor cleanliness into five categories based on visual cues. It includes an annotated dataset, a trained model, and evaluation outputs. The code covers data preprocessing, training, and testing. A comparative analysis highlights YOLOv5's advantages over traditional methods, providing an efficient solution for automated floor cleanliness classification.
deep-neural-networks github google-colab jupyter-notebook labelimg matplotlib-pyplot numpy-library opencv-python pandas-python pytorch scikit-learn tensorflow yolov5
Last synced: 10 Apr 2026
https://github.com/annasmustafadev/network-intrusion-detection-ml
Machine learning-based Intrusion Detection System (IDS) for classifying network traffic as normal or malicious using supervised learning techniques. Includes data preprocessing, feature selection, model training, and evaluation for improved cybersecurity intelligence.
anomaly-detection classification cyber-security data-science intrusion-detection machine-learning python scikit-learn supervised-learning
Last synced: 29 Apr 2026
https://github.com/chirindaopensource/measuring_economic_outlook_in_news
End-to-End Python implementation of Beck et al.'s (2025) economic sentiment analysis framework for constructing a high-frequency economic sentiment indicator using 1024-dimensional Jina embeddings and LLM-generated training data. Features L2-regularized classification and rigorous POOS econometric validation with DM-HAC tests for GDP forecasting.
claude-ai computational-economics econometrics financial-modeling jina-embeddings llm nlp privacy-preserving-ml python regularized-regression reproducible-research scikit-learn sentiment-analysis statsmodels synthetic-data tensorflow time-series-forecasting transformers weak-supervision
Last synced: 30 Apr 2026
https://github.com/anastasius21/fakenewsmodel
The repo contains the model for fake news detection and a streamlit app for its implementation.
fake-news-detection machine-learning nlp pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/tomgorb/ds-utils
pre-processing of a DataFrame into a sparse matrix for model input
machine-learning preprocessing scikit-learn
Last synced: 16 May 2026
https://github.com/jay4codes/time-series-comparative-analysis
A comparative analysis of various time series models on JP Morgan's stock price data for stock price predictive analysis
scikit-learn tensorflow time-series-analysis
Last synced: 10 May 2026
https://github.com/miteshgupta07/zomato-restaurant-rating-predictor
A Zomato rating prediction app that uses machine learning to forecast restaurant ratings based on various factors, helping users make informed dining decisions.
flask machine-learning python scikit-learn
Last synced: 10 Apr 2026
https://github.com/sakshi2215/email-sms--spam_classifier
nltk-python scikit-learn streamlit-webapp
Last synced: 17 May 2026
https://github.com/gozsari/ml-oneday-course
This is a one-day machine learning introductory course for beginners
anomaly-detection classification clustering course dimensionality-reduction machine-learning machine-learning-algorithms ml-course ml-workflow regression scikit-learn supervised-learning unsupervised-learning
Last synced: 20 Jan 2026
https://github.com/tsungtsetu122/datamining-cifar10-classification
Data mining project on CIFAR-10 extracted features, applying preprocessing, classification models, and evaluation techniques to improve classification performance.
matplotlib numpy pandas python scikit-learn
Last synced: 10 Apr 2026
https://github.com/parthapray/nlp_pipeline_openai
This repo contains nlp pipeline and openai API integration
gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud
Last synced: 10 Apr 2026
https://github.com/j-sephb-lt-n/unsupervised-fraud-detection
Exploring anomaly detection using unsupervised methods in scikit-learn
anomaly anomaly-detection fraud-detection isolation-forest local-outlier-factor outlier outlier-det scikit-learn sklearn unsupervised
Last synced: 07 May 2026
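For readers unfamiliar with the two detectors named in the tags above, the following is a minimal sketch of unsupervised anomaly detection with scikit-learn's IsolationForest and LocalOutlierFactor. It is not taken from the repository: the synthetic data, the contamination value of 0.05, and all variable names are assumptions chosen for illustration.

```python
import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.neighbors import LocalOutlierFactor

# Synthetic data (an assumption for illustration): a dense Gaussian blob
# plus a handful of points scattered over a much wider area.
rng = np.random.default_rng(42)
inliers = rng.normal(0.0, 1.0, size=(200, 2))
scattered = rng.uniform(-6.0, 6.0, size=(10, 2))
X = np.vstack([inliers, scattered])

# Both estimators label each sample +1 (inlier) or -1 (outlier).
# contamination=0.05 is a guess at the outlier share, not a tuned value.
iso = IsolationForest(contamination=0.05, random_state=0)
iso_labels = iso.fit_predict(X)

lof = LocalOutlierFactor(n_neighbors=20, contamination=0.05)
lof_labels = lof.fit_predict(X)

print("IsolationForest flagged:", int((iso_labels == -1).sum()))
print("LocalOutlierFactor flagged:", int((lof_labels == -1).sum()))
```

Neither detector sees any labels; the contamination parameter only sets how many of the most anomalous samples are flagged, so in practice it would need to be chosen from domain knowledge.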
https://github.com/anras5/songs-classifier
Classify songs' genres with Machine Learning
data-science docker machine-learning mlflow pandas python scikit-learn seaborn streamlit tpot
Last synced: 10 Apr 2026
https://github.com/prasadhiremath1/movie-recommender-system
Select a movie and 5 similar movies are recommended from the tmdb dataset
pandas python3 scikit-learn streamlit
Last synced: 11 Apr 2026
https://github.com/crispengari/ml-web-applications
✔ This repository contains a series of machine learning web applications, using python.
artificial-intelligence deeplearning flask javascript machinelearning nueral-networks python scikit-learn sentiment-analysis webapplication
Last synced: 11 Apr 2026
https://github.com/mnitin-reddy/collaborative-filtering-based-recommendation-system
This project is a Book Recommendation System that uses two main approaches: Popularity-Based and Collaborative Filtering. It recommends top books based on their rating frequency and average ratings, and also provides personalized book suggestions by analyzing user interactions.
collaborative-filtering numpy pandas popularity-based-recommendation python recommendation-system scikit-learn
Last synced: 11 Apr 2026
https://github.com/toluwalase-taiwo/ml-zoomcamp
Machine learning repository
algorithms jupyter-notebook machine-learning matplotlib numpy pandas python3 scikit-learn
Last synced: 11 Apr 2026
https://github.com/simon2k/ieee-cis-fraud-detection
Can you detect fraud from customer transactions?
customer-transactions fraud-detection fraud-relation machine-learning numpy pandas scikit-learn
Last synced: 11 Apr 2026
https://github.com/akshaya13/recommendation-system
Content Based Recommendation system using tags!
nltk scikit-learn similarity-search tmdb-database
Last synced: 18 May 2026
https://github.com/domingosdeeulariadumba/imdb250_stars
IMDb 250 analysis (Stars vs Gross ROI/Rating)
cinema feature-engineering powerbi regression-models rfe scikit-learn ssms statsmodels
Last synced: 18 Jan 2026
https://github.com/gamowy/urbansounds-classification
Classification of urban sounds using Tensorflow Keras
keras machine-learning python scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/eco786786/spotify-playlist-generator
This project uses machine learning to cluster songs by features like tempo, genre and mood with K-Means. It then creates personalised Spotify playlists based on these clusters, providing dynamic, genre specific track collections. Integrating the Spotify API, it enables users to explore new music within custom groupings.
flask matplotlib pandas python3 scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/archie-cm/churn-analysis-for-bank-customer
The objectives of this project are to predict customer churn and provide recommendations to the business team
feature-engineering machine-learning python scikit-learn
Last synced: 11 Apr 2026
https://github.com/zuhairzia/titanic-survival-project
This is a Titanic Survival Prediction Model developed using Python, Pandas, Scikit-learn, and Jupyter Notebook. The model predicts whether a passenger survived the Titanic disaster based on features such as age, gender, and passenger class.
csv-dataset flask jupyter-notebook matplotlib numpy pandas pandas-library python scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/kishanlalchoudhary/te-sem-6
TE SEM 6 Assignments
cpp data-science dsa-cpp matplotlib nltk numpy pandas python salesforce scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/djdhairya/crop-recommendation
Crop Recommendation System is a powerful tool for enhancing agricultural decision-making. By leveraging data-driven insights, it empowers farmers to maximize yield and ensure sustainable practices.
adaboostclassifier bagging-classifier csv decision-trees gaussian html knn-classification logistic-regression machine-learning machine-learning-algorithms matplotlib model numpy pandas random-forest random-forest-classifier scikit-learn seaborn svc
Last synced: 11 Apr 2026
https://github.com/shreeparab1890/handwritten-digit-recognition
This IPython Notebook uses the MNIST dataset to implement a handwritten digit recognition app with LogisticRegression and SGDClassifier, and compares their accuracy and other metrics.
handwritten-digit-recognition image-classification matplotlib mnist-dataset python scikit-learn sklearn
Last synced: 11 Apr 2026
https://github.com/msikorski93/breast-cancer-classifying
Identifying and assigning breast cancer diagnosis using machine learning methods, based on observations in WDBC dataset. All classifiers have been evaluated and performed well for this task.
breast-cancer classification k-nearest-neighbours keras logistic-regression naive-bayes neural-networks scikit-learn tensorflow
Last synced: 30 Apr 2026
https://github.com/kirtipratihar/python_libraries_for_ds
This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.
artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow
Last synced: 11 Apr 2026
https://github.com/akimuddinshaikh/machine-learning-project
A comparative study of regression models (Decision Tree, Random Forest, Ridge, Lasso, SVM) for predicting real estate prices in King County, NYC, and California using PCA & Pipeline techniques.
machine-learning pca-analysis python regression-models scikit-learn statsmodels
Last synced: 16 May 2026
https://github.com/eljandoubi/deploy-ml
Deploying a ML Model to Cloud Application Platform with FastAPI
ci-cd fastapi github-actions gunicorn pandas pytest render scikit-learn uvicorn
Last synced: 11 Apr 2026
https://github.com/das-amlan/delay-prediction-in-urban-mobility-networks
Predicting delays in urban mobility networks using different ML algorithms.
delay-prediction gradient-boosting machine-learning python r scikit-learn
Last synced: 05 Apr 2026
https://github.com/mr-ndi/tibebai
Machine learning experiments on student performance prediction. Inspired by tibeb (wisdom) in Amharic, this project explores regression models to understand how study factors influence exam scores.
ai data-science education elevvo google-colab internship kaggle linear-regression machine-learning matplotlib pandas polynomial-regression prediction regression scikit-learn student-performance tibebai-wisdom
Last synced: 11 Apr 2026
https://github.com/timothyjan/intro-machine-learning-classifiers
We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.
numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/renato4333/learn-artificial-intelligence
bayesian-inference capsule-network causal-inference convolutional-neural-networks data-structures deep-learning deep-learning-algorithm knn lua matplotlib pandas probabilistic-programming python pytorch question-answering regression-algorithms scikit-learn torch
Last synced: 11 Apr 2026
https://github.com/brej-29/disaster-tweets-nlp-model-benchmarks
Benchmark NLP models on Kaggle “Disaster Tweets”: TF-IDF + Naive Bayes baseline, Keras deep nets (Dense/LSTM/GRU/BiRNN/Conv1D), and TensorFlow Hub Universal Sentence Encoder transfer learning—compared using accuracy, precision, recall, and F1.
bidirectional-rnn cnn conv1d deep-learning disaster-tweets gru kaggle keras lstm machine-learning naive-bayes nlp rnn scikit-learn tensorflow tensorflow-hub text-classification tfidf
Last synced: 11 Apr 2026
https://github.com/priteshramani/movie-recommender
A content-based movie recommendation system using Python, Pandas, and cosine similarity to suggest movies based on their features.
cosine-similarity pandas pickle python scikit-learn streamlit
Last synced: 11 Apr 2026
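The content-based approach described above can be sketched roughly as follows. This is an illustrative assumption, not the repository's code: the four movie titles, their tag strings, and the `recommend` helper are hypothetical, and CountVectorizer is one of several possible ways to turn features into vectors.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical catalogue: title -> space-separated feature tags.
movies = {
    "Space Quest": "space adventure aliens science fiction",
    "Galaxy War": "space battle aliens science fiction",
    "Love in Paris": "romance paris drama love",
    "Paris Nights": "romance paris comedy love",
}
titles = list(movies)

# Turn each tag string into a word-count vector, then compare all pairs.
vectors = CountVectorizer().fit_transform(list(movies.values()))
similarity = cosine_similarity(vectors)


def recommend(title, top_n=1):
    """Return the top_n titles most similar to `title`, excluding itself."""
    idx = titles.index(title)
    ranked = similarity[idx].argsort()[::-1]  # most similar first
    return [titles[i] for i in ranked if i != idx][:top_n]


print(recommend("Space Quest"))
print(recommend("Love in Paris"))
```

Because the similarity matrix is computed once up front, each recommendation is just a sort over one row, which is why this pattern is common in small demo apps.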
https://github.com/abrarshahok/electric-vehicle-charging-station-energy-consumption-prediction
With the rapid adoption of electric vehicles, optimizing energy usage at charging stations has become crucial for improving operational efficiency and ensuring customer satisfaction. This tool leverages predictive modeling to forecast energy consumption for charging sessions based on various input features.
matplotlib numpy pandas plotly python3 scikit-learn xgboost
Last synced: 09 Jun 2026
https://github.com/mrktsm/spam-email-recognizer
Long Short-Term Memory (LSTM) network trained to classify emails as spam or non-spam. It processes email content to make accurate predictions and can be integrated into projects for efficient spam detection and email management.
data-preprocessing keras lstm-neural-network model-architecture nltk numpy pandas performance-evaluation scikit-learn spam-classification-model tenserflow training-the-model
Last synced: 09 Apr 2026
https://github.com/abdiasarsene/routerwise-api-predictive-analytics-for-shipments
🧭 RouterWise optimizes artwork logistics through an automated, predictive, and monitored MLOps pipeline integrated into the PrecisioArt backend.
bentoml docker fastapi jenkins mlflow prometheus scikit-learn
Last synced: 11 Apr 2026
https://github.com/lmizner/grokking_data_science
Coding practice for basic data science interview questions in Python
data-science numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/jayadavv/dynamic-ml-model-selector
An interactive web application that allows users to upload their datasets and dynamically select, train, and evaluate various machine learning models. The app provides comprehensive performance metrics and visualizations, making it easy for users to analyze their data effectively.
decision-trees linear-regression logistic-regression matplotlib-pyplot plotly python random-forest scikit-learn streamlit
Last synced: 11 Apr 2026
https://github.com/genaray/ml.shopanalytics
A minimalist Python & cloud ML project that trains on Amazon sales & review data to recommend optimal prices/discounts to boost ratings/sales and surface actionable visual insights. Powered end-to-end by AWS CloudFront, S3, ALB & Fargate and Svelte.
ai aws aws-alb aws-cloudfront aws-ecs aws-fargate aws-s3 cicd devops machine-learning python scikit-learn terraform
Last synced: 11 Apr 2026
https://github.com/gregoritsch3/ml_eda_clustering_aidassessment
An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.
anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics
Last synced: 16 Apr 2026
https://github.com/kishanlalchoudhary/be-sem-7
BE SEM 7 Assignments
blockchain cpp design-analysis-algorithms machine-learning matplotlib numpy pandas scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/felinjob/ibm-applied-data-science-capstone
This project, part of the IBM Data Science Professional Certificate specialization, predicts the landing success of SpaceX's Falcon 9. Using data from the SpaceX API and web scraping, the project includes data analysis and machine learning to generate insights about the launches.
data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql
Last synced: 11 Apr 2026
https://github.com/jonad/boston_housing_price
Predicting Boston Housing Prices.
boston-housing-dataset jupyter-notebook matplotlib numpy pandas python3 scikit-learn
Last synced: 11 Apr 2026
https://github.com/sshbuilder/movie-recommendation-system
The primary goal of this project is to provide personalized movie recommendations to users based on their preferences and the characteristics of the movies. This is achieved through a multi-step process involving data preprocessing, text vectorization, and recommendation generation.
anaconda-environment data-science jupyter-notebook machine-learning movie-recommendation movies pandas python3 recommendation-system recommender-system scikit-learn scikitlearn-machine-learning
Last synced: 26 Feb 2025
https://github.com/eljandoubi/genre_classification
Create an ML pipeline for Genre Classification using MLflow.
hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb
Last synced: 11 Apr 2026
https://github.com/talapanenivarshithchowdary/asteroid-detection-ml
This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.
classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn
Last synced: 11 Apr 2026
https://github.com/audy21/datacamp
Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.
data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/swarnabhaghosh/house-price-prediction-model
Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.
column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn
Last synced: 11 Apr 2026
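A regression pipeline of the kind described above might look like the sketch below. It is a minimal illustration under stated assumptions, not the repository's implementation: the data is synthetic, the two features (`area`, `rooms`) are hypothetical, and the choice of which column gets which transformer is arbitrary.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PowerTransformer, StandardScaler

# Synthetic stand-in for a house-price table (assumption, not real data).
rng = np.random.default_rng(0)
n_samples = 200
area = rng.lognormal(mean=5.0, sigma=0.4, size=n_samples)  # right-skewed feature
rooms = rng.integers(1, 7, size=n_samples).astype(float)   # small integer feature
X = np.column_stack([area, rooms])
y = 50000.0 * np.log(area) + 5000.0 * rooms + rng.normal(0.0, 1000.0, size=n_samples)

# Column 0 gets a power transform to reduce skew; column 1 is standardized.
preprocess = ColumnTransformer(
    transformers=[
        ("skewed", PowerTransformer(), [0]),
        ("scaled", StandardScaler(), [1]),
    ]
)

# Preprocessing and the regressor are fitted together as one estimator.
model = Pipeline(steps=[("preprocess", preprocess), ("regressor", LinearRegression())])
model.fit(X, y)

predictions = model.predict(X[:5])
r2 = model.score(X, y)
print("R^2 on training data:", round(r2, 3))
```

Wrapping the transformers in a Pipeline means the same preprocessing is applied automatically at prediction time, and cross-validation can refit it on each training fold without leaking statistics from held-out data.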
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/das-amlan/customer-churn-prediction
Predicting customer churn using machine learning algorithms
customer-churn-prediction imbalanced-data keras-tensorflow machine-learning pandas prediction-model python scikit-learn seaborn tensorflow
Last synced: 11 Apr 2026
https://github.com/ricardorobledo/ml_optimization
matplotlib numpy python scikit-learn xgboost
Last synced: 11 Apr 2026
https://github.com/allanreda/telco-customer-churn-predictor-app
A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.
app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn
Last synced: 16 Apr 2026
https://github.com/dastogirrudro/machine-learning-and-deep-learning
This is my thesis project, completed at university. It uses machine learning and deep learning (an LSTM) to identify aggressive spam messages. It is written in Python with pandas, scikit-learn, and other frameworks, and tries several algorithms to improve accuracy.
deep-learning lstm machine-learning numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/matbesancon/kaggle-digit-recognizer
Some tests with the Kaggle Digit Recognition challenge
image-processing kaggle kaggle-digit-recognizer machine-learning mnist-dataset numpy pandas python scikit-image scikit-learn
Last synced: 11 Apr 2026
https://github.com/andrewjmack/credit-risk-classification
Supervised learning model trained and evaluated on loan risk for potential use in the prediction of the creditworthiness of an applicant
banking loan-prediction-analysis machine-learning pandas python scikit-learn supervised-learning
Last synced: 11 Apr 2026
https://github.com/trimoyee-g/adenovirus-disease-prediction
A machine learning project using scikit-learn to compare models for Adenovirus detection, selecting the most effective one based on accuracy, precision, and recall.
machine-learning matplotlib python random-forest-classifier scikit-learn
Last synced: 11 Apr 2026