scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-11 00:27:27 UTC
- JSON Representation
https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis
Last synced: 07 May 2026
https://github.com/mrapp-ke/examplewisef1maximizer
A scikit-learn meta-estimator for multi-label classification that aims to maximize the example-wise F1 measure
machine-learning multilabel-classification scikit-learn
Last synced: 27 Apr 2026
https://github.com/mehuaniket/blog-classifier
blog classifier with scikit random forest.
bag-of-words blog-classifier python scikit-learn
Last synced: 07 May 2026
https://github.com/otuemre/realtimenids
Real-time network intrusion detection system using Zeek flow logs and machine learning (IsolationForest). Detects threats with both signature-based and anomaly-based techniques trained on the CSE-CIC-IDS2018 dataset.
anomaly-detection cybersecurity flow-analysis isolation-forest machine-learning network-intrusion-detection nids scapy scikit-learn zeek
Last synced: 07 May 2026
https://github.com/islam-hady9/smartai_customersupport
Smart Customer Support Assistant
customer-support gpt-2 natural-language-processing python pytorch scikit-learn transformers
Last synced: 17 Feb 2026
https://github.com/antonio-f/find-duplicate-questions
Find duplicate questions on StackOverflow by their embeddings. From the Natural Language Processing course - Coursera's Advanced Machine Learning specialization.
cosine-similarity discounted-cumulative-gain embeddings gensim natural-language-processing nlp nltk scikit-learn starspace text-similarity word2vec
Last synced: 27 Apr 2026
https://github.com/tddschn/hack-ncsu-2024
ML and doc part of our Hack_NCState project builtin in less than 1 day | Racial Bias in Criminal Justice Visualized: Code Black
bias machine-learning scikit-learn
Last synced: 08 May 2026
https://github.com/canayter/unsupervised-machine-learning
Utilizing Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.
k-means-clustering python scikit-learn unsupervised-machine-learning
Last synced: 08 May 2026
https://github.com/cool-japan/sklears
A comprehensive machine learning library in Rust, inspired by scikit-learn's intuitive API and combining it with Rust's performance and safety guarantees.
ai artificial-intelligence machine-learning rust rust-lang scikit-learn scikitlearn-machine-learning
Last synced: 26 Apr 2026
https://github.com/ahmetcansolak/decision-tree-classifier-scikit-learn
A simple decision tree classifier example using scikit-learn
decision-tree-classifier python scikit-learn
Last synced: 28 Apr 2026
https://github.com/official-biswadeb941/clopimedi---your-healths-trusted-care
ClopiMedi is an AI-driven healthcare application that simplifies doctor appointment bookings, offering personalized recommendations based on medical conditions to enhance patient-provider connections.
adam ai flask flask-api flask-api-backend full-stack-web-development joblib machine-learning scikit-learn tensorflow
Last synced: 28 Apr 2026
https://github.com/carpentries-incubator/python-classifying-power-consumption
Clustering and Classifying Time Series Data for Engineers
carpentries-incubator classification clustering engineering english lesson power-consumption pre-alpha python scikit-learn sklearn
Last synced: 12 Feb 2026
https://github.com/nirmalyabag20/breast-cancer-prediction-using-machine-learning
This project leverages machine learning to classify breast cancer as malignant or benign based on tumor characteristics. By applying and evaluating multiple algorithms, the model achieves high accuracy, demonstrating the practical application of data-driven solutions in medical diagnostics.
logistic-regression matplotlib numpy pandas python scikit-learn seaborn
Last synced: 12 Feb 2026
https://github.com/kritimbist/365-days-of-github-challenge-ai-machine-learning
This repository is part of my 365 Days Challenge: AI × Machine learning, where I combine my passion for Machine Learning 🤖 to learn, build, and document projects every single day for one year.
data-science data-visualization deep-learning machine-learning matplotlib numpy python scikit-learn
Last synced: 28 Apr 2026
https://github.com/francescopaolol/logisticregression
About predicting survival on the Titanic and get familiar with ML basics
jupyter-notebook kaggle logistic-regression machine-learning ml pandas scikit-learn
Last synced: 16 Apr 2026
https://github.com/aakanksha1406/fake-news-classifier
to identify when an article might be fake news
keras lstm lstm-neural-networks nltk python scikit-learn tensorflow
Last synced: 13 Feb 2026
https://github.com/adithaker/falafel
🤖 A from-scratch implementation of a small scaled federated learning application.
cli-app distributed-systems federated-learning logistic-regression python scikit-learn
Last synced: 28 Apr 2026
https://github.com/loong64/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
ai-framework deep-learning hardware-acceleration loong64 loongarch64 machine-learning neural-networks onnx pytorch scikit-learn tensorflow
Last synced: 09 May 2026
https://github.com/lakshitalearning/churninsight
Customer Churn prediction means knowing which customers are likely to leave or unsubscribe from your service.
churn-prediction data-science flask google-colab machine-learning predictive-analytics python scikit-learn user-retention web-development
Last synced: 09 May 2026
https://github.com/davidcamilo0710/hate_speech_analysis
Hate speech detection using NLP for linguistic analysis and machine learning (XGBoost) for classification with Python and SpaCy.
hate-speech-detection linguistic-analysis nlp scikit-learn spacy xgboost
Last synced: 09 May 2026
https://github.com/bhuvaneshwarguttula/student-performance-indicator
To understand and predict how the student's performance (test scores) is affected by the other variables (Gender, Ethnicity, Parental level of education, Lunch, Test preparation course).
exploratory-data-analysis machine-learning pandas python scikit-learn student-performance-analysis
Last synced: 07 Mar 2026
https://github.com/vishal-038/attendance_by_face_recogination
This project is a face recognition-based attendance system that uses Python, OpenCV, Scikit-learn, Streamlit, and various other libraries like Pandas, Numpy, Datetime, and OS for different functionalities. It enables adding faces to the database, taking attendance based on face recognition, and showing live attendance through a web interface built
Last synced: 14 Feb 2026
https://github.com/hq969/customer-churn-prediction-with-hyperparameter-optimization-and-model-deployment
A complete end-to-end machine learning project that predicts customer churn using the Telco dataset. It includes data preprocessing, exploratory data analysis (EDA), model training with Random Forest, hyperparameter tuning, evaluation, and deployment via a Flask API.
flask numpy pandas python scikit-learn xgboost
Last synced: 02 Apr 2026
https://github.com/rakibhhridoy/supportvectormachinein-medical
Support vector machine in medical disease detection. Both linear and non-linear data can be fitted in svm through its kernel specialization In medical we focus on precision or recall rather than accuracy.
diabetes-prediction machine-learning medical precision-medicine recall-precision scikit-learn support-vector-machines svm
Last synced: 29 Apr 2026
https://github.com/akhil888binoy/intelligent-supplychain-management-system
Blockchain-powered supply chain management system with ML-driven sales prediction. Streamlines supplier-employee transactions and inventory management. Built with MERN stack, Solidity, and Flask.
blockchain decentralized-payments ethereum express flask foundry hackathon-project inventory-management machine-learning mern-stack mongodb nodejs python react sales-prediction scikit-learn smart-contracts solidity supply-chain-management wagmi
Last synced: 09 Oct 2025
https://github.com/RickContreras/StudentPerformancePredictionSaberPro
Modelo de clasificación para predecir el desempeño de estudiantes en las Pruebas Saber Pro en Colombia. Incluye análisis exploratorio de datos, preprocesamiento y modelos de machine learning.
classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance
Last synced: 24 Oct 2025
https://github.com/andresmg07/real-time-sign-language-translator
AI-driven real-time American Sign Language translator. Implemented leveraging Support Vector Machines (SVM), OpenCV library and MediaPipe hands module.
ai computer-vision machine-learning mediapipe opencv pattern-recognition scikit-learn support-vector-machines
Last synced: 16 Apr 2026
https://github.com/realamirhe/leaf-node
A leaf node for your machine learning journey, from scratch to practical applications...
algorithm auto-encoder classification cybernetics feature-extraction feedback-mechanism lda learning machine-learning machine-learning-journey numpy pca practice regression scikit-learn sklearn smlfdl
Last synced: 09 May 2026
https://github.com/jasper-koops/easy-gscv
This library allows you to quickly train machine learning classifiers by automatically splitting the data set and using both grid search and cross validation in the training process.
classification machine-learning python3 scikit-learn
Last synced: 14 Feb 2026
https://github.com/siam29/ensemble-majority-voting-hard
In this project, we implemented an ensemble learning approach using majority voting (hard voting) with five machine learning classifiers: DT, RF, XGBC, ANN, and KNN. The ensemble model achieved an impressive accuracy score of 99.95% and an F1 score of 85.51%.
credit-card-fraud ensemble-learning machine-learning matplotlib pandas scikit-learn
Last synced: 09 May 2026
https://github.com/garcane/Income-Prediction-ML
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 24 Oct 2025
https://github.com/hedriss10/knn-machine-learning
Machine learning
machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/kengz/feature_transform
Build Scikit ColumnTransformers by specifying configs.
auto-ml automated-feature-preprocessor columntransformer data-preprocessing feature-engineerig machine-learning scikit-learn
Last synced: 15 Feb 2026
https://github.com/ibrahimsharaf/kaggle-competitions
gensim kaggle kaggle-popcorn machine-learning nltk scikit-learn
Last synced: 10 May 2026
https://github.com/t-abishek/embedded-intent-classifier
A production-grade FastAPI application that uses sentence embeddings to classify user prompts into 4 categories: Built using Python, BGE SentenceTransformer, Scikit-learn, and FastAPI.
classifier embedded huggingface pandas scikit-learn transformer
Last synced: 10 May 2026
https://github.com/zachpinto/xc-rankings-predictions
Applied ML Project predicting cross-country team rankings based on individual-level performances
Last synced: 29 Apr 2026
https://github.com/ayyucedemirbas/solar_power_elasticnet
ElasticNet Linear Regression on Solar Power Generation
elasticnet-regression scikit-learn skops tabular-regression
Last synced: 29 Apr 2026
https://github.com/aryansk/customer-segmentation-analysis
Advanced customer segmentation project using K-Means clustering to analyze customer behavior based on annual income, spending score, and age.
elbow-method exploratory-data-analysis machine-learning machine-learning-algorithms python scikit-learn sentiment-analysis sentiment-classification
Last synced: 29 Apr 2026
https://github.com/bestmahdi2/uni__dataminningstackoverflowproject
A university project related to data mining lesson on StackOverflow website data with Python language
cart csv data-mining logistic-regression matplotlib mlp naive-bayes nltk numpy pandas python scikit-learn scipy seaborn stackoverflow svc textblob tqdm xgboost
Last synced: 16 Feb 2026
https://github.com/ahmetzamanis/usedcarkicksclassification
Imbalanced classification with scikit-learn and PyTorch Lightning.
class-weights classification classification-metrics data-science deep-learning focal-loss hyperparameter-optimization imbalanced-classification logistic-regression machine-learning neural-network optuna python pytorch pytorch-lightning scikit-learn sensitivity-analysis stochastic-gradient-descent support-vector-machines xgboost
Last synced: 10 May 2026
https://github.com/njorogepaul-moghul/iris-flower-classification
This project predicts the species of an Iris flower (Setosa, Versicolor, Virginica) based on its sepal and petal measurements. We trained and evaluated multiple ML models — with Logistic Regression performing best at 93% accuracy. Finally, we deployed on streamlit:[app] (https://irisflowerapp-ripwlmfmctrzqphjapj97t.streamlit.app/)
iris-classification jupyter-notebook logistic-regression machine-learning python random-forest-classifier scikit-learn
Last synced: 29 Apr 2026
https://github.com/mijisu0103/data-driven-decision-making-risk-analysis
This repository contains my coursework project for ECS7005P - Risk and Decision-Making for Data Science and AI. It applies probabilistic models, Bayesian networks, and decision analysis using Python and PyAgrum to evaluate risk and optimise decision-making under uncertainty.
machine-learning pandas probability-and-statistics pyagrum python quantitative-decision-making risk-assessment scikit-learn
Last synced: 10 May 2026
https://github.com/sahil210695/machine-learning-with-scikit-learn
Machine Learning with scikit-learn
confusion-matrix data-science machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/kshula/cipatala-hospital-management-system
Cipatala Hospital management systempowered by AI and machine learning built with Django and Bootstrap
bootstrap django django-project html-css-javascript python scientific-computing scikit-learn tensorflow
Last synced: 01 Mar 2026
https://github.com/marcinwitnik/iris-classifier
Klasyfikator gatunków Iris z użyciem TensorFlow i Keras
ai data-science deep-learning iris-classification keras machine-learning neural-network python scikit-learn tensorflow
Last synced: 10 Jun 2026
https://github.com/neelanjan-chakraborty/custoclarity
CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.
clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn
Last synced: 11 May 2026
https://github.com/pngo1997/astrophysical-objects-classification
Project applies machine learning techniques to classify astrophysical objects using observational data from the Large Synoptic Survey Telescope (LSST).
adaptive-boosting-algorithm classification down-sampling gradient-boosting keras machine-learning neural-network python random-forest scikit-learn supervised-learning tensorflow time-series
Last synced: 10 May 2026
https://github.com/vaibhavs10/learn-ml
Modified notebooks (single) from kaggle.com/learn with added nuances
decision-trees machine-learning pandas random-forest scikit-learn
Last synced: 11 May 2026
https://github.com/hasanulmukit/spam-email-classifier
This is a Spam Email Classifier built using Python and Streamlit. It uses a pre-trained model to predict whether an email is Spam or Not Spam. The app also provides the probability scores for both categories, enhancing transparency and reliability of the prediction.
email-classifier machine-learning nlp python scikit-learn spam-detection streamlit text-classification
Last synced: 11 May 2026
https://github.com/aravindnathan02/whatsapp-chat-analytics
This is an advanced analytics project on a WhatsApp group chat.
communication-complexity data-analytics emoji-sentiment latent-dirichlet-allocation network-analysis nlp python scikit-learn sentiment-analysis
Last synced: 11 May 2026
https://github.com/francescopaolol/decisiontree
About classify iris plants into three species in this classic dataset
decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn
Last synced: 16 Apr 2026
https://github.com/aditya-ranjan1234/interactive-salary-prediction-with-machine-learning
A Streamlit web application for exploring the UCI Census Income dataset, training machine learning models, and predicting employee salaries.
data-science machine-learning prediction python scikit-learn streamlit xgboost
Last synced: 29 Apr 2026
https://github.com/aravindnathan02/credit-card-fraud-detection
This repository contains a Machine Learning project aimed at detecting fraudulent credit card transactions. The goal is to build a reliable and efficient model that minimizes false positives and false negatives, ensuring financial safety and improving fraud detection capabilities.
classification-model fraud-detection logistic-regression machine-learning python random-forest scikit-learn
Last synced: 11 May 2026
https://github.com/tszon/end-to-end_ds_ml_project
I built an end-to-end customer churn segregation and prediction project.
containerisation data-science docker explianable-ai exploratory-data-analysis feature-engineering hdbscan-clustering kmeans-clustering machine-learning mlflow preprocessing-data scikit-learn shap statistical-test statistical-tests streamlit supervised-learning visualisation vscode
Last synced: 16 Apr 2026
https://github.com/emmanuelezenwere/aind-aiprojects
Portfolio of AI projects developed during my Udacity AI Nanodegree, covering Planning AI, Constraint Satisfaction, Hidden Markov Models, and Search algorithms.
alpha-beta-pruning astar-algorithm bellman-equation breadth-first-search constraint-satisfaction-problem depth-first-search hidden-markov-model kalman-filter minmax-algorithm networkx nltk numpy pandas scikit-learn scipy sympy
Last synced: 29 Apr 2026
https://github.com/python840/machine-learning-from-math-to-models
An in-depth book covering essential topics for AI, ML and DL.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks deep-learning deep-neural-networks machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python neural-network neural-networks python python3 reinforcement-learning reinforcement-learning-algorithms scikit-learn tensorflow
Last synced: 29 Apr 2026
https://github.com/texnoforge/texnomagic
TexnoMagic library for digital Magic
gmm magic numpy python recognition scikit-learn scipy
Last synced: 03 Mar 2026
https://github.com/brenofariasdasilva/scientific-research
My Scientific Research Code Repository.
ck code-metrics commons-lang jabref matplotlib numpy pandas pydriller python scientific-research scikit-learn similarity-measures statistical-analysis wem word2vec worked-example worked-example-miner
Last synced: 16 Apr 2026
https://github.com/elifftosunn/bert-bank-model
It is a Turkish BERT-based model that will analyze people's bank complaints and classify them according to one of eight categories.
countvectorizer doc2vec f1-score huggingface huggingface-transformer huggingface-transformers nlp nltk python3 scikit-learn stopwords tagged tfidf-transformer train-test-split word-tokenizer wordnetlemmatizer
Last synced: 12 May 2026
https://github.com/gigdevelopment10/neuralfunk
A Machine learning resource library for funky ML-Learners
algorithm keras machine-learning optimization-algorithms py-torch python scikit-learn tensorflow
Last synced: 29 Apr 2026
https://github.com/thevarunsharma/extracting-dominant-colors
A web application that extracts the dominant colors from an image using K-means clustering.
flask-application k-means-clustering machine-learning python scikit-learn unsupervised-learning
Last synced: 12 May 2026
https://github.com/jofaval/pima-indian-diabetes
Data Analysis and Classification of Pima Indian Women's Diabetes in 1988
data-analysis data-science deep-learning google-colab kaggle logistic-regression machine-learning pima-diabetes-data python scikit-learn xgboost
Last synced: 16 Apr 2026
https://github.com/alessiochen/setiment-analysis-ai-project
Application of Sentimental Analysis for Artificial Intelligence class at UNIFI
ai andrew dataset movie-reviews scikit-learn sentiment-analysis
Last synced: 12 May 2026
https://github.com/aliy98/navigation-sensor-data-classification
Classification of a Navigation Robot Sensor Dataset Using SVM, Random Forest and Neural Network
artificial-neural-networks keras multiclass-classification random-forest scikit-learn scitos-g5 support-vector-machines
Last synced: 13 May 2026
https://github.com/ultrasage-danz/scikit-learn-ml
Machine Learning with scikit-learn by Data School
ai data data-school machine-learning macos ml scikit-learn ultrasage-dan
Last synced: 13 May 2026
https://github.com/alam025/customer-churn-prediction
🎯 Predict customer churn with 96%+ accuracy using Random Forest ML. Beautiful visualizations, production-ready code, and real business impact. Save revenue before customers leave! 🚀
churn-prediction classification customer-analytics customer-churn customer-retention data-science machine-learning pandas predictive-analytics python random-forest scikit-learn
Last synced: 11 Jun 2026
https://github.com/dhavaltaunk08/gender-classification
I did this project during my internship at IIT Guwahati. It aimed to perform gender classification in video streaming.
deep-learning librosa opencv-python python scikit-learn
Last synced: 14 May 2026
https://github.com/antoniskl/amsterdam-metro-crowdedness-prediction
The aim of this full-stack project is to predict with RandomForest and visualize crowdedness for metro stations of Amsterdam by using external factors.
amsterdam covid-19 crowded-areas dash full-stack metro prediction-model python random-forest regression scikit-learn ticketmaster-api
Last synced: 14 May 2026
https://github.com/anishshinde01/machine-learning-exercises
Python implementations of machine learning, statistics, and mathematical foundations.
linear-algebra machine-learning machine-learning-algorithms matplotlib numerical-analysis numpy python scikit-learn scipy statistics
Last synced: 11 Jun 2026
https://github.com/gerdm/machine_learning
A repository with a bunch of machine learning
analyses data-science machine-learning machine-learning-algorithms scikit-learn
Last synced: 30 Apr 2026
https://github.com/the-developer-306/house-price-predictor
House Price Predictor: Harnessing machine learning algorithms to forecast housing prices in Boston, empowering buyers and sellers with accurate predictions based on key factors like location, crime rate, rooms, accessibility, and more.
csv ipynb-jupyter-notebook joblib matplotlib numpy pandas python scikit-learn
Last synced: 23 Feb 2026
https://github.com/pankajarm/tabular_ml_toolkit
A helper library to jumpstart your machine learning project based on tabular or structured data.
data-science feature-engineering hyperparameter-tuning machine-learning parallelism python scikit-learn structured-data tabular xgboost
Last synced: 19 Jan 2026
https://github.com/srinivasrm/graphics_cards_analysis_and_application
In the current project I have extracted graphics card current prices from an authorizer retailer in India and performed analysis
beautifulsoup data-analysis data-science data-visualization etl graphic-card-price-prediction graphics-card graphics-card-analysis heroku-database machine-learning matplotlib pgsql python regression scikit-learn seaborn sql streamlit webapplication
Last synced: 04 Mar 2026
https://github.com/ricardouchub/colab-ml-pipeline-agent
Agente en Colab que, dado un dataset en CSV, planifica y ejecuta un pipeline de Machine Learning de inicio a fin: análisis inicial, preprocesamiento, entrenamiento con Scikit-Learn y reporte automático con evalcards.
agent ai deepseek evalcards langchain llm ml pipeline-agent scikit-learn
Last synced: 16 Apr 2026
https://github.com/abdullah321umar/internee.pk-dataanalytics_internship-assignment4
🌟 Fraud Detection in Application 🌟 Through Isolation Forest and K-Means Clustering, the project detects suspicious patterns like inconsistent income, duplicate entries, and unrealistic employment data. This end-to-end workflow transforms raw data into actionable fraud insights — enhancing trust and accuracy.
anomaly-detection csv-handling data-cleaning data-exporting data-import data-normalization exploratory-data-analysis export interpretation matplotlib model-evaluation pandas pca python reporting scaling scikit-learn seaborn
Last synced: 06 May 2026
https://github.com/xharshit/careerconnect-smart-campus-placement-portal
CareerConnect is an AI-powered campus placement portal that helps students prepare for jobs through smart aptitude and coding tests, mock interviews, resume analysis, and more — all monitored with face recognition-based proctoring. Designed to assist students, TPOs, and companies for seamless hiring and tracking.
aptitude artificial-intelligence css face-recognition html machine-learning mockinterview nodejs opencv python resume-builder resumescanner scikit-learn streamlit technical-test tensorflow
Last synced: 13 Apr 2026
https://github.com/0xsolanaceae/neurolog
ML-Powered Log Analysis
isolation-forest machine-learning scikit-learn
Last synced: 24 Apr 2026
https://github.com/ashishkumarpanda45/movies_recomendation_system
this project give you recomendation on your search
machine-learning-algorithms numpy-arrays pandas python3 scikit-learn vscode
Last synced: 09 May 2026
https://github.com/adrien-1997/bike-forecast-paris-velib
Bike-sharing demand forecasting in Paris (Vélib’). A data science and machine learning project leveraging open urban mobility data to predict bike availability, analyze time series usage patterns, and provide interactive dashboards for visualization.
bike-sharing dashboard data-science duckdb forecasting machine-learning matplotlib open-data pandas paris predictive-modeling python scikit-learn streamlit transportation urban-mobility velib
Last synced: 11 Apr 2026
https://github.com/aryan-coder-student/smart-agricultural-monitering-aryanpahari037-
Increase agricultural productivity and sustainability by offering a smart farming assistant that helps farmers make data-driven decisions to optimize resource use and crop management.
hackathon-project machine-learning numpy pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/finite-sample/stagecoachml
Build two-stage models when your features arrive in two batches at different times.
machine-learning scikit-learn two-stage-models
Last synced: 14 Jan 2026
https://github.com/aryansk/fake-news-detection
A sophisticated machine learning solution to detect fake news using multiple classification algorithms. Identify the credibility of news articles with advanced text analysis techniques!
fake-news-detection machine-learning machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn
Last synced: 10 Apr 2026
https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning
This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.
decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc
Last synced: 18 Jan 2026
https://github.com/shanmukhsrisaivedullapalli/automatic-ticket-classification
This project processes customer complaint data using pandas for data manipulation and applies text preprocessing techniques, including lemmatization, to clean and normalize complaint text. The `tqdm` library provides progress bars for efficient tracking of text processing tasks.
matplotlib neural-networks nlp numpy pandas python3 scikit-learn seaborn tensorflow tqdm wordcloud
Last synced: 11 Apr 2026
https://github.com/andi611/libsvm-classification
Performing classification tasks with the LibSVM toolkit on four different datasets: Iris, News, Abalone, and Income.
abalone abalone-dataset classification classification-algorithm data-mining income income-dataset iris iris-dataset libsvm libsvm-ready news-dataset newsgroups-dataset scikit-learn svm svm-classifier svm-training
Last synced: 30 Aug 2025
https://github.com/uhstray-io/pyrizon
Data Collection, Analysis, Mapping, Pipelining & Transformation, & API using Python
api data-engineering etl numpy pandas plotly python pytorch raw-data scikit-learn seaborne sql sqlite tensorflow
Last synced: 09 Apr 2026
https://github.com/2003harsh/house-price-prediction-using-machine-learning
This project features a web app that predicts house prices using a linear regression model. Users can input details like location, square footage, bathrooms, and bedrooms through an HTML form. I've added a CI/CD pipeline with GitHub Actions, unit testing with pytest, and automated Docker containerization to improve deployment and robustness.
ci-cd data-analysis docker-image flask linear-regression machine-learning matplotlib mlops-workflow requests scikit-learn
Last synced: 04 Jan 2026
https://github.com/varun-khorgade/churnshield-customer-retention-predictor
Built an ML-based classification model to predict customer churn. Applied data preprocessing, feature engineering, and ensemble algorithms to improve prediction accuracy and help businesses implement retention strategies.
classification-algorithm datapreprocessing f1-score feature-engineering hyperparameter-tuning logistic-regression matplotlib model-evaluation numpy pandas python ran roc-auc scikit-learn seaborn xgboost
Last synced: 07 May 2026
https://github.com/elifirinci/mushrooms-plants-classification
This project features AI models for identifying mushrooms and plants as poisonous or edible using image-based predictions. Both models are tested through an interactive Gradio interface, ensuring user-friendly and accurate identification for foragers and researchers.
classification cnn cnn-classification gradio image-classification machine-learning mushroom-classification plant-classification scikit-learn
Last synced: 17 May 2026
https://github.com/ayisha-mohammed/freecodecamp_data_analysis_projects
freeCodeCamp-Data_Analysis projects using python
matplotlib-pyplot numpy python scikit-learn seaborn-plots
Last synced: 07 May 2026
https://github.com/rakibhhridoy/easywaydiveinto-datascience
Data Science is not as easy as it seems at first. The most problem faced by new learner are lack of resource knowledge as well as confusion in using the various resources. I hope this repository will benefit confusion learner.
algorithms algorithms-implemented bayesian-statistics data-science deep-learning deep-neural-networks linear-algebra machine-learning matplotlib multivariate-calculus numpy optimization pandas python scikit-learn scipy seaborn statistics statsmodels tensorflow
Last synced: 06 Apr 2026
https://github.com/fohlen/stats-experiment
A tiny stats experiment with GENESIS data
matplotlib python3 scikit-learn
Last synced: 17 May 2026
https://github.com/skywalkerSam/ML-101
Getting started with AI & ML!
ai-ml jupyter-notebooks matplotlib ml ml-framework numpy pandas python3 scikit-learn tensorflow
Last synced: 23 Apr 2025
https://github.com/evangks/hierarchical-clustering-mall-customers
A comprehensive machine learning project demonstrating hierarchical clustering for customer segmentation on the Mall Customers dataset. Includes EDA, preprocessing, multiple linkage/distance comparisons, and professional visualizations.
clustering data-science hierarchical-clustering jupyter-notebook machine-learning mall-customers portfolio-project python scikit-learn unsupervised-learning
Last synced: 07 Mar 2026
https://github.com/udityamerit/curafind-powered-by-ai
CuraFind AI is a web-based application leveraging Natural Language Processing (NLP) to intelligently recommend medicines. Users can search using symptoms, medicine names, or free-text descriptions, and receive suggestions along with brand substitutes for drugs
ai machine-learning nlp numpy pandas scikit-learn
Last synced: 18 Sep 2025
https://github.com/somjit101/human-activity-recognition
This project is to build a model that predicts the human activities such as Walking, Walking Upstairs, Walking Downstairs, Sitting, Standing or Laying using readings from the sensors on a smartphone carried by the user.
decision-tree-classifier eda feature-engineering gradient-boosting-classifier grid-search human-activity-recognition keras logistic-regression lstm random-forest-classifier rbf-kernel scikit-learn seaborn-plots signal-processing support-vector-classifier support-vector-machine t-sne tensorflow uci-har-dataset uci-machine-learning
Last synced: 23 Feb 2026
https://github.com/chrislemke/scikit-tabtrans
TabTransformer ready for Scikit learn 🧑🔬
deep-learning machine-learning python scikit-learn transformer
Last synced: 19 Apr 2025
https://github.com/pr38/survivalpredict
Estimating survival predictions
cox-proportional-hazard cox-regression python reliability-analysis scikit-learn survival-analysis
Last synced: 19 May 2026
https://github.com/soroush-04/incrementalsvm-road-accident-prediction
Enhance SVM and incremental SVM machine learning models for road accident severity prediction
incremental-learning machine-learning python scikit-learn svm
Last synced: 09 Apr 2026