scikit-learn
scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2026-06-23 00:27:46 UTC
https://github.com/mehuaniket/blog-classifier
Blog classifier using a scikit-learn random forest.
bag-of-words blog-classifier python scikit-learn
Last synced: 07 May 2026
https://github.com/otuemre/realtimenids
Real-time network intrusion detection system using Zeek flow logs and machine learning (IsolationForest). Detects threats with both signature-based and anomaly-based techniques trained on the CSE-CIC-IDS2018 dataset.
anomaly-detection cybersecurity flow-analysis isolation-forest machine-learning network-intrusion-detection nids scapy scikit-learn zeek
Last synced: 07 May 2026
https://github.com/islam-hady9/smartai_customersupport
Smart Customer Support Assistant
customer-support gpt-2 natural-language-processing python pytorch scikit-learn transformers
Last synced: 17 Feb 2026
https://github.com/antonio-f/find-duplicate-questions
Find duplicate questions on StackOverflow by their embeddings. From the Natural Language Processing course - Coursera's Advanced Machine Learning specialization.
cosine-similarity discounted-cumulative-gain embeddings gensim natural-language-processing nlp nltk scikit-learn starspace text-similarity word2vec
Last synced: 27 Apr 2026
https://github.com/tddschn/hack-ncsu-2024
ML and doc part of our Hack_NCState project, built in less than 1 day | Racial Bias in Criminal Justice Visualized: Code Black
bias machine-learning scikit-learn
Last synced: 08 May 2026
https://github.com/canayter/unsupervised-machine-learning
Utilizing Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.
k-means-clustering python scikit-learn unsupervised-machine-learning
Last synced: 08 May 2026
https://github.com/cool-japan/sklears
A comprehensive machine learning library in Rust, inspired by scikit-learn's intuitive API and combining it with Rust's performance and safety guarantees.
ai artificial-intelligence machine-learning rust rust-lang scikit-learn scikitlearn-machine-learning
Last synced: 26 Apr 2026
https://github.com/ahmetcansolak/decision-tree-classifier-scikit-learn
A simple decision tree classifier example using scikit-learn
decision-tree-classifier python scikit-learn
Last synced: 28 Apr 2026
https://github.com/official-biswadeb941/clopimedi---your-healths-trusted-care
ClopiMedi is an AI-driven healthcare application that simplifies doctor appointment bookings, offering personalized recommendations based on medical conditions to enhance patient-provider connections.
adam ai flask flask-api flask-api-backend full-stack-web-development joblib machine-learning scikit-learn tensorflow
Last synced: 28 Apr 2026
https://github.com/carpentries-incubator/python-classifying-power-consumption
Clustering and Classifying Time Series Data for Engineers
carpentries-incubator classification clustering engineering english lesson power-consumption pre-alpha python scikit-learn sklearn
Last synced: 12 Feb 2026
https://github.com/nirmalyabag20/breast-cancer-prediction-using-machine-learning
This project leverages machine learning to classify breast cancer as malignant or benign based on tumor characteristics. By applying and evaluating multiple algorithms, the model achieves high accuracy, demonstrating the practical application of data-driven solutions in medical diagnostics.
logistic-regression matplotlib numpy pandas python scikit-learn seaborn
Last synced: 12 Feb 2026
https://github.com/kritimbist/365-days-of-github-challenge-ai-machine-learning
This repository is part of my 365 Days Challenge: AI × Machine learning, where I combine my passion for Machine Learning 🤖 to learn, build, and document projects every single day for one year.
data-science data-visualization deep-learning machine-learning matplotlib numpy python scikit-learn
Last synced: 28 Apr 2026
https://github.com/francescopaolol/logisticregression
Predicting survival on the Titanic to get familiar with ML basics.
jupyter-notebook kaggle logistic-regression machine-learning ml pandas scikit-learn
Last synced: 16 Apr 2026
https://github.com/aakanksha1406/fake-news-classifier
Identifies when an article might be fake news.
keras lstm lstm-neural-networks nltk python scikit-learn tensorflow
Last synced: 13 Feb 2026
https://github.com/adithaker/falafel
🤖 A from-scratch implementation of a small-scale federated learning application.
cli-app distributed-systems federated-learning logistic-regression python scikit-learn
Last synced: 28 Apr 2026
https://github.com/loong64/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
ai-framework deep-learning hardware-acceleration loong64 loongarch64 machine-learning neural-networks onnx pytorch scikit-learn tensorflow
Last synced: 09 May 2026
https://github.com/lakshitalearning/churninsight
Customer Churn prediction means knowing which customers are likely to leave or unsubscribe from your service.
churn-prediction data-science flask google-colab machine-learning predictive-analytics python scikit-learn user-retention web-development
Last synced: 09 May 2026
https://github.com/davidcamilo0710/hate_speech_analysis
Hate speech detection using NLP for linguistic analysis and machine learning (XGBoost) for classification with Python and SpaCy.
hate-speech-detection linguistic-analysis nlp scikit-learn spacy xgboost
Last synced: 09 May 2026
https://github.com/bhuvaneshwarguttula/student-performance-indicator
To understand and predict how the student's performance (test scores) is affected by the other variables (Gender, Ethnicity, Parental level of education, Lunch, Test preparation course).
exploratory-data-analysis machine-learning pandas python scikit-learn student-performance-analysis
Last synced: 07 Mar 2026
https://github.com/vishal-038/attendance_by_face_recogination
This project is a face recognition-based attendance system that uses Python, OpenCV, Scikit-learn, Streamlit, and various other libraries like Pandas, Numpy, Datetime, and OS for different functionalities. It enables adding faces to the database, taking attendance based on face recognition, and showing live attendance through a web interface built
Last synced: 14 Feb 2026
https://github.com/hq969/customer-churn-prediction-with-hyperparameter-optimization-and-model-deployment
A complete end-to-end machine learning project that predicts customer churn using the Telco dataset. It includes data preprocessing, exploratory data analysis (EDA), model training with Random Forest, hyperparameter tuning, evaluation, and deployment via a Flask API.
flask numpy pandas python scikit-learn xgboost
Last synced: 02 Apr 2026
https://github.com/rakibhhridoy/supportvectormachinein-medical
Support vector machines for medical disease detection. Both linear and non-linear data can be fitted with an SVM through its choice of kernel. In medical applications, we focus on precision or recall rather than accuracy.
diabetes-prediction machine-learning medical precision-medicine recall-precision scikit-learn support-vector-machines svm
Last synced: 29 Apr 2026
https://github.com/akhil888binoy/intelligent-supplychain-management-system
Blockchain-powered supply chain management system with ML-driven sales prediction. Streamlines supplier-employee transactions and inventory management. Built with MERN stack, Solidity, and Flask.
blockchain decentralized-payments ethereum express flask foundry hackathon-project inventory-management machine-learning mern-stack mongodb nodejs python react sales-prediction scikit-learn smart-contracts solidity supply-chain-management wagmi
Last synced: 09 Oct 2025
https://github.com/RickContreras/StudentPerformancePredictionSaberPro
Classification model to predict student performance on the Saber Pro tests in Colombia. Includes exploratory data analysis, preprocessing, and machine learning models.
classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance
Last synced: 24 Oct 2025
https://github.com/andresmg07/real-time-sign-language-translator
AI-driven real-time American Sign Language translator. Implemented leveraging Support Vector Machines (SVM), OpenCV library and MediaPipe hands module.
ai computer-vision machine-learning mediapipe opencv pattern-recognition scikit-learn support-vector-machines
Last synced: 16 Apr 2026
https://github.com/realamirhe/leaf-node
A leaf node for your machine learning journey, from scratch to practical applications...
algorithm auto-encoder classification cybernetics feature-extraction feedback-mechanism lda learning machine-learning machine-learning-journey numpy pca practice regression scikit-learn sklearn smlfdl
Last synced: 09 May 2026
https://github.com/jasper-koops/easy-gscv
This library allows you to quickly train machine learning classifiers by automatically splitting the data set and using both grid search and cross validation in the training process.
classification machine-learning python3 scikit-learn
Last synced: 14 Feb 2026
https://github.com/siam29/ensemble-majority-voting-hard
In this project, we implemented an ensemble learning approach using majority voting (hard voting) with five machine learning classifiers: DT, RF, XGBC, ANN, and KNN. The ensemble model achieved an impressive accuracy score of 99.95% and an F1 score of 85.51%.
credit-card-fraud ensemble-learning machine-learning matplotlib pandas scikit-learn
Last synced: 09 May 2026
https://github.com/garcane/Income-Prediction-ML
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 24 Oct 2025
https://github.com/hedriss10/knn-machine-learning
Machine learning
machine-learning python scikit-learn
Last synced: 09 May 2026
https://github.com/kengz/feature_transform
Build Scikit ColumnTransformers by specifying configs.
auto-ml automated-feature-preprocessor columntransformer data-preprocessing feature-engineerig machine-learning scikit-learn
Last synced: 15 Feb 2026
https://github.com/ibrahimsharaf/kaggle-competitions
gensim kaggle kaggle-popcorn machine-learning nltk scikit-learn
Last synced: 10 May 2026
https://github.com/t-abishek/embedded-intent-classifier
A production-grade FastAPI application that uses sentence embeddings to classify user prompts into 4 categories. Built using Python, BGE SentenceTransformer, Scikit-learn, and FastAPI.
classifier embedded huggingface pandas scikit-learn transformer
Last synced: 10 May 2026
https://github.com/zachpinto/xc-rankings-predictions
Applied ML Project predicting cross-country team rankings based on individual-level performances
Last synced: 29 Apr 2026
https://github.com/ayyucedemirbas/solar_power_elasticnet
ElasticNet Linear Regression on Solar Power Generation
elasticnet-regression scikit-learn skops tabular-regression
Last synced: 29 Apr 2026
https://github.com/aryansk/customer-segmentation-analysis
Advanced customer segmentation project using K-Means clustering to analyze customer behavior based on annual income, spending score, and age.
elbow-method exploratory-data-analysis machine-learning machine-learning-algorithms python scikit-learn sentiment-analysis sentiment-classification
Last synced: 29 Apr 2026
https://github.com/bestmahdi2/uni__dataminningstackoverflowproject
A university project for a data mining course, working on StackOverflow website data with Python.
cart csv data-mining logistic-regression matplotlib mlp naive-bayes nltk numpy pandas python scikit-learn scipy seaborn stackoverflow svc textblob tqdm xgboost
Last synced: 16 Feb 2026
https://github.com/ahmetzamanis/usedcarkicksclassification
Imbalanced classification with scikit-learn and PyTorch Lightning.
class-weights classification classification-metrics data-science deep-learning focal-loss hyperparameter-optimization imbalanced-classification logistic-regression machine-learning neural-network optuna python pytorch pytorch-lightning scikit-learn sensitivity-analysis stochastic-gradient-descent support-vector-machines xgboost
Last synced: 10 May 2026
https://github.com/njorogepaul-moghul/iris-flower-classification
This project predicts the species of an Iris flower (Setosa, Versicolor, Virginica) based on its sepal and petal measurements. We trained and evaluated multiple ML models, with Logistic Regression performing best at 93% accuracy. Finally, we deployed the app on Streamlit: https://irisflowerapp-ripwlmfmctrzqphjapj97t.streamlit.app/
iris-classification jupyter-notebook logistic-regression machine-learning python random-forest-classifier scikit-learn
Last synced: 29 Apr 2026
https://github.com/mijisu0103/data-driven-decision-making-risk-analysis
This repository contains my coursework project for ECS7005P - Risk and Decision-Making for Data Science and AI. It applies probabilistic models, Bayesian networks, and decision analysis using Python and PyAgrum to evaluate risk and optimise decision-making under uncertainty.
machine-learning pandas probability-and-statistics pyagrum python quantitative-decision-making risk-assessment scikit-learn
Last synced: 10 May 2026
https://github.com/sahil210695/machine-learning-with-scikit-learn
Machine Learning with scikit-learn
confusion-matrix data-science machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/kshula/cipatala-hospital-management-system
Cipatala Hospital management system powered by AI and machine learning, built with Django and Bootstrap
bootstrap django django-project html-css-javascript python scientific-computing scikit-learn tensorflow
Last synced: 01 Mar 2026
https://github.com/marcinwitnik/iris-classifier
Iris species classifier using TensorFlow and Keras
ai data-science deep-learning iris-classification keras machine-learning neural-network python scikit-learn tensorflow
Last synced: 10 Jun 2026
https://github.com/neelanjan-chakraborty/custoclarity
CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.
clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn
Last synced: 11 May 2026
https://github.com/pngo1997/astrophysical-objects-classification
Project applies machine learning techniques to classify astrophysical objects using observational data from the Large Synoptic Survey Telescope (LSST).
adaptive-boosting-algorithm classification down-sampling gradient-boosting keras machine-learning neural-network python random-forest scikit-learn supervised-learning tensorflow time-series
Last synced: 10 May 2026
https://github.com/vaibhavs10/learn-ml
Modified notebooks (single) from kaggle.com/learn with added nuances
decision-trees machine-learning pandas random-forest scikit-learn
Last synced: 11 May 2026
https://github.com/rvats20/income-classification-using-ml
Model Training: Implementing various machine learning algorithms such as Logistic Regression, Decision Trees, Random Forests, and Gradient Boosting. Model Evaluation: Assessing model performance using metrics like accuracy, precision, recall, and F1-score. Hyperparameter Tuning
classification machine-learning machine-learning-algorithms ml pandas-dataframe python scikit-learn
Last synced: 11 May 2026
https://github.com/hasanulmukit/spam-email-classifier
This is a Spam Email Classifier built using Python and Streamlit. It uses a pre-trained model to predict whether an email is Spam or Not Spam. The app also provides the probability scores for both categories, enhancing transparency and reliability of the prediction.
email-classifier machine-learning nlp python scikit-learn spam-detection streamlit text-classification
Last synced: 11 May 2026
https://github.com/aravindnathan02/whatsapp-chat-analytics
This is an advanced analytics project on a WhatsApp group chat.
communication-complexity data-analytics emoji-sentiment latent-dirichlet-allocation network-analysis nlp python scikit-learn sentiment-analysis
Last synced: 11 May 2026
https://github.com/francescopaolol/decisiontree
Classifying iris plants into three species using this classic dataset.
decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn
Last synced: 16 Apr 2026
https://github.com/aditya-ranjan1234/interactive-salary-prediction-with-machine-learning
A Streamlit web application for exploring the UCI Census Income dataset, training machine learning models, and predicting employee salaries.
data-science machine-learning prediction python scikit-learn streamlit xgboost
Last synced: 29 Apr 2026
https://github.com/aravindnathan02/credit-card-fraud-detection
This repository contains a Machine Learning project aimed at detecting fraudulent credit card transactions. The goal is to build a reliable and efficient model that minimizes false positives and false negatives, ensuring financial safety and improving fraud detection capabilities.
classification-model fraud-detection logistic-regression machine-learning python random-forest scikit-learn
Last synced: 11 May 2026
https://github.com/tszon/end-to-end_ds_ml_project
I built an end-to-end customer churn segregation and prediction project.
containerisation data-science docker explianable-ai exploratory-data-analysis feature-engineering hdbscan-clustering kmeans-clustering machine-learning mlflow preprocessing-data scikit-learn shap statistical-test statistical-tests streamlit supervised-learning visualisation vscode
Last synced: 16 Apr 2026
https://github.com/emmanuelezenwere/aind-aiprojects
Portfolio of AI projects developed during my Udacity AI Nanodegree, covering Planning AI, Constraint Satisfaction, Hidden Markov Models, and Search algorithms.
alpha-beta-pruning astar-algorithm bellman-equation breadth-first-search constraint-satisfaction-problem depth-first-search hidden-markov-model kalman-filter minmax-algorithm networkx nltk numpy pandas scikit-learn scipy sympy
Last synced: 29 Apr 2026
https://github.com/python840/machine-learning-from-math-to-models
An in-depth book covering essential topics for AI, ML and DL.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks deep-learning deep-neural-networks machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python neural-network neural-networks python python3 reinforcement-learning reinforcement-learning-algorithms scikit-learn tensorflow
Last synced: 29 Apr 2026
https://github.com/texnoforge/texnomagic
TexnoMagic library for digital Magic
gmm magic numpy python recognition scikit-learn scipy
Last synced: 03 Mar 2026
https://github.com/brenofariasdasilva/scientific-research
My Scientific Research Code Repository.
ck code-metrics commons-lang jabref matplotlib numpy pandas pydriller python scientific-research scikit-learn similarity-measures statistical-analysis wem word2vec worked-example worked-example-miner
Last synced: 16 Apr 2026
https://github.com/elifftosunn/bert-bank-model
It is a Turkish BERT-based model that will analyze people's bank complaints and classify them according to one of eight categories.
countvectorizer doc2vec f1-score huggingface huggingface-transformer huggingface-transformers nlp nltk python3 scikit-learn stopwords tagged tfidf-transformer train-test-split word-tokenizer wordnetlemmatizer
Last synced: 12 May 2026
https://github.com/gigdevelopment10/neuralfunk
A Machine learning resource library for funky ML-Learners
algorithm keras machine-learning optimization-algorithms py-torch python scikit-learn tensorflow
Last synced: 29 Apr 2026
https://github.com/thevarunsharma/extracting-dominant-colors
A web application that extracts the dominant colors from an image using K-means clustering.
flask-application k-means-clustering machine-learning python scikit-learn unsupervised-learning
Last synced: 12 May 2026
https://github.com/jofaval/pima-indian-diabetes
Data Analysis and Classification of Pima Indian Women's Diabetes in 1988
data-analysis data-science deep-learning google-colab kaggle logistic-regression machine-learning pima-diabetes-data python scikit-learn xgboost
Last synced: 16 Apr 2026
https://github.com/alessiochen/setiment-analysis-ai-project
Application of Sentiment Analysis for the Artificial Intelligence class at UNIFI
ai andrew dataset movie-reviews scikit-learn sentiment-analysis
Last synced: 12 May 2026
https://github.com/aliy98/navigation-sensor-data-classification
Classification of a Navigation Robot Sensor Dataset Using SVM, Random Forest and Neural Network
artificial-neural-networks keras multiclass-classification random-forest scikit-learn scitos-g5 support-vector-machines
Last synced: 13 May 2026
https://github.com/ultrasage-danz/scikit-learn-ml
Machine Learning with scikit-learn by Data School
ai data data-school machine-learning macos ml scikit-learn ultrasage-dan
Last synced: 13 May 2026
https://github.com/alam025/customer-churn-prediction
🎯 Predict customer churn with 96%+ accuracy using Random Forest ML. Beautiful visualizations, production-ready code, and real business impact. Save revenue before customers leave! 🚀
churn-prediction classification customer-analytics customer-churn customer-retention data-science machine-learning pandas predictive-analytics python random-forest scikit-learn
Last synced: 11 Jun 2026
https://github.com/dhavaltaunk08/gender-classification
I did this project during my internship at IIT Guwahati. It aimed to perform gender classification in video streaming.
deep-learning librosa opencv-python python scikit-learn
Last synced: 14 May 2026
https://github.com/antoniskl/amsterdam-metro-crowdedness-prediction
The aim of this full-stack project is to predict with RandomForest and visualize crowdedness for metro stations of Amsterdam by using external factors.
amsterdam covid-19 crowded-areas dash full-stack metro prediction-model python random-forest regression scikit-learn ticketmaster-api
Last synced: 14 May 2026
https://github.com/anishshinde01/machine-learning-exercises
Python implementations of machine learning, statistics, and mathematical foundations.
linear-algebra machine-learning machine-learning-algorithms matplotlib numerical-analysis numpy python scikit-learn scipy statistics
Last synced: 11 Jun 2026
https://github.com/gerdm/machine_learning
A repository with a bunch of machine learning
analyses data-science machine-learning machine-learning-algorithms scikit-learn
Last synced: 30 Apr 2026
https://github.com/the-developer-306/house-price-predictor
House Price Predictor: Harnessing machine learning algorithms to forecast housing prices in Boston, empowering buyers and sellers with accurate predictions based on key factors like location, crime rate, rooms, accessibility, and more.
csv ipynb-jupyter-notebook joblib matplotlib numpy pandas python scikit-learn
Last synced: 23 Feb 2026
https://github.com/pankajarm/tabular_ml_toolkit
A helper library to jumpstart your machine learning project based on tabular or structured data.
data-science feature-engineering hyperparameter-tuning machine-learning parallelism python scikit-learn structured-data tabular xgboost
Last synced: 19 Jan 2026
https://github.com/srinivasrm/graphics_cards_analysis_and_application
In this project I extracted current graphics card prices from an authorized retailer in India and analyzed them.
beautifulsoup data-analysis data-science data-visualization etl graphic-card-price-prediction graphics-card graphics-card-analysis heroku-database machine-learning matplotlib pgsql python regression scikit-learn seaborn sql streamlit webapplication
Last synced: 04 Mar 2026
https://github.com/ricardouchub/colab-ml-pipeline-agent
Colab agent that, given a CSV dataset, plans and runs a Machine Learning pipeline from start to finish: initial analysis, preprocessing, training with Scikit-Learn, and automatic reporting with evalcards.
agent ai deepseek evalcards langchain llm ml pipeline-agent scikit-learn
Last synced: 16 Apr 2026
https://github.com/royxlead/multi-objective-feature-selection
NSGA-II multi-objective feature selection on medical tabular data. 9 of 30 features at 94.74% accuracy - matching full-feature baselines with 70% feature reduction.
deap evolutionary-algorithms feature-selection interpretable-ml medical-ml multi-objective-optimization nsga2 pareto-front random-forest scikit-learn
Last synced: 23 Jun 2026
https://github.com/petrosdemetrakopoulos/flight-passengers-prediction
A supervised learning problem given as a project in the "Data Mining in Databases and World Wide Web" course in the Computer Science Department of AUEB in the Winter semester of 2019.
classification classifier data-science machine-learning python scikit-learn sklearn university-project
Last synced: 30 Apr 2026
https://github.com/swimshahriar/heart-attack-prediction
Heart attack prediction from 13 features.
jupyter-notebook pandas python3 scikit-learn
Last synced: 18 Apr 2026
https://github.com/bhimrazy/iris-species-prediction-using-decision-tree-algorithm-grip
Iris Species Intelligence: Classifying Iris Species with Confidence using Decision Trees | The Sparks Foundation: GRIP
decision-tree-classifier fastapi gripjan23 machine-learning python scikit-learn sparkfoundation
Last synced: 10 Apr 2026
https://github.com/bestmahdi2/uni__pythonsupportvectormachinesbinaryclassification
A university project implementing binary classification with support vector machines in Python
binary-classification classification matplotlib numpy python scikit-image scikit-learn seaborn support-vector-machine svm
Last synced: 07 Apr 2026
https://github.com/labrijisaad/chefclub-data-internship
Repository showcasing my Data Engineer / Scientist internship at Chefclub, contributing to data infrastructure enhancement and fostering data-driven insights.
airflow chefclub data-engineering data-science gcp scikit-learn
Last synced: 28 Apr 2025
https://github.com/kingabzpro/mlops-with-jenkins
From data ingestion to deploying the model using Jenkins.
classification fastapi jenkins mlops scikit-learn
Last synced: 13 Feb 2026
https://github.com/skekre98/picture-compressor
A tool for compressing images using unsupervised machine learning
kmeans-clustering scikit-learn
Last synced: 17 May 2026
https://github.com/andi611/libsvm-classification
Performing classification tasks with the LibSVM toolkit on four different datasets: Iris, News, Abalone, and Income.
abalone abalone-dataset classification classification-algorithm data-mining income income-dataset iris iris-dataset libsvm libsvm-ready news-dataset newsgroups-dataset scikit-learn svm svm-classifier svm-training
Last synced: 30 Aug 2025
https://github.com/stitchsages/implyo
An advanced imputation library compatible with mixed type data with a focus on performance and high accuracy, with advanced imputation algorithms for numeric and categorical variables.
imputation imputation-algorithm imputation-methods knn machine-learning pandas pandas-dataframe pip python python3 random-forest scikit-learn
Last synced: 23 Jun 2026
https://github.com/Zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 02 Apr 2025
https://github.com/matsunagalab/tutorial_analyzingmddata
Google colab notebooks for typical MD trajectory analysis routines with Python
mdtraj molecular-dynamics scikit-learn tutorial
Last synced: 20 Apr 2026
https://github.com/ayushshahh/fespn
A neural network made to predict final exam scores of students
mlp mlp-regressor multilayer-perceptron neural-network prediction-model scikit-learn
Last synced: 02 May 2026
https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction
Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.
breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn
Last synced: 17 May 2026
https://github.com/mnitin-reddy/reducing-review-overhead-with-ml-based-application-screening
A machine learning classification project to filter out low-probability visa applications using historical data. It features an end-to-end implementation with CI/CD on AWS, achieving 93% accuracy with a KNN model optimized through Optuna, alongside integration of MLOps tools like Evidently and MLflow.
aws docker githubactions hypothesistesting machinelearning matplotlib mlflow mlops mongodb numpy optuna pandas python scikit-learn seaborn
Last synced: 10 Apr 2026
https://github.com/trainingbypackt/machine-learning-fundamentals-elearning
Use Python and scikit-learn to get up and running with the hottest developments in AI
artificial-intelligence clustering decision-tree machine-learning neural-network python scikit-learn supervised-learning unsupervised-learning
Last synced: 10 Apr 2026
https://github.com/alisonmitchell/boston-housing
Investigation of the Boston housing dataset to evaluate, train and test a regression model to predict house prices.
data-science machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn
Last synced: 10 Apr 2026
https://github.com/hokagem/damagedlogginganalyzer
A project analyzing statistics on damaged logging (wood) in Germany using Python.
analysis csv csv-parser k-fold-cross-validation numpy pandas pandas-dataframe pandas-python polynomial-regression scikit-learn statistics wood
Last synced: 03 May 2026
https://github.com/mehmoodulhaq570/machine-learning-models
A repository of machine learning models for predicting future instances. More specifically, this repository is a Machine Learning course for those who are interested in learning the basics of machine learning algorithms.
decision-trees gradient-descent gradient-descent-algorithm knn-algorithm linear-regression linear-regression-models logistic-regression-algorithm machine-learning-algorithms machine-learning-models ml naive-bayes-algorithm one-hot-encoding pca python random-forest-classifier scikit-learn svm-model
Last synced: 08 Apr 2025
https://github.com/andrewquijano/operating_systems_ii
Creating an Intrusion Detection System
ids kdd99 nsl-kdd-dataset scikit-learn
Last synced: 17 Jan 2026
https://github.com/jbayardo/aa-tp1
Machine learning on spam
email-classifier jupyter-notebook numpy pandas python scikit-learn spam-filtering text-processing
Last synced: 10 Apr 2026
https://github.com/chaitanya1436/student_performance_analysis
A project focused on analyzing college student performance using data on department, assessment scores, and performance labels. Implemented in Google Colab, the analysis includes data preprocessing, feature scaling, and exploratory data analysis to uncover insights and prepare the data for further analysis or modeling.
ata-preprocessing data-preparation exploratory-data-analysis feature-scaling google-colab numpy pandas scikit-learn
Last synced: 07 Feb 2026
https://github.com/jishen-harilal/lung-cancer-prediction-logistic-regression
Using logistic regression to predict cases of lung cancer.
classification data-visualization exploratory-data-analysis healthcare jupyter-notebook logistic-regression lung-cancer machine-learning python scikit-learn
Last synced: 15 May 2026
https://github.com/lasithaamarasinghe/stock-market-price-prediction
This ML model predicts the price of the S&P500 Stock Market Index using RandomForestClassifier
jupyter-notebook machine-learning pandas python random-forest-classifier scikit-learn sp500 stock-market-price-prediction yfinance
Last synced: 10 Apr 2026
https://github.com/williyam-m/movie-recommendation-system
Developed a web app with a cosine similarity machine learning model for personalized recommendations based on user history, likes, bookmarks, and activity. Implemented user auth and CRUD operations for movies.
django machine-learning numpy pandas prediction-model python scikit-learn
Last synced: 10 Apr 2026