scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python,
- Aliases: sklearn,
- Last updated: 2026-06-25 00:23:58 UTC
- JSON Representation
https://github.com/rickiepark/ml-ko
머신러닝, 딥러닝 한글 번역 저장소
deep-learning keras machine-learning python scikit-learn tensorflow
Last synced: 17 Apr 2026
https://github.com/neelanjan-chakraborty/custoclarity
CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.
clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn
Last synced: 11 May 2026
https://github.com/pngo1997/astrophysical-objects-classification
Project applies machine learning techniques to classify astrophysical objects using observational data from the Large Synoptic Survey Telescope (LSST).
adaptive-boosting-algorithm classification down-sampling gradient-boosting keras machine-learning neural-network python random-forest scikit-learn supervised-learning tensorflow time-series
Last synced: 10 May 2026
https://github.com/prashver/titanic-survival-prediction
This project tackles the Titanic challenge on Kaggle, predicting passenger survival based on variables like age, sex, and passenger class. The Jupyter notebook covers essential steps of a data science pipeline, including exploratory data analysis, data cleaning, feature engineering, and modeling. The dataset used is the Titanic dataset.
classification-algorithm machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn
Last synced: 02 May 2026
https://github.com/dolongbien/ml2018
Machine Learning Fall 2018
decision-tree-classifier dimensionality-reduction jupyter-notebook machine-learning-algorithms naive-bayes-classifier neural-networks python scikit-learn
Last synced: 01 May 2026
https://github.com/vaibhavs10/learn-ml
Modified notebooks (single) from kaggle.com/learn with added nuances
decision-trees machine-learning pandas random-forest scikit-learn
Last synced: 11 May 2026
https://github.com/rvats20/income-classification-using-ml
Model Training, Implementing various machine learning algorithms such as Logistic Regression, Decision Trees, Random Forests, and Gradient Boosting. Model Evaluation: Assessing model performance using metrics like accuracy, precision, recall, and F1-score. Hyperparameter Tuning
classification machine-learning machine-learning-algorithms ml pandas-dataframe python scikit-learn
Last synced: 11 May 2026
https://github.com/rakibhhridoy/machinelearning-featureselection
Before training a model or feed a model, first priority is on data,not in model. The more data is preprocessed and engineered the more model will learn. Feature selectio one of the methods processing data before feeding the model. Various feature selection techniques is shown here.
extratreesclassifier feature-selection gridsearchcv lasso-regression logistic-regression machine-learning numpy pandas pca rfe rfecv scikit-learn selectkbest
Last synced: 02 May 2026
https://github.com/anarya22/heart-disease-classification
Predicting heart disease using machine learning. This notebook looks into various python base ML and DS libraries in an attempt to build a machine learning model capable of predicting whether or not someone has heart disease based on their medical attributes.
data-cleaning data-visualization machine-learning matplotlib numpy pandas scikit-learn
Last synced: 01 May 2026
https://github.com/hermann-web/search-engine-with-python-nlp
A python search engine build with NLP methods for a django project
cosine-similarity document-searching natural-language-processing nlp nltk pandas python scikit-learn search-engine semantic-similarity similarity-score similarity-search
Last synced: 02 May 2026
https://github.com/hasanulmukit/spam-email-classifier
This is a Spam Email Classifier built using Python and Streamlit. It uses a pre-trained model to predict whether an email is Spam or Not Spam. The app also provides the probability scores for both categories, enhancing transparency and reliability of the prediction.
email-classifier machine-learning nlp python scikit-learn spam-detection streamlit text-classification
Last synced: 11 May 2026
https://github.com/khaymanii/titanic_survival_prediction_-model
This Model was built using Python and Logistic Regression algorithm
matplotlib numpy pandas python scikit-learn seaborn
Last synced: 02 May 2026
https://github.com/aravindnathan02/whatsapp-chat-analytics
This is an advanced analytics project on a WhatsApp group chat.
communication-complexity data-analytics emoji-sentiment latent-dirichlet-allocation network-analysis nlp python scikit-learn sentiment-analysis
Last synced: 11 May 2026
https://github.com/francescopaolol/decisiontree
About classify iris plants into three species in this classic dataset
decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn
Last synced: 16 Apr 2026
https://github.com/aditya-ranjan1234/interactive-salary-prediction-with-machine-learning
A Streamlit web application for exploring the UCI Census Income dataset, training machine learning models, and predicting employee salaries.
data-science machine-learning prediction python scikit-learn streamlit xgboost
Last synced: 29 Apr 2026
https://github.com/aravindnathan02/credit-card-fraud-detection
This repository contains a Machine Learning project aimed at detecting fraudulent credit card transactions. The goal is to build a reliable and efficient model that minimizes false positives and false negatives, ensuring financial safety and improving fraud detection capabilities.
classification-model fraud-detection logistic-regression machine-learning python random-forest scikit-learn
Last synced: 11 May 2026
https://github.com/umar-saadat/car-price-prediction-ml
🚗 A Machine Learning project that predicts the price of used cars using Linear Regression. Built with Python, Scikit-learn, and Streamlit, this app takes inputs like car brand, year, mileage, engine size, and more to estimate the selling price in real-time
ai-project car-price-prediction data-science linear-regression machine-learning ml-project python scikit-learn streamlit
Last synced: 02 May 2026
https://github.com/bistcuite/plainml
Painless Machine Learning Library for python based on scikit-learn
machine-learning ml plainml python scikit-learn
Last synced: 02 May 2026
https://github.com/tszon/end-to-end_ds_ml_project
I built an end-to-end customer churn segregation and prediction project.
containerisation data-science docker explianable-ai exploratory-data-analysis feature-engineering hdbscan-clustering kmeans-clustering machine-learning mlflow preprocessing-data scikit-learn shap statistical-test statistical-tests streamlit supervised-learning visualisation vscode
Last synced: 16 Apr 2026
https://github.com/royxlead/multi-objective-feature-selection
NSGA-II multi-objective feature selection on medical tabular data. 9 of 30 features at 94.74% accuracy - matching full-feature baselines with 70% feature reduction.
deap evolutionary-algorithms feature-selection interpretable-ml medical-ml multi-objective-optimization nsga2 pareto-front random-forest scikit-learn
Last synced: 23 Jun 2026
https://github.com/sapsan14/water-quality-ee
Estonian water quality ML — binary classification of Terviseamet open data, Jupyter + scikit-learn.
classification estonia jupyter ml open-data scikit-learn
Last synced: 02 May 2026
https://github.com/assamirzafar/learning
My Roadmaps and challenges are in this repo...I will add my colab and kaggle notebook links along with py script files in here.
calculus convolutional-neural-networks deep-learning deep-neural-networks keras linear-algebra machine-learning numpy opencv probability python3 pytorch scikit-learn scipy statistics
Last synced: 05 Apr 2026
https://github.com/jordandeklerk/pygridge
A scikit-learn compatible Python package for data-driven group regularized ridge regression
python regression regularized-regression scikit-learn
Last synced: 05 May 2026
https://github.com/rhazra-003/fake_news_detector
A Machine Learning model to detect fake news with more than 95% accuracy
fake-news numpy pandas scikit-learn
Last synced: 18 Apr 2026
https://github.com/saman-nia/multiclass-classification
Deep Learning VS. Machine learning
classification deep-learning logistic-regression multi-class-classification multi-classify-with-tensorflow one-vs-rest scikit-learn tensorflow text-features
Last synced: 06 May 2026
https://github.com/siam29/hybrid-feature-engineering-and-ensemble-learning
In this ML project, I proposed a methodology that provided an outperformed performance compared to another existing paper. For the comparison here focused mainly on F1, accuracy, AUC, and ROC score. This methodology provides a 99.96% accuracy score and 90.05% F1 score.
feature-selection keras-tensorflow machine-learning matplotlib python scikit-learn
Last synced: 18 Apr 2026
https://github.com/venky-1710/stress-level-predection
Stress Level Prediction is a web app using machine learning to estimate user stress levels. It takes inputs like anxiety, sleep quality, and academic performance, then predicts stress using a Decision Tree Classifier. Built with Python, Flask, and scikit-learn, it's useful for students, researchers, and those interested in stress management.
css flask html machine-learning numpy pandas python python-sklearn scikit-learn
Last synced: 05 Apr 2026
https://github.com/ayushsaksena30/cosmic-classifier
This notebook implements a structured machine learning pipeline to classify cosmic data using the CatBoost Classifier, known for its efficiency with categorical features and minimal preprocessing requirements.
catboost-classifier label-encoder machine-learning matplotlib numpy pandas robust-scaler scikit-learn seaborn simple-imputer
Last synced: 15 Apr 2026
https://github.com/joaoassalim/class-by-description-classifier-with-nlp
Enhancing Item Classification through Natural Language Processing: Leveraging Text Descriptions for Precise Categorization
bert fine-tuning nlp nlp-machine-learning scikit-learn sklearn tensorflow
Last synced: 06 May 2026
https://github.com/sarthak-1408/rain-fall-prediction
This repository represents the End to End Machine Learning Project (Rain Fall Prediction in Australia).
heroku heroku-deployment machine-learning numpy pandas rain-fall rain-fall-prediction scikit-learn xgboost-algorithm
Last synced: 05 May 2026
https://github.com/somjit101/nlp-casestudy-quora-question-similarity
An application of NLP and classical ML algorithms to an interesting real-world use case of predicting similarity between two questions on Quora. This allows the platform to combine similar questions into one and combine their answers to avoid duplication and unnecessary confusion.
cross-validation feature-engineering feature-extraction gradient-boosting kaggle logistic-regression machine-learning model-calibration natural-language-processing nlp quora-question-pairs scikit-learn svm text-mining xgboost
Last synced: 05 Apr 2026
https://github.com/tomwassing/brane-project
Brane example project using the Scikit-learn and Matplotlib packages
brane branescript matplotlib scikit-learn
Last synced: 17 Oct 2025
https://github.com/khaymanii/diabetes_prediction_model
This is a Machine learning model built using Python
matplotlib numpy pandas python scikit-learn
Last synced: 19 Apr 2026
https://github.com/drcbeatz/machine-learning-tool
Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.
data-science data-visualization django machine-learning python scikit-learn
Last synced: 06 May 2026
https://github.com/himendersharma0712/life_expectancy_pred
This repository is for a hackathon project.
jupyter-notebook machine-learning python scikit-learn
Last synced: 06 May 2026
https://github.com/shubhranpara/heart-disease-predictor
I have created this project as my Python term assignment. In this project I have trained a ML model to predict the heart disease using Scikit-learn library in python.
google-colab jupyter-notebook machine-learning medical prediction-model python scikit-learn
Last synced: 06 May 2026
https://github.com/emmanuelezenwere/aind-aiprojects
Portfolio of AI projects developed during my Udacity AI Nanodegree, covering Planning AI, Constraint Satisfaction, Hidden Markov Models, and Search algorithms.
alpha-beta-pruning astar-algorithm bellman-equation breadth-first-search constraint-satisfaction-problem depth-first-search hidden-markov-model kalman-filter minmax-algorithm networkx nltk numpy pandas scikit-learn scipy sympy
Last synced: 29 Apr 2026
https://github.com/mauriceling/sipy
Python-Based Statistical Graphical User Interface for Python
data-analysis julia julia-language jupyter jupyter-kernels pandas pandas-python python python3 r r-packages r-project r-stats scikit-learn scipy scipy-stats statistical-analysis statistical-tests statistics
Last synced: 15 Apr 2026
https://github.com/python840/machine-learning-from-math-to-models
An in-depth book covering essential topics for AI, ML and DL.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks deep-learning deep-neural-networks machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python neural-network neural-networks python python3 reinforcement-learning reinforcement-learning-algorithms scikit-learn tensorflow
Last synced: 29 Apr 2026
https://github.com/kbo-data-portal/pipeline
Automates KBO data collection and deployment with Airflow.
airflow dbt kbo lightgbm python scikit-learn
Last synced: 07 Oct 2025
https://github.com/sandeepbalachandran/predictor
A collection of prediction algorithms for different purposes
collection jupyter-notebook machine-learning notebook predictor regression-models scikit-learn
Last synced: 06 May 2026
https://github.com/sanu0711/healthcare-intelligence
Enhancing Patient Care through AI-Driven Disease Prediction
ai-health-assistant chatbot decision-tree-classifier disease-prediction djang generative-ai k-nearest-neighbours leafletjs map medication-details medication-history naive-bayes-classifier py random-forest-classifier scikit-learn sklearn support-vector-machines symptom-checker voice-search
Last synced: 14 Mar 2026
https://github.com/varun-khorgade/cvinsight-ai-resume-analyzer
AI tool that analyzes resumes, extracts keywords, and matches them with job descriptions.
css django html5 nlp python scikit-learn textparse
Last synced: 06 May 2026
https://github.com/nurulashraf/ann-cancer-prediction
An Artificial Neural Network built with TensorFlow and Keras to predict breast cancer based on the Wisconsin Breast Cancer dataset.
artificial-neural-network breast-cancer-prediction deep-learning keras machine-learning python scikit-learn tensorflow
Last synced: 06 May 2026
https://github.com/kieranlitschel/kerassearchcv
Built for the implementation of Keras in Tensorflow. Behaves similarly to GridSearchCV and RandomizedSearchCV in Sci-Kit learn, but allows for progress to be saved between folds and for fitting and scoring folds in parallel.
classification grid-search keras keras-tensorflow multithreading randomized-search scikit-learn
Last synced: 20 Apr 2026
https://github.com/khaymanii/house-price-prediction-model
This model was built using Python and XGBoost Regression algorithm
matplotlib numpy pandas python scikit-learn
Last synced: 06 May 2026
https://github.com/k-ashik/genescout-ai-genetic-disease-pathologist
GeneScout: An interpretable AI Pathologist that predicts 5 genetic diseases with 93.5% accuracy using an Ensemble Voting Classifier and SHAP for clinical explainability.
data-science explainable-ai healthcare-ai machine-learning precision-medicine python scikit-learn shap streamlit
Last synced: 20 Apr 2026
https://github.com/deaneeth/telco-churn-prediction-mlops
Production-ready ML pipeline for telco customer churn prediction using advanced ensemble methods (XGBoost, CatBoost, Random Forest). Handles class imbalance, provides business insights, and includes modular MLOps architecture. Built with scikit-learn, featuring comprehensive EDA, feature engineering, and business impact analysis.
catboost data-preprocessing ensemble-methods feature-engineering machine-learning mlops pipeline-development python random-forest scikit-learn telco-analytics xgboost
Last synced: 15 Apr 2026
https://github.com/0xpriyanshujha/ml-expedition
Machine Learning Assignments and Practice from zero
deeplearning jupyter kaggle machinelearning neural-network python scikit-learn tensorflow
Last synced: 24 Jan 2026
https://github.com/ghufranbarcha/codsoft-machine-learning-internship
This repository contain all Machine Learning & NLP task during my internship at Codsoft.
jupyter-notebook machinelearning nlp nltk python scikit-learn
Last synced: 17 Apr 2026
https://github.com/mg380/ibm-applied-data-science-capstone
This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization
capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql
Last synced: 05 Mar 2026
https://github.com/jagadishdas21/brain-tumor-detection
This repository contains the implementation of a deep learning model to detect brain tumors from MRI images using Convolutional Neural Networks (CNN). The goal of this project is to classify MRI images as either having a brain tumor (Positive) or not having one (Negative).
computer-vision convolutional-neural-networks matplotlib scikit-learn tensorflow
Last synced: 26 Feb 2026
https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis
Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.
data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn
Last synced: 10 Feb 2026
https://github.com/grachale/predict_life_expect
Predicting life expectancy (regression) with usage of custom random forest, linear regression and decision tree regressor from scikit-learn.
decision-tree-regression jupyter-notebook linear-regression pandas python random-forest regression scikit-learn
Last synced: 05 May 2026
https://github.com/elcorto/gp_playground
Explore selected topics related to Gaussian processes
gaussian-processes gpy gpytorch kernel-ridge-regression machine-learning scikit-learn tinygp
Last synced: 06 May 2026
https://github.com/texnoforge/texnomagic
TexnoMagic library for digital Magic
gmm magic numpy python recognition scikit-learn scipy
Last synced: 03 Mar 2026
https://github.com/myounus-codes/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
In this project I have cleaned the data for the model. Project Google Colab Link: https://colab.research.google.com/drive/1vQY-XEFJSdEkW2PQOSf1j13Yk8L-XXNw?usp=sharing
algorithms data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 05 May 2026
https://github.com/pngo1997/yelp-business-recommender-system
Building an item-based collaborative recommendation system using embeddings for establishments from the Yelp dataset.
content-based-recommendation embeddings geo-mapping geospatial information-retrieval python recommender-system scikit-learn spacy
Last synced: 05 May 2026
https://github.com/sralter/classifire
Wildfire Prediction Model: Samuel Alter's BrainStation 2023 Data Science Capstone Project
Last synced: 02 May 2026
https://github.com/mohammadvhossein/ml-gym
The ML-GYM repository showcases machine learning projects using **scikit-learn**, covering classification, regression, and clustering. It offers educational resources for beginners and practical examples for experienced users, complete with detailed instructions.
classification-algorithms clustering-methods cross-validation data-preprocessing data-science decision-trees feature-engineering machine-learning model-evaluation neural-networks python-programming random-forests regression-techniques scikit-learn supervised-learning unsupervised-learning
Last synced: 06 May 2026
https://github.com/tromesh/sinhala-parser
Sinhala parser project is based on Natural Language Processing (NLP)
flux-architecture natural-language-processing nlp python react scikit-learn sinhala
Last synced: 05 May 2026
https://github.com/wesslen/dsba6211-summer2024
DSBA6211 Adv Business Analytics Lab Notebooks
Last synced: 17 Apr 2026
https://github.com/brenofariasdasilva/scientific-research
My Scientific Research Code Repository.
ck code-metrics commons-lang jabref matplotlib numpy pandas pydriller python scientific-research scikit-learn similarity-measures statistical-analysis wem word2vec worked-example worked-example-miner
Last synced: 16 Apr 2026
https://github.com/magnuss0/movie-rec-system
The project extracts movie data using TheMovieDB API, processes it using TF-IDF and cosine similarity for generating recommendations, and stores the data in a DuckDB database. The system is encapsulated within a FastAPI web application and can be deployed using Docker. It provides movie recommendations in JSON format.
cosine-similarity docker duckdb movies-recommendation moviesdb-api ploomber poetry-python scikit-learn streamlit tf-idf
Last synced: 14 Apr 2026
https://github.com/brenofariasdasilva/worked-example-miner
Worked Example Miner (WEM): A Comprehensive Tool for Analyzing Java Repositories.
apache-kafka autometrics ck code-metrics code-quality-metrics distributed-systems heuristic java-code-analyzer java-repositories matplotlib numpy pydriller python refactoring-miner scikit-learn software-engineering wem worked-example-miner zookeeper
Last synced: 20 Apr 2026
https://github.com/george-gca/ai_papers_search_tool
Automatic paper clustering and search tool by fastext from Facebook Research
fasttext fasttext-embeddings fasttext-python nlp python scikit-learn
Last synced: 02 May 2026
https://github.com/elifftosunn/bert-bank-model
It is a Turkish BERT-based model that will analyze people's bank complaints and classify them according to one of eight categories.
countvectorizer doc2vec f1-score huggingface huggingface-transformer huggingface-transformers nlp nltk python3 scikit-learn stopwords tagged tfidf-transformer train-test-split word-tokenizer wordnetlemmatizer
Last synced: 12 May 2026
https://github.com/rixiiz/using-knn-to-predict-the-obp-of-mlb-players
Using KNN to predict the On Base Percentage (OBP) of Major League Baseball (MLB) players at the end of the season
artificial-intelligence dataset f1-score jupyter-notebook knn-regression machine-learning matplotlib mse numpy pandas python scikit-learn supervised-learning
Last synced: 05 Apr 2026
https://github.com/gigdevelopment10/neuralfunk
A Machine learning resource library for funky ML-Learners
algorithm keras machine-learning optimization-algorithms py-torch python scikit-learn tensorflow
Last synced: 29 Apr 2026
https://github.com/tynab/predictive-analysis
CyberSoft Data Analyst 08 - Predictive Analysis
cybersoft cybersoft-academy cybersoft-academy-da cybersoft-academy-da-08 cybersoft-academy-data-analyst cybersoft-academy-data-analyst-08 cybersoft-da cybersoft-da-08 cybersoft-data-analyst cybersoft-data-analyst-08 data-analyst data-analyst-08 jupyter-notebook linear-regression pandas predictive-analysis python scikit-learn sklearn yan
Last synced: 04 Jan 2026
https://github.com/omanshu209/ml-basics-2022
Machine Learnings(AI) models developed using the scikit-learn library in Python.
jupyter-notebook machine-learning python python3 scikit-learn
Last synced: 06 May 2026
https://github.com/rakshit-vasava/predictive-analytics-for-insurance-purchase
Predicting customer insurance purchases using stacking models and SMOTE for the Homesite Quote Conversion Problem on Kaggle.
k-nearest-neighbours kaggle-competition multilayer-perceptron python random-forest scikit-learn smote support-vector-machines
Last synced: 05 May 2026
https://github.com/thevarunsharma/extracting-dominant-colors
A web application that extracts the dominant colors from an image using K-means clustering.
flask-application k-means-clustering machine-learning python scikit-learn unsupervised-learning
Last synced: 12 May 2026
https://github.com/glencrawford/matchmaker
A k-nearest neighbors machine learning project to perform similarity matching using a dataset of OkCupid dating profiles.
django machine-learning python scikit-learn scipy
Last synced: 06 May 2026
https://github.com/sorna-fast/breast-cancer-diagnosis-neural-network
ANN-based breast cancer classifier using the Wisconsin Diagnostic Dataset. Implements advanced feature engineering and achieves 98.25% test accuracy. Includes comprehensive EDA, model training, and clinical impact analysis
keras-classification-models keras-neural-networks keras-tensorflow matplotlib-pyplot pandas-dataframe scikit-learn seaborn-plots sklearn-library tensorflow
Last synced: 20 Apr 2026
https://github.com/codenexa/nairobi
Quantifying Integrity in the Digital Age Misinformation spreads rapidly, accountability often falters, and the lines between transparency and manipulation blur
csv ipynb-jupyter-notebook matpotlib pkl-model python scikit-learn
Last synced: 05 May 2026
https://github.com/vatshayan/hospital-discharge-analysis
Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc
data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn
Last synced: 04 Mar 2025
https://github.com/tasninanika/will-you-survive-frontend
A full-stack machine learning app to predict Titanic passenger survival with a modern, interactive UI. Powered by FastAPI, scikit-learn, and a React frontend.
fastapi framer-motion python3 react react-router scikit-learn
Last synced: 12 Apr 2026
https://github.com/skywalkersam/ml-101
Machine Learning 101
ai-ml jupyter-notebooks matplotlib ml ml-framework numpy pandas python3 scikit-learn tensorflow
Last synced: 09 Apr 2026
https://github.com/andystmc/nextflownyc
Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.
data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting
Last synced: 07 Apr 2026
https://github.com/kr1shnasomani/sentimentscope
Sentiment analysis on movie review using TensorFlow and GloVe embeddings
deep-learning keras matplotlib natural-language-processing neural-networks numpy pandas scikit-learn tensorflow
Last synced: 12 Apr 2026
https://github.com/tasninanika/mammographic-masses-analysis-dt
This project uses a Decision Tree Classifier to predict whether a detected mammographic mass is benign (0) or malignant (1) based on input features.
decision-tree-classifier numpy pandas pyhton3 scikit-learn
Last synced: 11 Apr 2026
https://github.com/rayyan9477/machine-learning-driven-backorder-prediction-system
Experience a state-of-the-art Django web application designed to predict product backorders with exceptional accuracy. This platform leverages advanced machine learning techniques, incorporating pre-trained Random Forest Classifier, Decision Tree, and LGBM models.
matplotlib notebook numpy pandas python scikit-learn
Last synced: 12 Apr 2026
https://github.com/xushiyan/udacity-machine-learning
Content for Udacity's Machine Learning curriculum
jupyter-notebook machine-learning numpy pandas python reinforcement-learning scikit-learn supervised-learning udacity-machine-learning unsupervised-learning
Last synced: 13 Apr 2026
https://github.com/yaronkoresh/definers
A comprehensive Python toolkit for AI, data processing, media manipulation, and system utilities.
artificial-intelligence cuda data-science deep-learning diffusers feature-extraction generative-ai gpu gradio image-generation machine-learning multimedia music-generation python-library pytorch scikit-learn toolkit transformers video-generation web-scraping
Last synced: 08 Apr 2026
https://github.com/arnoldchrisoduor1/machinelearning-and-neuralnetworks
Starting out in Machine Learning.
numpy pandas python scikit-learn scipy sql tensorflow2
Last synced: 11 Apr 2026
https://github.com/jishen-harilal/lung-cancer-prediction-logistic-regression
Using logistic regression to predict cases of lung cancer.
classification data-visualization exploratory-data-analysis healthcare jupyter-notebook logistic-regression lung-cancer machine-learning python scikit-learn
Last synced: 15 May 2026
https://github.com/filipspl/bayesian-rf-knime-scikit
Bayesian optimization of RF via scikit in KNIME
bayesian-optimization knime knime-analytics-platform python random-forest scikit-learn
Last synced: 12 Apr 2026
https://github.com/chaitanya1436/student_performance_analysis
A project focused on analyzing college student performance using data on department, assessment scores, and performance labels. Implemented in Google Colab, the analysis includes data preprocessing, feature scaling, and exploratory data analysis to uncover insights and prepare the data for further analysis or modeling.
ata-preprocessing data-preparation exploratory-data-analysis feature-scaling google-colab numpy pandas scikit-learn
Last synced: 07 Feb 2026
https://github.com/evangks/hierarchical-clustering-mall-customers
A comprehensive machine learning project demonstrating hierarchical clustering for customer segmentation on the Mall Customers dataset. Includes EDA, preprocessing, multiple linkage/distance comparisons, and professional visualizations.
clustering data-science hierarchical-clustering jupyter-notebook machine-learning mall-customers portfolio-project python scikit-learn unsupervised-learning
Last synced: 07 Mar 2026
https://github.com/aaa1928/iris-ml-classifier
PyTorch model that classifies Iris species based on characteristics about the length and width of sepals and petals.
deep-learning iris-classification iris-dataset machine-learning neural-network numpy pandas python pytorch scikit-learn
Last synced: 05 Apr 2026
https://github.com/jibbs1703/classic-ml-models
This repository contains scripts for developing, training and evaluating machine learning models using several python frameworks.
aws data-preprocessing data-science deep-learning feature-engineering machine-learning multiclass-classification neural-networks predictive-modeling pyspark-mllib pytest scikit-learn xgboost-classifier
Last synced: 10 Apr 2026
https://github.com/elifirinci/mushrooms-plants-classification
This project features AI models for identifying mushrooms and plants as poisonous or edible using image-based predictions. Both models are tested through an interactive Gradio interface, ensuring user-friendly and accurate identification for foragers and researchers.
classification cnn cnn-classification gradio image-classification machine-learning mushroom-classification plant-classification scikit-learn
Last synced: 17 May 2026
https://github.com/aymen016/film-recommendation-engine
A machine learning-powered movie recommender system designed to provide personalized recommendations based on user preferences and data analysis. This project includes a backend recommendation engine, a Streamlit-based interface, and a web-based frontend for an enhanced user experience.
flask numpy pandas pickle python scikit-learn streamlit
Last synced: 09 Apr 2026
https://github.com/omar-karimov/ai4renewals-gendes
This project adopts a modular Python architecture within an MLOps framework to enhance subscription renewal predictions, utilizing FastAPI and MongoDB with AWS integration (S3, ECR, EC2). Docker ensures seamless deployment, and GitHub Actions automate the CI/CD workflows. Evidently AI monitors drift to guarantee predictive accuracy and reliability.
api aws deployment docker drift-detection ec2 ecr evidently fastapi github-actions mangodb mlops modularization pipeline s3 scikit-learn workflow yaml-configuration
Last synced: 15 Apr 2026
https://github.com/davidyen1124/cowculator
COWCULATOR: AI-driven catering cost forecasting in Python. Trains order-level and daily time series models, exports an edge-ready JSON bundle, and includes a demo web UI.
cli data-science edge-ai forecasting github-actions machine-learning mypy pandas python ruff scikit-learn time-series uv
Last synced: 05 May 2026
https://github.com/garcane/income-prediction-ml
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 08 Apr 2026