An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/zuhairzia/titanic-survival-project

This is a Titanic Survival Prediction Model developed using Python, Pandas, Scikit-learn, and Jupyter Notebook. The model predicts whether a passenger survived the Titanic disaster based on features such as age, gender, and passenger class.

csv-dataset flask jupyter-notebook matplotlib numpy pandas pandas-library python scikit-learn seaborn streamlit

Last synced: 11 Apr 2026

https://github.com/karimosman89/resume-screening

Screen resumes to identify the best candidates.Build a machine learning model that screens resumes and ranks candidates based on job descriptions.Streamline the hiring process for HR departments by automating candidate screening.

machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing

Last synced: 29 Apr 2026

https://github.com/djdhairya/crop-recommendation

Crop Recommendation System is a powerful tool for enhancing agricultural decision-making. By leveraging data-driven insights, it empowers farmers to maximize yield and ensure sustainable practices.

adaboostclassifier bagging-classifier csv decision-trees gaussian html knn-classification logistic-regression machine-learning machine-learning-algorithms matplotlib model numpy pandas random-forest random-forest-classifier scikit-learn seaborn svc

Last synced: 11 Apr 2026

https://github.com/shreeparab1890/handwritten-digit-recognition

In this iPython Noetbook we are going to use the MNIST dataset for the implementation of a handwritten digit recognition app using LogisticRegression and SGDClassifier and compare the accuracy and other metrics.

handwritten-digit-recognition image-classification matplotlib mnist-dataset python scikit-learn sklearn

Last synced: 11 Apr 2026

https://github.com/msikorski93/breast-cancer-classifying

Identifying and assigning breast cancer diagnosis using machine learning methods, based on observations in WDBC dataset. All classifiers have been evaluated and performed well for this task.

breast-cancer classification k-nearest-neighbours keras logistic-regression naive-bayes neural-networks scikit-learn tensorflow

Last synced: 30 Apr 2026

https://github.com/kirtipratihar/python_libraries_for_ds

This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/akimuddinshaikh/machine-learning-project

A comparative study of regression models (Decision Tree, Random Forest, Ridge, Lasso, SVM) for predicting real estate prices in King County, NYC, and California using PCA & Pipeline techniques.

machine-learning pca-analysis python regression-models scikit-learn statsmodels

Last synced: 16 May 2026

https://github.com/eljandoubi/deploy-ml

Deploying a ML Model to Cloud Application Platform with FastAPI

ci-cd fastapi github-actions gunicorn pandas pytest render scikit-learn uvicorn

Last synced: 11 Apr 2026

https://github.com/das-amlan/delay-prediction-in-urban-mobility-networks

Predicting delays in Urban mobility netwrok using different ML algorithms.

delay-prediction gradient-boosting machine-learning python r scikit-learn

Last synced: 05 Apr 2026

https://github.com/mr-ndi/tibebai

Machine learning experiments on student performance prediction. Inspired by tibeb (wisdom) in Amharic, this project explores regression models to understand how study factors influence exam scores.

ai data-science education elevvo google-colab internship kaggle linear-regression machine-learning matplotlib pandas polynomial-regression prediction regression scikit-learn student-performance tibebai-wisdom

Last synced: 11 Apr 2026

https://github.com/timothyjan/intro-machine-learning-classifiers

We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.

numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/vasu7052/spam-classifier

This is a Machine Learning Project to detect whether a given sentence maybe a spam or not using Python and Keras.

keras keras-neural-networks python3 scikit-learn spam-classification tensorflow

Last synced: 11 Apr 2026

https://github.com/brej-29/disaster-tweets-nlp-model-benchmarks

Benchmark NLP models on Kaggle “Disaster Tweets”: TF-IDF + Naive Bayes baseline, Keras deep nets (Dense/LSTM/GRU/BiRNN/Conv1D), and TensorFlow Hub Universal Sentence Encoder transfer learning—compared using accuracy, precision, recall, and F1.

bidirectional-rnn cnn conv1d deep-learning disaster-tweets gru kaggle keras lstm machine-learning naive-bayes nlp rnn scikit-learn tensorflow tensorflow-hub text-classification tfidf

Last synced: 11 Apr 2026

https://github.com/priteshramani/movie-recommender

A content-based movie recommendation system using Python, Pandas, and cosine similarity to suggest movies based on their features.

cosine-similarity pandas pickle python scikit-learn streamlit

Last synced: 11 Apr 2026

https://github.com/simranjeet97/spam-classification

Spam Classification Using Natural Language Processing (NLP), Scikit-Learn Library, and Bayesian Method.

data-science emails kaggle kaggle-dataset naive-bayes-classifier nlp-machine-learning nltk-python python scikit-learn spam-classification

Last synced: 11 Apr 2026

https://github.com/vamsi0333/ai-sentiment-cicd-project

End-to-end CI/CD deployment of an AI-powered Sentiment Analysis API using FastAPI, Docker, Kubernetes, Terraform, and GitHub Actions. Demonstrates complete MLOps + DevOps workflow.

ai cicd devops docker fastapi github-actions kubernetes mlops scikit-learn terraform

Last synced: 12 Apr 2026

https://github.com/abrarshahok/electric-vehicle-charging-station-energy-consumption-prediction

With the rapid adoption of electric vehicles, optimizing energy usage at charging stations has become crucial for improving operational efficiency and ensuring customer satisfaction. This tool leverages predictive modeling to forecast energy consumption for charging sessions based on various input features.

matplotlib numpy pandas plotly python3 scikit-learn xgboost

Last synced: 09 Jun 2026

https://github.com/mrktsm/spam-email-recognizer

Long Short-Term Memory (LSTM) network trained to classify emails as spam or non-spam. It processes email content to make accurate predictions and can be integrated into projects for efficient spam detection and email management.

data-preprocessing keras lstm-neural-network model-architecture nltk numpy pandas performance-evaluation scikit-learn spam-classification-model tenserflow training-the-model

Last synced: 09 Apr 2026

https://github.com/abdiasarsene/routerwise-api-predictive-analytics-for-shipments

🧭 RouterWise optimise la logistique d’œuvres d’art grâce à une pipeline MLOps automatisée, prédictive et monitorée, intégrée au backend de PrecisioArt.

bentoml docker fastapi jenkins mlflow prometheus scikit-learn

Last synced: 11 Apr 2026

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/alpha597/music_classification_ml

A project which compares different machine learning algorithms' accuracy in music genre classification of a large dataset.

machine-learning pandas python scikit-learn tensorflow

Last synced: 11 Apr 2026

https://github.com/lmizner/grokking_data_science

Coding practice for basic data science interview questions in Python

data-science numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/jingjing-jin/purchase-behavior-analysis

Purchase Behavior Analysis for Targeted Customer Segmentation

clustering-algorithm data-mining machine-learning python scikit-learn

Last synced: 20 Jan 2026

https://github.com/christianconchari/bike-sharing-demand

Este repositorio contiene el trabajo práctico final de la materia Aprendizaje de Máquina II de la Especialización en Inteligencia Artificial (CEIA) de la Facultad de Ingeniería de la Universidad de Buenos Aires (FIUBA).

airflow docker fastapi machine-learning mlflow python scikit-learn

Last synced: 20 Jan 2026

https://github.com/jayadavv/dynamic-ml-model-selector

An interactive web application that allows users to upload their datasets and dynamically select, train, and evaluate various machine learning models. The app provides comprehensive performance metrics and visualizations, making it easy for users to analyze their data effectively.

decision-trees linear-regression logistic-regression matplotlib-pyplot plotly python random-forest scikit-learn streamlit

Last synced: 11 Apr 2026

https://github.com/arrhythmia-detection/authorprovidedfeaturescombineddtoptimized

Deploys an optimized Decision Tree for Arrhythmia classification using Chapman ECG dataset on Arduino UNO board

arduino-uno arrhythmia-classification atmega328p chapman-ecg decision-tree-classifier eloquent scikit-learn

Last synced: 17 May 2026

https://github.com/monish-nallagondalla/cement_strength_prediction

The Cement Strength Prediction project uses machine learning to predict the compressive strength of cement based on its components, such as Cement, Fly Ash, Water, Superplasticizer, Coarse Aggregate, Fine Aggregate, and Age. The goal is to forecast compressive strength (MPa) for optimized cement production and quality control.

cement-strength-prediction construction-industry data-analysis data-preprocessing data-science data-visualization feature-engineering machine-learning predictive-modeling python regression-analysis scikit-learn

Last synced: 11 May 2026

https://github.com/blaz-cerpnjak/student-dropout-prediction

Student dropout predictions based on grades and other info. Classification problem with MLPClassifier.

classification machine-learning mlpclassifier neural-networks poetry predicting-student-dropout python scikit-learn scikit-learn-pipelines

Last synced: 17 May 2026

https://github.com/priyanshul28/ml_regression_eda_waiterstip

An EDA and Machine Learning Regression exercise on the Waiter's Tip dataset demonstrating the use of Linear Regression, Neural Network Regressors, Decision Trees, Random Forests, Linear SVR, XGBoost, etc. The models are optimized using hyperparameter tuning through GridSearchCV.

eda machine-learning regression scikit-learn seaborn

Last synced: 17 May 2026

https://github.com/genaray/ml.shopanalytics

A minimalist Python & cloud ML project that trains on Amazon sales & review data to recommend optimal prices/discounts to boost ratings/sales and surface actionable visual insights. Powered end-to-end by AWS CloudFront, S3, ALB & Fargate and Svelte.

ai aws aws-alb aws-cloudfront aws-ecs aws-fargate aws-s3 cicd devops machine-learning python scikit-learn terraform

Last synced: 11 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/felinjob/ibm-applied-data-science-capstone

Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 11 Apr 2026

https://github.com/dionixius7/titanic-disaster-ml-model

This project predicts the survival of passengers on the Titanic by using Kaggle Titanic Disaster Dataset. The dataset contains information related to passengers, such as age, gender, and class. Different machine learning algorithms have been applied for this predictive model to accomplish an accurate prediction that will define the survival chances

data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction

Last synced: 07 Feb 2026

https://github.com/surajsanap/technohack_mlinternship

1) Wine Quality Analysis and Classification, 2)Movie Review Sentiment Analysis, 3)Diabetes Prediction Using Machine Learning

deep-learning machine-learning pandas python scikit-learn

Last synced: 08 May 2025

https://github.com/paulinhok14/csgo-datascience-project

📊 Analysis of CS:GO grenade usage patterns and their impact on match outcomes using data science and statistical methods.

matplotlib mlflow numpy python scikit-learn scipy seaborn

Last synced: 30 Dec 2025

https://github.com/sshbuilder/movie-recommendation-system

The primary goal of this project is to provide personalized movie recommendations to users based on their preferences and the characteristics of the movies. This is achieved through a multi-step process involving data preprocessing, text vectorization, and recommendation generation.

anaconda-environment data-science jupyter-notebook machine-learning movie-recommendation movies pandas python3 recommendation-system recommender-system scikit-learn scikitlearn-machine-learning

Last synced: 26 Feb 2025

https://github.com/mitchmedeiros/mlcompare

Quickly compare machine learning models across libraries and datasets.

huggingface-datasets kaggle machine-learning openml pytorch scikit-learn xgboost

Last synced: 02 Feb 2026

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 11 Apr 2026

https://github.com/talapanenivarshithchowdary/asteroid-detection-ml

This project uses Machine Learning to detect and classify asteroids based on trajectory and size, aiding in Near-Earth Object detection and planetary defense.

classification data-science decision-trees jupyter-notebook knn logistic-regression machine-lea matplotlib numpy pandas pillow prediction python3 random-forest scikit-learn

Last synced: 11 Apr 2026

https://github.com/audy21/datacamp

Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.

data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/swarnabhaghosh/house-price-prediction-model

Built an end-to-end regression pipeline to predict house prices using Linear Regression with automated preprocessing (PowerTransform, StandardScaling) via Scikit-learn's Pipeline and ColumnTransformer.

column-transformer linear-regression matplotlib-pyplot numpy pandas pipeline python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/heyitsjoealongi/fantasy-football-qbwr-model

Fantasy Football: Quarterback / Wide Receiver - Gaussian Process Regression (GPR) Machine Learning Model

machine-learning matplotlib model numpy python scikit-learn

Last synced: 01 Apr 2025

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 11 Apr 2026

https://github.com/rizquuula/sentimentanalyzenaivebayes

Analisis Sentimen menggunakan metode Naive Bayes dengan "One time learning" dan "Continuous Learning"

machine-learning naive-bayes nlp python scikit-learn sentiment-analysis text-classification

Last synced: 17 May 2026

https://github.com/urvee1810/bitcoin-price-forecasting-using-arma

The analysis reveals the challenges of predicting Bitcoin prices during highly volatile periods and demonstrates how traditional time series models perform under different market conditions. The project includes comparative analysis of model performance during stable and volatile market phases.

arima arma augmented-dickey-fuller-test feature-engineering machine-learning matplotlib mplfina numpy pandas python random-forest randomforestregressor scikit-learn seaborn statsmodels time-series-analysis

Last synced: 06 Mar 2026

https://github.com/adirbella37/safety-analytics-project

Final project in Safety Management: analytics and predictive modeling for occupational incidents. Includes EDA, logistic regression, Poisson/Negative Binomial with overdispersion checks, ROC/AUC, and prediction exercises.

classification data-visualization drunk-and-drive eda logistic-regression matplotlib negative-binomial numpy occupational-safety overdispersion pandas poisson-regression python road-safety roc-auc scikit-learn seaborn statmodels

Last synced: 09 Apr 2026

https://github.com/capac/higher-education-students-performance-evaluation

Machine learning project for evaluating higher education student performance

docker evidently grafana mlflow postgresql prefect python scikit-learn xgboost

Last synced: 09 Apr 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/dastogirrudro/machine-learning-and-deep-learning

This is my thesis project which i have done in varsity.Here i used machine learning and deep learning i used LSTM as deep learning.This can identify aggresive spam message. Here i used pandas scikit-learn and many more framework i used python as a programming language.I used many algorithm for highering the accuracy of my project.

deep-learning lstm machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/duruii/contest-dingtalkcup2-a

2023年第二届“钉钉杯”大学生大数据挑战赛——智能手机用户监测数据分析

data-mining machine-learning pandas scikit-learn xgboost

Last synced: 12 Mar 2025

https://github.com/andrewjmack/credit-risk-classification

Supervised learning model trained and evaluated on loan risk for potential use in the prediction of the creditworthiness of an applicant

banking loan-prediction-analysis machine-learning pandas python scikit-learn supervised-learning

Last synced: 11 Apr 2026

https://github.com/trimoyee-g/adenovirus-disease-prediction

A machine learning project using scikit-learn to compare models for Adenovirus detection, selecting the most effective one based on accuracy, precision, and recall.

machine-learning matplotlib python random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/trimoyee-g/flipkart-reviews-sentiment-analysis

A RandomForestClassifier-based sentiment analysis model for efficient binary categorization of Flipkart reviews.

machine-learning matplotlib python random-forest-classifier scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/trimoyee-g/phishing-site-predictor

A phishing site prediction model using scikit-learn's Random Forest Classifier, achieving high accuracy and gaining insights into website characteristics.

data-visualization machine-learning python random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/pradeep-r04/attendiq

AttendIQ is a Face Recognition Attendance System designed to automate and streamline the attendance process with precision and ease. By leveraging real-time face detection and recognition technology, AttendIQ eliminates the need for manual roll calls or ID-based check-ins. The system captures facial data during a quick registration process .

csv cv2 kneighborsclassifier numpy os pandas pickle python scikit-learn streamlit time

Last synced: 02 Apr 2026

https://github.com/ghoumbadji/analyzing-customer-churn-for-a-telecom-company

The project involves utilizing various machine learning techniques, both supervised and unsupervised, to detect customer churn and identify the key factors contributing to it.

churn-analysis churn-prediction kaggle machine-learning pandas random-forest-classifier scikit-learn

Last synced: 03 May 2026

https://github.com/pratishtha-abrol/sentimentanalysis

Logistic Regression: A sentiment analysis case study

logistic-regression nltk-python scikit-learn sentiment-analysis

Last synced: 17 May 2026

https://github.com/alsult/wine_classification

This is a wine classification project based on 13 numerical features of wines grown in the same region in Italy but derived from three different cultivars.

logistic-regression machine-learning matplotlib multiclass-classification pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer

Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.

breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm

Last synced: 05 Feb 2026

https://github.com/lijesh010/ml_project_data_preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-cleaning data-exploration data-preprocessing machine-learning numpy pandas-python python scikit-learn

Last synced: 11 Apr 2026

https://github.com/alexsolov28/ml_course

Курс "Технология машинного обучения"

colab-notebooks jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2025

https://github.com/pranavgautam29/flight-price-prediction

The Flight Price Prediction project uses machine learning to forecast flight ticket prices based on historical data. Hosted on Streamlit Community Cloud and deployed via Streamlit, this application allows users to input flight details such as departure and arrival airports, travel dates, and class to receive accurate price predictions.

machine-learning prediction-model regression scikit-learn statistical-machine-learning streamlit

Last synced: 21 Feb 2026

https://github.com/javi-cc/python-ml-portcanto

Portcanto és un projecte de simulació d'un trajecte en bicicleta. S'ha definit 4 tipus de ciclistes que es diferencien en el temps que tarda a fer el trajecte. L'objectiu és descobrir els 4 patrons amb l'algoritme de clustering KMeans.

clustering docker docker-compose kmeans machine-learning mlfow pydoc pylint python scikit-learn testing venv

Last synced: 13 Apr 2026

https://github.com/alisson-t-bucchi/cost-of-living-ai-ml

Cost of living predictor for some world cities, using AI and ML to scrap and predict cost for each selected city.

artificial-intelligence linear-regression machine-learning matplotlib pandas-dataframe python scikit-learn

Last synced: 18 Jun 2025

https://github.com/kkinzzza/meansalaryprediction

This project focuses on predicting the mean salary for job vacancies from HeadHunter.

catboost classic-ml regression salary-prediction scikit-learn

Last synced: 29 Apr 2026

https://github.com/oceanuz/car-price-regression

A comprehensive ML evaluation and improvement notebook for a car price prediction model. It includes topics such as scoring with r2, cross-validation, overfitting/underfitting diagnosis, and polynomial regression. *Ridge regression* is applied to reduce overfitting, and (GridSearchCV) techniques are used to find the best alpha hyperparameter.

cross-validation data-science grid-search hyperparameter-tuning machine-learning machine-learning-models model-evaluation overfitting python regression ridge-regression scikit-learn

Last synced: 11 Dec 2025

https://github.com/szymonrucinski/pippi-lang

Elegant 📑 text preprocessing pipeline 🚰 available as pip package 🐍 based on scikit-learn pipeline. Combines Transformer and Column Transformer into a single object.

data-cleaning data-science nlp pipeline scikit-learn

Last synced: 30 Apr 2026

https://github.com/scikit-learn/pairwise-distances-reductions-asv-suite

A dedicated asv suite for scikit-learn private PairwiseDistancesReductions

asv benchmarks cython scikit-learn

Last synced: 18 Jan 2026

https://github.com/lefteris-souflas/the-algorithmic-approach-to-winning-guess-who

This repository provides a systematic approach to winning the "Guess Who?" game through advanced machine learning techniques. It offers a comprehensive methodology for enhancing gameplay strategy and optimizing decision-making processes with meticulous attention to detail.

decision-tree drawio gradient-boosting graphviz-dot lightgbm machine-learning matplotlib numpy pandas python random-forest scikit-learn

Last synced: 09 Apr 2026

https://github.com/mramshaw/intro-to-ml

Intro to Machine Learning - Pattern Recognition for Fun and Profit

machine-learning matplotlib ml numpy pandas pip pip3 python scikit-learn scipy seaborn seaborn-plots sklearn statsmodels tensorflow weka

Last synced: 11 Apr 2026

https://github.com/ebadshabbir/decision_tree_algorithm

Decision Tree Classifier for Social Network Ads A Python implementation of a Decision Tree Classifier to predict user purchasing behavior based on age and estimated salary. Includes feature scaling, model evaluation (confusion matrix and accuracy), and visualizations of decision boundaries for both training and test sets.

decision-tree-classifier jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/lohiyah/real-estate-price-forecast

A Python-based app predicting real estate prices using machine learning. Built with Pandas, NumPy, Scikit-learn, Matplotlib, and Seaborn for data processing and visualization, and Flask for the web interface.

flask matplotlib numpy pandas python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/pekiiipy/credit-card-fraud-detection

🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.

adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost

Last synced: 11 Apr 2026

https://github.com/akhileshthite/india-population

ML (simple linear regression) model for predicting India's population.

machine-learning numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/prajakta1321/authencheck

Amdocs Gen AI Graduate Hackathon 2024-25- A comprehensive fact-checking and misinformation detection system that leverages cutting-edge AI models and multiple news sources to verify information circulating on social media

api bert-fine-tuning flask-application matplotlib ngrok-server nlp nlp-machine-learning numpy pandas python3 scikit-learn seaborn wandb

Last synced: 05 Apr 2026

https://github.com/bhuvan-s-prasad/streamlit-regression

A machine learning project that predicts housing prices in California using regression techniques. This project includes comprehensive exploratory data analysis, feature engineering, linear regression modeling, and an interactive Streamlit web application for making predictions.

california-housing-price-prediction exploratory-data-analysis linear-regression machine-learning matplotlib numpy pandas python scikit-learn seaborn streamlit supervised-learning

Last synced: 11 Apr 2026

https://github.com/nafis2508/mobile-price-predictor

Machine learning project that classifies mobile phones into price ranges (low, medium, high, very high) based on hardware specifications.

classification data-science eda jupyter-notebook kagle knn logistic-regression machine-learning mobile-price-prediction python scikit-learn xgboost

Last synced: 24 Jun 2026

https://github.com/nicolasvauche/vinylexplore_ml

VinyleXplore est un moteur de recommandation de vinyles intelligent basé sur l'humeur et le contexte d'écoute de l'utilisateur. Il utilise FastAPI pour exposer une API REST et scikit-learn pour entraîner un modèle de Machine Learning qui améliore la pertinence des suggestions.

machine-learning python scikit-learn vinyle

Last synced: 17 May 2026

https://github.com/netcodez/climate-prediction-pipeline

Predicting London's climate using machine learning techniques. This project aims to forecast mean temperature in Celsius (°C) using various regression models and logging experiments with MLflow

huggingface machine-learning mlflow mlflow-tracking mlflow-tracking-server mlops python scikit-learn streamlit

Last synced: 09 Apr 2026

https://github.com/parbhat-cpp/suicidal-ml

A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.

bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/pramodyasahan/model-selection

This repository explores and compares different regression models for predicting continuous outcomes. This repository includes implementations and evaluations of five key regression models. The primary goal is to demonstrate how each model works, evaluate their performance using R-squared values, and guide users in selecting the best model.

machine-learning modelselection numpy pandas python regression scikit-learn

Last synced: 08 Mar 2025