An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/alexsomai/machine-learning-getting-started

Dummy examples and experiments to get started with Machine Learning

artificial-intelligence deep-learning machine-learning python scikit-learn

Last synced: 13 Apr 2026

https://github.com/anudeepjonnada/phishshield-ai

🛡️ PhishShield AI – An intelligent phishing email detector that uses BERT and Machine Learning to identify phishing attempts in real time. Integrated with the Gmail API, powered by Flask, React, and MongoDB for secure full-stack email analysis and threat detection.

bert flask gmail-api mongodb oauth2 python react scikit-learn

Last synced: 13 Apr 2026

https://github.com/felipeclarindo/energy-predict-api

API for making energy predictions.

api api-development api-rest flask pandas pickle python scikit-learn

Last synced: 13 Apr 2026

https://github.com/flysirin/adstextclassification

Classification of advertisements by topic

docker excel flask pandas python pytorch scikit-learn

Last synced: 02 Jan 2026

https://github.com/mirgis/plucky-playground

A modest collection of machine learning and deep learning algorithms, along with examples implemented in diverse toolkits.

bayes bayesian deep-learning examples ipynb keras machine-learning neural-network pandas playground python3 pytorch scikit-learn sklearn statistics tensorflow

Last synced: 13 Apr 2026

https://github.com/srikarveluvali/heart-disease-prediction-ml

This machine learning project aims to predict the presence or absence of heart disease in individuals based on a set of health-related features. By utilizing a dataset containing information about patients, we employ various machine learning techniques and data analysis to build a predictive model.

exploratory-data-analysis machine-learning python scikit-learn

Last synced: 04 May 2026

https://github.com/leabrodyheine/water-pump-status-prediction

This project implements machine learning models to predict the status of water pumps in Tanzania using data from DrivenData's competition. The project includes preprocessing steps, model evaluation using cross-validation, and hyperparameter optimization with Optuna.

argparse cross-validation gradient-boosting-classifier logistic-regression machine-learning multilayer-perceptron numpy optuna pandas random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/devspidr/ml-programs

A collection of foundational machine learning programs covering supervised and unsupervised algorithms, implemented using Python and libraries like scikit-learn, pandas, and matplotlib. Ideal for beginners and students learning core ML concepts through practical coding.

classification machine-learning-algorithms regression scikit-learn supervised-learning unsupervised-learning

Last synced: 30 May 2026

https://github.com/yancotta/anti-aging-epigenetics-ml-app

A thesis MVP for a personalized anti-aging system that analyzes genetic SNPs and lifestyle habits using ML models (Random Forest and Neural Networks) to provide risk assessments and actionable recommendations. Built with FastAPI, React, PostgreSQL, and containerized via Docker for scalability and explainability.

anti-aging bioinformatics docker explainable-ai fastapi genetics healthtech machine-learning mlops personalized-medicine pytorch reactjs scikit-learn synthetic-data thesis-project

Last synced: 16 Sep 2025

https://github.com/aarryasutar/credit_eda

This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.

binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/uea-geral/rna-perceptron-exercise

🤖 Artificial Neural Networks (RNA) course: training a Perceptron neuron.

jupyter-notebook neural-network numpy perceptron python scikit-learn

Last synced: 13 Apr 2026

https://github.com/colinwu0403/heartbpmusic

Music discovery platform that uses machine learning to recommend a song based on your heart's BPM and your mood.

django neurokit2 scikit-learn spotify-web-api vuejs

Last synced: 05 May 2026

https://github.com/singhkunwardeep/twitter_sentiment_analysis

A machine learning project to classify Twitter sentiment into positive or negative categories using Logistic Regression and TF-IDF vectorization. This project involves data preprocessing, feature extraction, model training, and evaluation of the sentiment of tweets. Built with Python, NLTK, and scikit-learn.

logistic-regression nltk-python pandas-dataframe python3 scikit-learn tfidf-vectorizer

Last synced: 05 May 2026

https://github.com/rahimizadeh/prediction-api-with-flask-and-mlflow

An end-to-end machine learning project demonstrating model lifecycle management with MLflow and production deployment using Flask.

flask machine-learning mlflow mlops-workflow python random-forest-regression rest-api scikit-learn

Last synced: 13 Apr 2026

https://github.com/evanmarshall-dev/evanmarshall-tech

Professional IT services platform featuring serverless AWS infrastructure, ML-powered service recommendations, and automated CI/CD deployment. Built to showcase full-stack development, cloud architecture, and machine learning engineering skills.

api-gateway aws ci-cd cloud-computing cloudfront devops full-stack github-actions infrastructure-as-code lambda machine-learning mlops nextjs portfolio python react s3 scikit-learn serverless terraform

Last synced: 13 Apr 2026

https://github.com/srilaasya/handwriting-recognition-using-k-means

Used K-means clustering and scikit-learn to cluster images of handwritten digits.

handwriting-recognition k-means python scikit-learn

Last synced: 13 Apr 2026

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project uses machine learning and regression, a statistical technique for predicting outcomes, to estimate a user's admission eligibility level for the universities they have chosen. The implemented algorithms produce the prediction once the user feeds the application the required information.

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 13 Apr 2026

https://github.com/rohitpawar001/bone_marrow_surival_prediction

Bone marrow transplants can be life-saving, but predicting patient survival is complex. In this project, I used machine learning to analyze key medical factors and improve survival predictions. I also implemented CI/CD pipelines, used MLflow for model tracking, and deployed the model on an AWS EC2 instance.

aws docker ec2-instance flask machine-learning mlflow python scikit-learn

Last synced: 08 Apr 2026

https://github.com/gokulgowthams/smart-premium

An interactive Streamlit application that predicts the required premium amount for a default loan from the user's answers to a series of questions.

data-preprocessing feature-engineering git github mlflow model-deployment numpy pandas python scikit-learn streamlit xgboost

Last synced: 11 Apr 2026

https://github.com/samarthmule/chatbot

This project implements a generic chatbot using Natural Language Processing (NLP) and Machine Learning techniques. The chatbot is designed to classify user input into predefined intents and provide context-aware responses. The solution is scalable, interactive, and suitable for various domains.

chatbot internship machine-learning machine-learning-algorithms nlp nltk project-repository python python3 scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/strcoder4007/machine-learning-deep-learning-practice

Implementation of Linear/Logistic Regression, K-NN, SVM, Clustering, K-Means, ConvNet, ResNet, MobileNet, RNN, LSTM, etc. using pandas, scikit-learn, NumPy, and TensorFlow 2

convolutional-neural-networks matplotlib scikit-learn tensorflow2

Last synced: 15 May 2026

https://github.com/imswappy/brain-tumor-detection

🧠 Deep learning project for brain tumor classification using MRI images. Built with transfer learning (VGG16 + fine-tuning), TensorFlow/Keras, and deployed via Streamlit. Dataset & model loaded dynamically from KaggleHub. Includes training notebook, evaluation, and interactive web app.

kagglehub keras numpy pandas scikit-learn streamlit tensorflow vgg16-model

Last synced: 13 Apr 2026

https://github.com/supriya811106/healthcare-recommedation-system

A Flask-based web app that predicts diseases based on symptoms and recommends specialized doctors. It uses machine learning for accurate health predictions and location-based doctor searches.

css flask-application healthcare-application html javascript machine-learning numpy pandas recommendation-system scikit-learn

Last synced: 04 Mar 2026

https://github.com/takkii/pylean

Data analysis ( 🐍 💎 📈 )

analayze matplotlib numpy pandas python scikit-learn

Last synced: 09 Sep 2025

https://github.com/edisedis777/pyspark-ml-features

A PySpark implementation of 6 lesser-known Scikit-Learn features optimized for Azure Databricks. This project translates powerful machine learning techniques from Scikit-Learn into PySpark's distributed computing framework.

azure databricks databricks-notebooks large-scale machine-learning pyspark python scikit-learn scikitlearn-machine-learning

Last synced: 13 Apr 2026

https://github.com/lordmitrii/win-prediction-django

A web application built on the Django framework. It predicts the winning team based on given sets of Dota 2 heroes.

django dota2 jupyter-notebook machine-learning python scikit-learn web

Last synced: 13 Apr 2026

https://github.com/abdullah321umar/internee.pk-dataanalytics_internship-assignment4

🌟 Fraud Detection in Application 🌟 Through Isolation Forest and K-Means Clustering, the project detects suspicious patterns like inconsistent income, duplicate entries, and unrealistic employment data. This end-to-end workflow transforms raw data into actionable fraud insights — enhancing trust and accuracy.

anomaly-detection csv-handling data-cleaning data-exporting data-import data-normalization exploratory-data-analysis export interpretation matplotlib model-evaluation pandas pca python reporting scaling scikit-learn seaborn

Last synced: 06 May 2026

https://github.com/finite-sample/stagecoachml

Build two-stage models when your features arrive in two batches at different times.

machine-learning scikit-learn two-stage-models

Last synced: 14 Jan 2026

https://github.com/adrien-1997/bike-forecast-paris-velib

Bike-sharing demand forecasting in Paris (Vélib’). A data science and machine learning project leveraging open urban mobility data to predict bike availability, analyze time series usage patterns, and provide interactive dashboards for visualization.

bike-sharing dashboard data-science duckdb forecasting machine-learning matplotlib open-data pandas paris predictive-modeling python scikit-learn streamlit transportation urban-mobility velib

Last synced: 11 Apr 2026

https://github.com/uhstray-io/pyrizon

Data Collection, Analysis, Mapping, Pipelining & Transformation, & API using Python

api data-engineering etl numpy pandas plotly python pytorch raw-data scikit-learn seaborne sql sqlite tensorflow

Last synced: 09 Apr 2026

https://github.com/varun-khorgade/churnshield-customer-retention-predictor

Built an ML-based classification model to predict customer churn. Applied data preprocessing, feature engineering, and ensemble algorithms to improve prediction accuracy and help businesses implement retention strategies.

classification-algorithm datapreprocessing f1-score feature-engineering hyperparameter-tuning logistic-regression matplotlib model-evaluation numpy pandas python ran roc-auc scikit-learn seaborn xgboost

Last synced: 07 May 2026

https://github.com/elifirinci/mushrooms-plants-classification

This project features AI models for identifying mushrooms and plants as poisonous or edible using image-based predictions. Both models are tested through an interactive Gradio interface, ensuring user-friendly and accurate identification for foragers and researchers.

classification cnn cnn-classification gradio image-classification machine-learning mushroom-classification plant-classification scikit-learn

Last synced: 17 May 2026

https://github.com/rakibhhridoy/easywaydiveinto-datascience

Data science is not as easy as it seems at first. The biggest problems new learners face are not knowing which resources exist and confusion about how to use them. I hope this repository will help confused learners.

algorithms algorithms-implemented bayesian-statistics data-science deep-learning deep-neural-networks linear-algebra machine-learning matplotlib multivariate-calculus numpy optimization pandas python scikit-learn scipy seaborn statistics statsmodels tensorflow

Last synced: 06 Apr 2026

https://github.com/fohlen/stats-experiment

A tiny stats experiment with GENESIS data

matplotlib python3 scikit-learn

Last synced: 17 May 2026

https://github.com/somjit101/human-activity-recognition

This project builds a model that predicts human activities such as Walking, Walking Upstairs, Walking Downstairs, Sitting, Standing, or Laying, using readings from the sensors of a smartphone carried by the user.

decision-tree-classifier eda feature-engineering gradient-boosting-classifier grid-search human-activity-recognition keras logistic-regression lstm random-forest-classifier rbf-kernel scikit-learn seaborn-plots signal-processing support-vector-classifier support-vector-machine t-sne tensorflow uci-har-dataset uci-machine-learning

Last synced: 23 Feb 2026

https://github.com/udityamerit/curafind-powered-by-ai

CuraFind AI is a web-based application leveraging Natural Language Processing (NLP) to intelligently recommend medicines. Users can search using symptoms, medicine names, or free-text descriptions, and receive suggestions along with brand substitutes for drugs

ai machine-learning nlp numpy pandas scikit-learn

Last synced: 18 Sep 2025

https://github.com/evangks/hierarchical-clustering-mall-customers

A comprehensive machine learning project demonstrating hierarchical clustering for customer segmentation on the Mall Customers dataset. Includes EDA, preprocessing, multiple linkage/distance comparisons, and professional visualizations.

clustering data-science hierarchical-clustering jupyter-notebook machine-learning mall-customers portfolio-project python scikit-learn unsupervised-learning

Last synced: 07 Mar 2026

https://github.com/soroush-04/incrementalsvm-road-accident-prediction

Enhance SVM and incremental SVM machine learning models for road accident severity prediction

incremental-learning machine-learning python scikit-learn svm

Last synced: 09 Apr 2026

https://github.com/tasninanika/heart-disease-analysis

The Heart Disease Analysis project is a comprehensive machine learning study aimed at predicting the presence of heart disease using the Heart Disease UCI Dataset.

knn logistic-regression matplotlib numpy pandas python3 random-forest scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/artikumari28/movie-recommender-system

This project is a content-based movie recommendation system, where movies are recommended based on their similarity in content. The system analyzes various features such as genres, cast, and descriptions to suggest similar movies.

google-colab machine-learning nltk numpy pandas pickle scikit-learn streamlit

Last synced: 06 Apr 2026

https://github.com/armanjscript/fusion-rag

A powerful web-based application designed to answer questions based on the content of uploaded PDF documents. This project leverages the Fusion-in-Decoder (FiD) approach for Retrieval-Augmented Generation (RAG), combining semantic similarity, technical term relevance, and recency to deliver accurate and contextually relevant responses.

chroma chromadb fusion-rag langchain langchain-ollama ollama pypdf qwen2-5 rag rag-chatbot scikit-learn streamlit tf-idf-score tf-idf-vectorizer vector-database

Last synced: 10 Apr 2026

https://github.com/andystmc/nextflownyc

Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting

Last synced: 07 Apr 2026

https://github.com/jai0212/cash-app-bias-busters

A platform developed with Cash App to help ML engineers detect and visualize biases in models using Fairlearn. Features include a collaborative and interactive dashboard (React, Chart.js), a Flask backend, and a secure MySQL database for data storage and analysis.

bias-detection chartjs fairlearn flask machine-learning mysql numpy pandas pytest python react scikit-learn scipy

Last synced: 16 Feb 2026

https://github.com/tasninanika/coded_data_prediction-knn

K-Nearest Neighbors (KNN) is a supervised machine learning algorithm

knn pandas python3 scikit-learn

Last synced: 07 Apr 2026

https://github.com/guoshijiang/scikit-learn

Let's learn scikit-learn together

nlp-machine-learning scikit-learn

Last synced: 14 Sep 2025

https://github.com/f-aguzzi/ChemFuseKit

Chemometrics library for data fusion, model training and prediction of data from multiple sensor sources.

chemometrics datafusion knn lda pca plsda scikit-learn svm

Last synced: 21 Sep 2025

https://github.com/evangks/k-means-clustering-synthetic-dataset

Customer Segmentation using K-Means Clustering: A complete machine learning workflow for segmenting customers based on synthetic demographic and spending data, with visualizations, evaluation metrics, and reproducible Jupyter notebook.

clustering customer-segmentation data-science jupyter-notebook k-means-clustering machine-learning portfolio-project python27 scikit-learn unsupervised-learning

Last synced: 10 Mar 2026

https://github.com/viveksapkal2793/advertisement-response-analysis

This project analyzes advertisement responses using a Django backend and a Vite+React frontend. It includes scripts to load, clean, and transform data, which are executed within Docker containers. Data is stored in a MongoDB database, and the project can be run with or without Docker by adjusting the MongoDB connection strings.

advertisement advertisement-analysis container-image containerization django docker machine-learning mongodb react scikit-learn vite

Last synced: 23 Sep 2025

https://github.com/catlikeflyer/rsp-recognition

A computer vision project to recognize thumbs up

machine-learning mediapipe-hands python scikit-learn

Last synced: 16 May 2026

https://github.com/rexsimiloluwah/fastapi-ml-apps

Machine learning apps built with FastAPI

docker fastapi machine-learning python scikit-learn tensorflow

Last synced: 05 Apr 2026

https://github.com/tlapanco/knn-project

Project for the Intelligent Systems course, making use of KNN oversampling.

jupyter-notebook knn pandas python scikit-learn smote

Last synced: 09 Apr 2026

https://github.com/gokulgowthams/clickstream-customer-conversion

Analyzes clickstream data from an e-commerce platform to predict customer conversions, estimate potential revenue, and segment users for personalized marketing strategies. By leveraging machine learning techniques, the project enhances decision-making for businesses seeking to optimize user engagement and sales.

data-preprocessing feature-engineering machine-learning matplotlib model-deployment numpy pandas pipeline python scikit-learn seaborn streamlit-web-application tensorflow xgboost

Last synced: 07 Apr 2026

https://github.com/gokularaman-c/ev-charging-log-anomaly-detection

EV charging log anomaly detection using Isolation Forest, engineered telemetry features, and a CLI inference pipeline.

anomaly-detection ev-charging feature-engineering isolation-forest machine-learning mlops python scikit-learn time-series

Last synced: 23 May 2026

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 09 Apr 2026

https://github.com/jersongb22/computervision

Links to my repositories with a wide variety of Computer Vision models using CNNs, Transfer Learning, and Vision Transformer with TensorFlow, PyTorch, Hugging Face and Ultralytics.

cnn computer-vision convnextv2 efficientnetv2 hugging-face image-captioning image-classification image-segmentation lenet-5 object-detection opencv plotly python pytorch scikit-learn tensorflow ultralytics video-classification vision-transformer yolo11

Last synced: 12 Apr 2026

https://github.com/upul/chocolate-quality-analysis

This repository contains a Jupyter notebook that describes how to use basic machine learning tools such as scikit-learn, pandas, and NumPy for building models.

machine-learning numpy pandas predictive-analytics scikit-learn

Last synced: 04 May 2026

https://github.com/gperdrizet/ensembleset

Ensemble dataset generator for tabular data prediction and modeling projects.

classification ensemble feature-engineering machine-learning regression scikit-learn

Last synced: 07 Mar 2026

https://github.com/shreeparab1890/movie-recommender-system

This notebook builds a model that recommends movies based on a given movie and genre. It uses popularity-based, content-based, and collaborative-filtering-based recommendation.

bag-of-words cosine-similarity matplotlib numpy pandas python scikit-learn sklearn vectorization

Last synced: 09 Apr 2026

https://github.com/vimal0156/ruaroa-ai

🧙‍♂️ Zero-Code Machine Learning Wizard - Transform ideas into intelligent solutions without writing code. AI-powered ML pipeline automation with interactive web interface.

ai-agents ai-assistant artificial-intelligence automated-machine-learning code-generation data-analysis data-science deep-learning jupyter machine-learning machine-learning-pipeline neural-networks no-code openai python scikit-learn streamlit visualization

Last synced: 09 Apr 2026

https://github.com/mariamabidi/pinn-based-flow-prediction

This repository contains code and experiments for predicting 3D aerodynamic flow around car geometries using Physics-Informed Neural Networks (PINNs) and for analyzing flow features via autoencoder-based clustering.

computer-vision machine-learning neural-network numpy pytorch pyvista scikit-learn

Last synced: 05 Aug 2025

https://github.com/veb-101/machine-learning-practice

Contains code exercises from the Hands-On scikit-learn and TensorFlow book

deep-learning keras machine-learning python3 scikit-learn tensorflow-gpu

Last synced: 19 Apr 2026

https://github.com/aymen016/film-recommendation-engine

A machine learning-powered movie recommender system designed to provide personalized recommendations based on user preferences and data analysis. This project includes a backend recommendation engine, a Streamlit-based interface, and a web-based frontend for an enhanced user experience.

flask numpy pandas pickle python scikit-learn streamlit

Last synced: 09 Apr 2026

https://github.com/vedanty3/heart-disease-prediction

This project aims to build a machine learning model using K-Nearest Neighbors, LogisticRegression, and RandomForestClassifier to classify whether or not a person has heart disease based on their medical attributes (accuracy achieved: 88.52%).

confusion-matrix correlation-matrices jupyter-notebook knn-classification logistic-regression machine-learning matplotlib numpy pandas python random-forest randomforestclassifier roccurve scikit-learn sklearn zerotomastery

Last synced: 09 Apr 2026

https://github.com/malleswarigelli/real_estate_house_price_prediction

Builds an end-to-end ML regression pipeline for predicting housing prices and deploys a Flask app to the Heroku cloud platform with Docker, using GitHub Actions for CI/CD.

ci-cd-pipeline docker heroku-deployment machine-learning mlops mongodb python scikit-learn

Last synced: 09 Apr 2026

https://github.com/vidhi1290/text-classification-model-with-attention-mechanism-nlp

This Python project utilizes PyTorch to perform text classification with an attention mechanism. Pre-trained GloVe embeddings are processed for word representation, and a custom attention model is trained on consumer complaint data to categorize complaints into product categories.🎯

attention-mechanism deeplearning machine-learning nlp nltk numpy pandas python pytorch scikit-learn text-classification tqdm

Last synced: 06 Apr 2026

https://github.com/dustinmichels/bayesian-values-guesser

Uses some user input, data from the World Values Survey <www.worldvaluessurvey.org>, and Bayes' rule to guess a number of beliefs the user might have. STATUS: In progress.

bayes-rule bayesian-values-guesser naive-bayes-classifier pandas python scikit-learn values-survey

Last synced: 09 Apr 2026

https://github.com/gaurav9364/credit-card-fraud-detection

Credit Card Fraud Detection using Machine Learning – A classification project that detects fraudulent credit card transactions using supervised learning, with data preprocessing, handling class imbalance, and model evaluation (ROC-AUC, Precision, Recall, F1-score).

googlecolab imbalanced-learn matplotlib numpy pandas python scikit-learn seaborn xgboost

Last synced: 08 Apr 2026

https://github.com/PFS-AI/PFS

The AI-powered desktop tool for finding, classifying, and understanding your files. Search by keyword, ask questions, and get insights from your scattered files instantly.

ai cross-platform data-science document-classification fastapi file-management file-organizer file-search huggingface-transformers knowledge-management langchain machine-learning productivity-tools rag scikit-learn search-engine semantic-search vector-search

Last synced: 30 Dec 2025

https://github.com/aaa1928/iris-ml-classifier

PyTorch model that classifies Iris species based on characteristics about the length and width of sepals and petals.

deep-learning iris-classification iris-dataset machine-learning neural-network numpy pandas python pytorch scikit-learn

Last synced: 05 Apr 2026

https://github.com/uhstray-io/pystockbot

Platform & exchange agnostic Stock, Crypto, and Asset automated Machine Learning & AI Trading Bot

automation docker machine-learning python scikit-learn statistical-analysis trading-algorithms

Last synced: 13 Aug 2025

https://github.com/yvesemmanuel/machine_learning

Solutions to data problems, implemented with machine learning algorithms.

data-science keras keras-tensorflow linear-algebra machine-learning neural-network python scikit-learn

Last synced: 09 Apr 2026

https://github.com/rizz1406/spam-email-detector

Spam Email Classifier using Python and Streamlit. A simple machine learning project that classifies emails as spam or ham using the Naive Bayes algorithm and TF-IDF for text feature extraction. The project includes a user-friendly web app built with Streamlit.

nlp pandas pytho3 scikit-learn streamlit

Last synced: 09 Apr 2026

https://github.com/nekruzash/regression-correlation

From the CS2023 AI/DS/ML class: trains a model on different categories of data and uses linear regression to identify the feature with the greatest effect on housing prices.

jupyter-notebook python scikit-learn

Last synced: 04 May 2026

https://github.com/mhmudfzli/exploring-mental-health-data

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

catboost lightgbm matplotlib numpy pandas scikit-learn seaborn xgboost

Last synced: 09 Apr 2026