Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/billy0402/python-machine-learning

A learning project from NTUB machine learning course.

ai course jupyter-notebook python scikit-learn tensorflow

Last synced: 14 Jan 2025

https://github.com/hmasdev/sklearn-protocols

sklearn-protocols: protocols for sklearn regressors, classifiers and transformers

protocol python python3 scikit-learn

Last synced: 29 Dec 2024

https://github.com/khaymanii/big_mart_prediction_model

This model was built using Python and Logistic Regression Algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/khaymanii/diabetes_prediction_model

This is a Machine learning model built using Python

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/kirtipratihar/python_libraries_for_ds

This repository serves as a comprehensive guide to Python programming for Data Science. It covers essential topics like data manipulation, data visualization, machine learning, and statistical analysis using popular libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn tensorflow

Last synced: 23 Jan 2025

https://github.com/lingumd/neural_network_charity_analysis

Machine learning and neural networks used to create a binary classifier capable of predicting whether applicants will be successful if funded by Alphabet Soup.

deep-learning machine-learning matplotlib-pyplot neural-networks onehotencoder pandas scikit-learn seaborn standardscaler tensorflow

Last synced: 23 Jan 2025

https://github.com/sahiltiwariiii/dssp

Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 31 Dec 2024

https://github.com/lingumd/cryptocurrencies

Unsupervised machine learning models used to group the cryptocurrencies to help prepare for a new investment.

concatenate elbow-curves get-dummies hvplot jupyterlab kmeans matplotlib-pyplot minmaxscaler pandas path pca-analysis plotly-express scikit-learn unsupervised-machine-learning

Last synced: 23 Jan 2025

https://github.com/khaymanii/calories-burnt-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/lilivalgo/machine-learning-projects

This repository hosts the machine learning project developed during my learning journey. It showcases my progress and the skills acquired in the field of machine learning

lag-feature linear-regression ml-models scikit-learn scipy-stats seaborn-plots

Last synced: 02 Feb 2025

https://github.com/khaymanii/house-price-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/shubhamkarampure/asl-streamlit-signlingo

streamlit based web-app for teaching sign language through real-time hand gesture recognition.

learning-exercise mediapipe opencv-python python scikit-learn sign-language streamlit-webapp

Last synced: 21 Jan 2025

https://github.com/daniel-furman/RecFeatureSelect

Feature selection functions (1) using the multi-collinearity matrix and recursively proceeding to a spearman threshold and (2) using Forward Stepwise Selection running on an ensemble sklearner (with options for HPO).

correlation-threshold machine-learning modeling multicollinearity recursion recursive-algorithm scikit-learn spearman-rho

Last synced: 20 Nov 2024

https://github.com/khaymanii/rock-or-mine-detection_model

This is a Machine learning model built using Python to detect between rock and mine

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/khaymanii/parkinsons-disease-detection-model

This model was built with Python and Support Vector Machine Algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/vietanh152/astrophysical-objects-classification

Project applies machine learning techniques to classify astrophysical objects using observational data from the Large Synoptic Survey Telescope (LSST).

adaboost adaptive-boosting-algorithm down-sampling gradient-boosting keras machine-learning neural-network python scikit-learn static-analysis supervised-learning support-vector-machines time-series time-series-analysis

Last synced: 09 Feb 2025

https://github.com/marconicivitavecchia/stazione-monitoraggio-ambientale

Codice in MicroPython per ESP32 per il corso tenuto dalla nostra scuola rivolto ai docenti sulla creazione di una stazione di monitoraggio ambientale che copre gli argomenti di Python, IoT ed Intelligenza Artificiale.

ai esp32 micropython micropython-esp32 python school-project scikit-learn

Last synced: 22 Dec 2024

https://github.com/spignelon/letter-recognition_project-ml

A machine learning project for letter recognition using SVM, KNN, and Decision Tree, Random Forest, and Naive Bayes algorithms. Includes data preprocessing, model training, evaluation, and visualization, and research report.

decision-tree-classifier knn-classifier letter-recognition machine-learning machinelearning naive-bayes-classifier random-forest-classifier research research-paper research-project scikit-learn svm-classifier uci-dataset uci-machine-learning

Last synced: 18 Jan 2025

https://github.com/khaymanii/titanic_survival_prediction_-model

This Model was built using Python and Logistic Regression algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/elprofesoriqo/chrome-extension-gmail-spam-filter

Chrome extension that automatically identifies and moves emails marked as spam to the spam folder in Gmail.

api-client chrome-extension firebase-database javascript machine-learning python scikit-learn

Last synced: 09 Jan 2025

https://github.com/srilaasya/handwriting-recognition-using-k-means

Used K-means clustering and scikit-learn to cluster images of handwritten digits.

handwriting-recognition k-means python scikit-learn

Last synced: 25 Jan 2025

https://github.com/srilaasya/breast-cancer-classifier

Used several Python libraries to make a K-Nearest Neighbor classifier that is trained to predict whether a patient has breast cancer

knearest-neighbor-classifier python scikit-learn

Last synced: 25 Jan 2025

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 10 Feb 2025

https://github.com/bartventer/ann-loan-default-profiler

Artificial Neural Network built with Tesorflow to predict whether new customer application will repay or default on a new loan.

artificial-neural-network data-visualisation deep-learning exploratory-data-analysis numpy pandas pipelines scikit-learn tensorflow2

Last synced: 24 Dec 2024

https://github.com/minhtran241/ml-dl-llm-genai

Showcasing ML/DL fundamentals, paper implementations, deep learning models, and other projects. The purpose of this repository is to provide a playground for me to explore and learn about PyTorch, deep learning, and generative AI.

deep-learning generative-ai llm machine-learning paper-implementations pytorch scikit-learn

Last synced: 02 Feb 2025

https://github.com/bartventer/survival-predictor

A simple machine learning application developed with Django, AWS, Gunicorn and Nginx

aws django ec2 gunicorn linux nginx scikit-learn

Last synced: 24 Dec 2024

https://github.com/lemma-osu/sklearn-raster

Spatial raster prediction with scikit-learn estimators

raster scikit-learn xarray

Last synced: 16 Jan 2025

https://github.com/tnleite/loan-approval-prediction

Este repositório apresenta um modelo preditivo de aprovação de empréstimos, focado em minimizar o risco de inadimplência. Utilizando EDA e algoritmos de machine learning (Random Forest, XGBoost), ajustamos o threshold para maximizar o recall de inadimplentes, contribuindo para uma gestão de riscos eficiente.

classification-algorithm data-science exploratory-data-analysis machine-learning-algorithms machine-learning-models matplotlib numpy scikit-learn scipy seaborn xgboost-classifier

Last synced: 29 Jan 2025

https://github.com/ourahma/python-data-viz

This project focuses on data analysis and visualization to estimate used car prices. It includes preprocessing, statistical analysis, regression modeling, and visualizations to provide insights into the car pricing market.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/radoslawregula/binary-classification-metrics

A model implementing a solution to the binary classification problem along with several accuracy metrics.

binary-classification classification jupyter-notebook machine-learning matplotlib pandas python scikit-learn stochastic-gradient-descent

Last synced: 21 Jan 2025

https://github.com/radoslawregula/geo-music-classification

Jupyter notebook implementing a classification solution to the geographical origins of music problem.

classification jupyter-notebook machine-learning pandas python random-forest-classifier scikit-learn

Last synced: 21 Jan 2025

https://github.com/nicolascoiado/nivel-mar

Este projeto realiza uma análise detalhada do nível médio global do mar (GMSL), utilizando uma base de dados pública que abrange medições históricas. O objetivo é explorar tendências, calcular a taxa média de elevação e visualizar os dados por meio de gráficos.

google-colab jupyter-notebook matplotlib numpy pandas python python3 scikit-learn

Last synced: 18 Jan 2025

https://github.com/jt2m0l3y/quantified-self

The final project for an introduction to data science, this project is a practice in supervised machine learning with kNN and Decision Tree Classifiers.

data-visualization decision-trees eda jupyter-notebooks knn-classification latex markdown numpy pandas python scikit-learn statistical-analysis supervised-learning

Last synced: 27 Dec 2024

https://github.com/kumailn/machinelearning

Machine learning with Python

machine-learning python scikit-learn tensorflow

Last synced: 02 Jan 2025

https://github.com/archie-cm/churn-analysis-ecommerce-customer

The objective of this project to is to predict customer churn, loss opportunity and provide recommendations to the business team so the company can implement a customer persona in retention strategy and can monitoring throught dashboard interactive.

data-science feature-engineering machine-learning python scikit-learn

Last synced: 20 Jan 2025

https://github.com/lefteris-souflas/modern-slavery-analysis

Jupyter notebook using machine learning techniques to explore the complex drivers of modern slavery. Models from a research paper are replicated and evaluated . Actions also include filling missing data, training regression models, and analyzing feature importance.

decision-tree feature-importance grid-search-cv imputation jupyter-notebook lasso-regression linear-regression matplotlib mean-absolute-error numpy pandas preprocessing principal-component-analysis python3 random-forest ridge-regression scikit-learn seaborn

Last synced: 12 Jan 2025

https://github.com/petrosdemetrakopoulos/flight-passengers-prediction

A supervised learning problem given as a project in the "Data Mining in Databases and World Wide Web" course in Computer Science Department of AUEB in Winter semester of 2019.

classification classifier data-science machine-learning python scikit-learn sklearn university-project

Last synced: 29 Jan 2025

https://github.com/serdaraydem1r/10dayaichallenge101

In the 10-day camp, we experienced the basics of machine learning by coding

artificial-intelligence machine-learning-algorithms model-evaluation-and-selection scikit-learn

Last synced: 22 Dec 2024

https://github.com/hayatiyrtgl/topic_modeling_sklearn_lda

Python code: Uses LDA for topic modeling on text data.

lda lda-model machine-learning nlp python scikit-learn sklearn topic-modeling

Last synced: 22 Dec 2024

https://github.com/afkewolczyk/data_science_bootcamp

A data science project to learn data science essentials such as: pandas, Matplotlib, Scikit learn

ai data-science machine-learning pandas scikit-learn

Last synced: 10 Feb 2025

https://github.com/wesleybertipaglia/data-science-small-business

Undergraduate Data Science project for small businesses, covering data-driven solutions for optimizing operations, increasing efficiency, and driving growth.

business-intelligence data-science machine-learning matplotlib pandas python scikit-learn

Last synced: 18 Jan 2025

https://github.com/karimosman89/energy-consumption-forecasting

Predict future energy consumption based on historical data.Create a model that predicts energy consumption in households or businesses to optimize energy distribution and reduce costs.Assist energy companies in planning and managing supply efficiently.

arima lstm matplotlib pandas python scikit-learn

Last synced: 13 Feb 2025

https://github.com/alchemine/diabetes-prediction

Diabetes Prediction and Analysis (NHIS-2018)

eda jupyter python scikit-learn streamlit

Last synced: 16 Jan 2025

https://github.com/thd-ai-2023/ain-b-3-assistant-systems

Assistance Systems Project is a web application that provides personalized health recommendations and data analysis. The frontend is built using Streamlit, and it integrates a chatbot developed with Rasa. Machine learning models created with scikit-learn are used to generate recommendations based on user input. The application is containerized with

chatbot data-analysis docker docker-compose machine-learning python rasa scikit-learn sklearn streamlit

Last synced: 29 Jan 2025

https://github.com/apal21/tensorflow-pima-indians-dataset-classification

Pima Indians Dataset classification using Tensorflow Linear Classifier and DNN Classifier.

classification deep-neural-networks kaggle linear-classifier pandas pima-indians-dataset scikit-learn tensorflow

Last synced: 31 Dec 2024

https://github.com/pb319/california_house-price-prediction

This is going to be my first end to end ML project implementation covering all required stages taking guidence from book called "Hands On Machine Learning".

evaluation-metrics hyperparameter-tuning jupyter-notebook kfold-cross-validation machine-learning matplotlib numpy pandas python scikit-learn seaborn train-test-split

Last synced: 21 Jan 2025

https://github.com/akapich/clustermatic

Python AutoML library for clustering tasks

automl clustering machine-learning scikit-learn

Last synced: 28 Dec 2024

https://github.com/anastasiaschmidt1/sqli-detection-ml

UNI-PROJEKT: Erkennung von SQL-Injection-Angriffen durch maschinelles Lernen (SVM-Modell)

bht-berlin machine-learning scikit-learn sqli svm

Last synced: 28 Dec 2024

https://github.com/moanassiddiqui/handsonml_ml

This is the complete part I of the Hands-On Machine Learning book which was about the classical machine learning models.

hands-on machine-learning scikit-learn

Last synced: 18 Jan 2025

https://github.com/nickklos10/seriea_machine_learning_predictions_2025

This project involves scraping data, processing the data, and building machine learning models to predict the standings for the 2024-2025 Serie-A season.

beatifulsoup data-scraping keras matplotlib pandas scikit-learn shap tensorflow

Last synced: 29 Jan 2025

https://github.com/suvasish114/house-price-estimation

A machine learning model that estimate housing prices in California using the California census data

jupyter-notebook machine-learning python scikit-learn

Last synced: 29 Jan 2025

https://github.com/subratamondal1/machine-learning

Machine Learning Notes with tools like Numpy, Pandas, Scikit-Learn.

machine-learning numpy pandas scikit-learn

Last synced: 11 Jan 2025

https://github.com/zsailer/skspline

A Scikit-learn interface on Scipy's spline.

scikit-learn scipy

Last synced: 22 Dec 2024

https://github.com/sarincr/training-on-artificial-intelligence

Entree Academy 10 Days free training on Artificial Intelligence. Course will be conducted in a Blended learning way with Daily one hour online training and 3 hour project based training

artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn

Last synced: 21 Jan 2025

https://github.com/prakashjha1/customer-segmentation

This repository contains a customer segmentation project implemented in a Jupyter Notebook using Python. Customer segmentation is a crucial strategy for businesses aiming to understand their customer base better, enabling targeted marketing strategies and personalized customer experiences.

clustering-algorithm customer-segmentation kmeans-clustering matplotlib python scikit-learn seaborn

Last synced: 16 Jan 2025

https://github.com/junya737/weighted-pls-regression

A Python implementation of Weighted Partial Least Squares Regression with support for sample weights.

machine-learning partial-least-squares-regression scikit-learn

Last synced: 28 Dec 2024

https://github.com/virajbhutada/article-clustered-recommendation-system-ml

This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.

anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy

Last synced: 15 Oct 2024

https://github.com/andriamahay11master/machinelearning

This project is created to practice supervised ML using the libraries existing in Python

linear-regression multiple-linear-regression python scikit-learn scikitlearn-machine-learning

Last synced: 22 Jan 2025

https://github.com/atharvapathak/size_estimator_project

This project in Python aims to provide a tool for estimating the size of objects in images or videos. Using computer vision techniques, the project analyzes the input media, detects objects of interest, and provides an estimation of their size based on known reference points or objects.

cicd cnn opencv python pytorch rnn scikit-learn sql tensorflow

Last synced: 15 Jan 2025

https://github.com/jayadavv/dynamic-ml-model-selector

An interactive web application that allows users to upload their datasets and dynamically select, train, and evaluate various machine learning models. The app provides comprehensive performance metrics and visualizations, making it easy for users to analyze their data effectively.

decision-trees linear-regression logistic-regression matplotlib-pyplot plotly python random-forest scikit-learn streamlit

Last synced: 10 Feb 2025

https://github.com/lexxai/goit_python_ds_hw_03

Модуль 3. Класичне машинне навчання. Перенавчання. Лінійна регресія. LaTeX формули.

latex linear-regression matplotlib numpy pandas python scikit-learn

Last synced: 24 Jan 2025

https://github.com/lexxai/goit_python_ds_hw_05

Метод опорних векторів -Support Vector Machine, SVM. Дерева рішень - RandomForestClassifier, XGBClassifier

classification numpy pandas python random-forest-classifier reporting scikit-learn svm-classifier svm-model xgbclassifier

Last synced: 24 Jan 2025

https://github.com/mecha-aima/fake-bills-detection

This Python project implements a simple classification model comparison using scikit-learn to classify banknotes as either "Authentic" or "Counterfeit" based on four features

classification-model machine-learning model-selection scikit-learn

Last synced: 22 Jan 2025

https://github.com/achronus/data-exploration

A repository dedicated to interesting data exploration projects I've completed

data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 22 Jan 2025

https://github.com/gayathri2200/car-price-prediction---machine-learning

Car price prediction Machine Learning --- Which is used to predict the price of used cars based on the features.

data-science machine-learning modeldeployment pandas price-prediction python regression scikit-learn streamlit visual-studio visualization

Last synced: 10 Feb 2025

https://github.com/jmromer/warbler

A recommendation engine powered by scikit-learn and served by Flask.

flask flask-api numpy pandas python3 scikit-learn

Last synced: 13 Jan 2025

https://github.com/sreekar0101/-movie-recommendation-system-using-python

The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits.ultilizes python libraries numpy pandas and scikit learn.The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice

data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python

Last synced: 22 Jan 2025

https://github.com/filsan95/project-iot_malware_identification

This repository contains the code and data for a project that detects malware from IoT devices using a publish-subscribe model with Confluent and Databricks. The project streams IoT device data to Kafka, analyzes it, and detects malware using machine learning models such as Random Forest and Gradient Boosted Trees.

apache-kafka classification confluent databricks machine-learning-algorithms scikit-learn sql

Last synced: 22 Jan 2025

https://github.com/gamowy/systemysi

Zadania z systemów sztucznej inteligencji

artificial-intelligence python scikit-learn university-coursework

Last synced: 25 Jan 2025

https://github.com/lijesh010/ml_project_data_preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-cleaning data-exploration data-preprocessing machine-learning numpy pandas-python python scikit-learn

Last synced: 03 Feb 2025

https://github.com/venky-1710/superhero-recruitment

Superhero Recruitment System predicts hero selection using machine learning. Users input hero attributes through a web interface. A Random Forest model analyzes abilities, strengths, weaknesses, success rates, and missions completed. The Flask app displays results, showing if a hero is selected.

css flask html numpy pandas python scikit-learn

Last synced: 20 Jan 2025

https://github.com/kosmoswerner/scikitlearn.net

C# bindings for Scikit-Learn (WIP 🏗)

machine-learning scikit scikit-learn sklearn

Last synced: 15 Nov 2024

https://github.com/lakshitalearning/codsoft

Machine Learning Projects - CODSOFT Internship: This repository showcases my machine learning projects completed during my internship at Codsoft. It demonstrates my skills in developing innovative solutions using various ML techniques and tools.

churn-prediction codsoft codsoftinternship deep-learning handwritten-text-recognition internship-project keras machine-learning python rnn-tensorflow scikit-learn spam-detection

Last synced: 13 Feb 2025

https://github.com/vatshayan/pokemon-analysis

Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning

artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn

Last synced: 15 Jan 2025

https://github.com/mpolinowski/fisher-discriminant-analysis

LDA is a widely used dimensionality reduction technique built on Fisher’s linear discriminant.

linear-discriminant-analysis matplotlib-pyplot python scikit-learn

Last synced: 28 Jan 2025

https://github.com/mpolinowski/local-linear-embedding

Improve Data Quality by discarding non-correlating, noisy Dimensions

locally-linear-embedding pyplot python scikit-learn

Last synced: 28 Jan 2025