Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/ayaarbi/prediction_des_maladies_cardiovasculaires_avec_ml

This project, developed as part of a Machine Learning course, uses supervised classification algorithms to predict the presence of cardiovascular disease from medical data published on Kaggle.

cardiovascular-diseases jupyter-notebook machine-learning matplotlib pandas python scikit-learn

Last synced: 23 Jan 2025

https://github.com/otuemre/obesity-classification

Machine learning project to classify obesity levels based on health metrics like age, sex, height, weight, and BMI.

classification data-science healthcare machine-learning obesity-classification scikit-learn

Last synced: 23 Jan 2025

https://github.com/jatin-mehra119/churn_modeling

This repository is dedicated to predicting customer churn using machine learning techniques. It includes comprehensive scripts for data preprocessing, model training, and evaluation, along with detailed visualizations and insights.

classification-model datavisualization pandas scikit-learn

Last synced: 17 Jan 2025

https://github.com/felinjob/ibm-applied-data-science-capstone

This project, part of the IBM Data Science Professional Certificate specialization, predicts the landing success of SpaceX's Falcon 9. Using data from the SpaceX API and web scraping, the project includes data analysis and machine learning to generate insights about the launches.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 23 Jan 2025

https://github.com/veranyagaka/credit-card-fraud-detection

Credit card fraud detection using data preprocessing, analysis, visualization, and machine learning to accurately identify fraudulent transactions. Final project.

ai anomaly-detection classification credit-card-fraud-detection machine-learning scikit-learn supervised-learning

Last synced: 23 Jan 2025

https://github.com/namratha2301/carprice_analysisandprediction

This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.

data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn

Last synced: 23 Jan 2025

https://github.com/parag000/content-based-movie-recommender

This project builds a content-based movie recommendation system using the TMDB dataset. By combining metadata features like cast, genres, and directors into a "metadata soup," it calculates movie similarity with vectorizers (Count) and cosine similarity. Ideal for learning content-based filtering and text vectorization techniques.

cosine-similarity countvectorizer recommendation-system scikit-learn tfidf-vectorizer vectorization

Last synced: 23 Jan 2025
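
The "metadata soup" approach described for content-based-movie-recommender can be illustrated with a minimal sketch. The toy catalogue, the titles, and the `recommend` helper below are hypothetical and are not taken from the repository; only `CountVectorizer` and `cosine_similarity` are real scikit-learn calls.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical toy catalogue: each "soup" joins cast, genre, and director tokens.
movies = {
    "Movie A": "tomhanks drama robertzemeckis",
    "Movie B": "tomhanks comedy pennymarshall",
    "Movie C": "keanureeves scifi wachowski",
    "Movie D": "tomhanks drama stevenspielberg",
}
titles = list(movies)

# Turn each soup into a token-count vector, then compare every pair of rows.
counts = CountVectorizer().fit_transform(list(movies.values()))
similarity = cosine_similarity(counts)


def recommend(title, top_n=2):
    """Return the top_n titles most similar to `title`, excluding itself."""
    idx = titles.index(title)
    ranked = sorted(range(len(titles)), key=lambda j: similarity[idx, j], reverse=True)
    return [titles[j] for j in ranked if j != idx][:top_n]
```

Calling `recommend("Movie A")` ranks the other titles by how many soup tokens they share with "Movie A". Swapping `CountVectorizer` for `TfidfVectorizer` would down-weight tokens that appear in many soups.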

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 23 Jan 2025

https://github.com/nirmaldeepponnada/codeclauseinternshipproject1

This project involves Customer Segmentation using K-Means clustering to group customers based on Recency, Frequency, and Monetary (RFM) analysis from the Online Retail dataset. It also performs Sentiment Analysis on Amazon Product Reviews using Natural Language Processing techniques & Logistic Regression to classify reviews as positive or negative.

kmeans logistic-regression numpy pandas python3 regular-expressions scikit-learn tf-idf-vectorizer

Last synced: 23 Jan 2025
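
As a rough sketch of the RFM segmentation idea mentioned for codeclauseinternshipproject1, the snippet below derives Recency, Frequency, and Monetary features from a transaction table and clusters them with K-Means. The transactions and the column names (`customer`, `date`, `amount`) are invented for illustration and do not reflect the Online Retail dataset's schema or the repository's code.

```python
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Hypothetical transactions; column names are assumptions for this sketch.
tx = pd.DataFrame({
    "customer": ["a", "a", "a", "b", "c", "c", "d"],
    "date": pd.to_datetime([
        "2024-01-02", "2024-02-10", "2024-03-01",
        "2023-06-15", "2024-02-20", "2024-02-28", "2023-05-01",
    ]),
    "amount": [120.0, 80.0, 200.0, 15.0, 150.0, 90.0, 10.0],
})

# Reference date: one day after the last purchase in the data.
snapshot = tx["date"].max() + pd.Timedelta(days=1)

# One row per customer: days since last purchase, purchase count, total spend.
rfm = tx.groupby("customer").agg(
    recency=("date", lambda d: (snapshot - d.max()).days),
    frequency=("date", "count"),
    monetary=("amount", "sum"),
)

# Scale the features so no single one dominates the distance, then cluster.
scaled = StandardScaler().fit_transform(rfm)
rfm["segment"] = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(scaled)
```

The cluster labels themselves are arbitrary integers; what matters is which customers end up sharing a label.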

https://github.com/hmasdev/ssbgm

Score Based Generative Model with scikit-learn

generative-model scikit-learn

Last synced: 23 Jan 2025

https://github.com/emms21/thoughtful_ai

Technical Screening: an agent that responds to semantically similar questions with pre-defined answers, falling back to generic LLM responses otherwise

openai python scikit-learn streamlit

Last synced: 22 Jan 2025

https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon

An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.

matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud

Last synced: 23 Jan 2025

https://github.com/purcellcjp/credit-risk-classification

This project used Python and the scikit-learn library to train and evaluate a machine learning model based on loan risk.

machine-learning numpy pandas-dataframe python scikit-learn

Last synced: 23 Jan 2025

https://github.com/bastianlq/gym-prediccion-churn-y-agrupacion-clustering

Churn prediction, customer grouping via clustering, and marketing recommendations for a gym.

aprendizaje-automatico clustering machine-learning scikit-learn

Last synced: 23 Jan 2025

https://github.com/andresmg07/real-time-sign-language-translator

AI-driven real-time American Sign Language translator. Implemented leveraging Support Vector Machines (SVM), OpenCV library and MediaPipe hands module.

ai computer-vision machine-learning mediapipe opencv pattern-recognition scikit-learn support-vector-machines

Last synced: 23 Jan 2025

https://github.com/jt2m0l3y/quantified-self

The final project for an introduction to data science, this project is a practice in supervised machine learning with kNN and Decision Tree Classifiers.

data-visualization decision-trees eda jupyter-notebooks knn-classification latex markdown numpy pandas python scikit-learn statistical-analysis supervised-learning

Last synced: 27 Dec 2024

https://github.com/abdullahashfaq-ds/sms-spam-detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 17 Dec 2024

https://github.com/jol79/python_exercises

Solving interesting python exercises on different topics

matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn

Last synced: 11 Dec 2024

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn

Last synced: 17 Dec 2024
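
The PCA, KMeans, and Elbow Method combination named in the ml_eda_clustering_aidassessment entry could look roughly like the following. The data here is synthetic (three artificial groups with five features), standing in for the country indicators; nothing below is taken from the repository.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-in for country indicators: three well-separated groups, 5 features.
centers = np.array([[0.0] * 5, [8.0] * 5, [-8.0] * 5])
X = np.vstack([c + rng.normal(size=(30, 5)) for c in centers])

# Standardize, then project onto the first two principal components.
X_2d = PCA(n_components=2).fit_transform(StandardScaler().fit_transform(X))

# Elbow method: record the within-cluster sum of squares (inertia) for each k.
inertias = {
    k: KMeans(n_clusters=k, n_init=10, random_state=0).fit(X_2d).inertia_
    for k in range(1, 7)
}
```

Plotting `inertias` against k would show the curve flattening after k = 3 for this synthetic data, which is the "elbow" the method looks for.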

https://github.com/albarji/teachingcontainer

A Docker container I use for my lectures

docker keras machine-learning scikit-learn

Last synced: 17 Dec 2024

https://github.com/scikit-learn/pairwise-distances-reductions-asv-suite

A dedicated asv suite for scikit-learn private PairwiseDistancesReductions

asv benchmarks cython scikit-learn

Last synced: 17 Dec 2024

https://github.com/pratishtha-abrol/sentimentanalysis

Logistic Regression: A sentiment analysis case study

logistic-regression nltk-python scikit-learn sentiment-analysis

Last synced: 12 Dec 2024

https://github.com/dmarks84/coursework_project_ml-classification

Project for IBM Data Science course on Machine Learning -- Trained ML models for classification, evaluating based on a variety of metrics

classification communication data-modeling dataframes numpy pandas python scikit-learn supervised-ml

Last synced: 23 Dec 2024

https://github.com/dmarks84/coursework_project_ml-classifier-eval-selection

Project for University of Michigan Applied Data Science Specialization -- Predicted viewer engagement based on features related to video metrics; evaluated a large set of classifiers under different scoring metrics to select the "optimal" one.

classification cross-validation data-modeling data-reporting data-visualization databases dataframes eda grid-search matplotlib numpy pandas python scikit-learn statistics supervised-ml

Last synced: 23 Dec 2024

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 23 Dec 2024

https://github.com/dmarks84/coursework_project_ml-model-eval-refine

Project for IBM Data Science course on ML Models & Analysis -- Read in large dataset of home sales and utilized polynomial linear regression analysis to make predictions of future home sales prices

classification communication data-modeling dataframes machine-learning matplotlib numpy pandas programming python regression scikit-learn scipy seaborn supervised-ml visualization

Last synced: 23 Dec 2024

https://github.com/saahilanande/naivebayes

Implementing a Naive Bayes classifier from scratch for sentiment analysis of the IMDB dataset

machine-learning naive-bayes-classifier python-3 scikit-learn

Last synced: 10 Jan 2025

https://github.com/martinkersner/kmeans-meetup

Presentation about k-Means for Seoul AI Meetup on July 22, 2017.

kmeans numpy python scikit-learn

Last synced: 01 Jan 2025

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 23 Dec 2024

https://github.com/yuweaec/wine_quality_prediction

The Wine Quality Prediction project aims to predict the quality of wine based on its chemical properties using machine learning algorithms.

flask jupyter-notebook machine-learning python scikit-learn

Last synced: 31 Oct 2024

https://github.com/rishi035/advanced-house-price-predictions

This is my first project, which I also entered in a Kaggle competition.

linear-regression machine-learning python random random-forest regressor-models scikit-learn

Last synced: 01 Jan 2025

https://github.com/themihirmathur/machine-failure-predictor

The 'Machine Failure Predictor' is a web application designed to enable predictive maintenance for industrial equipment. Leveraging machine learning techniques, this application predicts whether equipment will fail and identifies the type of failure based on key process parameters.

cicd digitalocean docker fastapi flask github github-actions machine-learning matplotlib mlops numpy plotly python randomforest scikit-learn seaborn streamlit

Last synced: 01 Jan 2025

https://github.com/themihirmathur/soiligator

Soiligator is an advanced machine learning project designed to optimize irrigation management by predicting whether irrigation is necessary based on environmental and soil-related data.

auc-score logistic-regression machine-learning matplotlib numpy pandas python random-forest-classifier roc-curve scikit-learn seaborn standardscaler support-vector-machine

Last synced: 01 Jan 2025

https://github.com/dionixius7/titanic-disaster-ml-model

This project predicts the survival of passengers on the Titanic using the Kaggle Titanic Disaster dataset. The dataset contains passenger information such as age, gender, and class. Several machine learning algorithms are applied to predict each passenger's chance of survival as accurately as possible.

data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction

Last synced: 18 Jan 2025

https://github.com/tedim52/discjockey

a content-based recommender system for your party playlist preferences

jupyter-notebook matplotlib pandas scikit-learn spotify-web-api

Last synced: 12 Dec 2024

https://github.com/kumailn/machinelearning

Machine learning with Python

machine-learning python scikit-learn tensorflow

Last synced: 02 Jan 2025

https://github.com/raphael-ufrj/analise_algodao

Historical analysis of cotton planting, examining planting based on climate and historical data.

analysis data-science data-visualization dataset docker pandas provenance python python3 scikit-learn seaborn streamlit

Last synced: 12 Dec 2024

https://github.com/myahninsi/customer-segmentation-recommendation-ml

This project addressed challenges in understanding customer behavior and personalizing shopping experiences for an e-commerce platform. Developed ML solutions including K-Means clustering for segmentation, Random Forest regression for CLV prediction, and collaborative filtering for product recommendations.

collaborative-filtering k-means-clustering pandas python random-forest scikit-learn

Last synced: 30 Dec 2024

https://github.com/rcghpge/pymo

A framework in Mojo for AI/ML/DL applications and other domains.

deep-learning machine-learning magic-cli modular mojo mojo-language pymo python scikit-learn

Last synced: 21 Jan 2025

https://github.com/csengupta1101/housing-price-prediction

Based on given features, this project deals with the prediction of house prices. Everything from data extraction to deployment is detailed here.

jupyter-notebook matplotlib numpy pandas python python3 scikit-learn vscode

Last synced: 29 Dec 2024

https://github.com/sundarmd/breast-cancer-detection

Breast-Cancer-Detection is a machine learning project that utilizes logistic regression to predict whether a tumor is benign or malignant based on the Breast Cancer Wisconsin (Diagnostic) dataset. The project demonstrates data preprocessing, model training, and evaluation using the `scikit-learn` library.

logistic-regression machine-learning python scikit-learn

Last synced: 30 Dec 2024
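
The workflow summarized in the breast-cancer-detection entry (preprocessing, training, and evaluating a logistic regression) can be sketched with the copy of the Breast Cancer Wisconsin (Diagnostic) dataset that ships with scikit-learn. The split ratio, the scaling step, and `max_iter` are assumptions made for this sketch, not the repository's settings.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Load the bundled dataset: 30 numeric features, binary target.
X, y = load_breast_cancer(return_X_y=True)

# Hold out a stratified test set (25% is an arbitrary choice for this sketch).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Scale features before fitting; the scaler is fit on the training data only.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

# Mean accuracy on the held-out data.
accuracy = model.score(X_test, y_test)
```

Wrapping the scaler and classifier in one pipeline keeps test data out of the scaling statistics, which avoids a common source of leakage.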

https://github.com/wlopezm-unal/titanic_ship-streamlit

A machine learning model that uses Titanic passenger data to predict whether a passenger survived or died. The application uses Random Forest, GaussianNB, and Logistic Regression models. Streamlit, together with FastAPI, is used to display the prediction results.

docker machine-learning optuna scikit-learn streamlit supervised-machine-learning

Last synced: 08 Jan 2025

https://github.com/rahulb813/mvscr

An alternative movie recommendation system to CineMatchSM with better predictions. Trained on a Kaggle dataset of more than 450,000 entries.

gridsearchcv knn python scikit-learn scipy-sparse svd tensorflow xgboost

Last synced: 02 Jan 2025

https://github.com/deliprofesor/ridge-regression-for-sales-prediction-model-evaluation-and-hyperparameter-tuning

This project builds and optimizes a model on a dataset using Ridge regression and polynomial features. Model accuracy is enhanced through regularization and polynomial transformations. Grid search and cross-validation are used to find the best parameters, and the model's performance is evaluated.

cross-validation data-science data-visualization grid-search machine-learning model-optimization mse overfitting-prevention polynomial-regression python r2-score regression-analysis regularization ridge-regression rmse scikit-learn

Last synced: 30 Dec 2024
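
A minimal sketch of the technique named in the Ridge regression entry above, combining polynomial features, Ridge regularization, grid search, and cross-validation. The data is synthetic (a noisy quadratic), and the parameter grid is an assumption chosen for illustration rather than the repository's configuration.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

rng = np.random.default_rng(0)

# Synthetic target: a quadratic in one feature plus Gaussian noise.
X = rng.uniform(-3, 3, size=(200, 1))
y = 0.5 * X[:, 0] ** 2 - X[:, 0] + rng.normal(scale=0.3, size=200)

pipe = Pipeline([
    ("poly", PolynomialFeatures(include_bias=False)),  # expand to x, x^2, ...
    ("scale", StandardScaler()),                       # put terms on one scale
    ("ridge", Ridge()),                                # L2-regularized fit
])

# Search polynomial degree and regularization strength with 5-fold CV.
grid = GridSearchCV(
    pipe,
    {"poly__degree": [1, 2, 3], "ridge__alpha": [0.01, 0.1, 1.0, 10.0]},
    cv=5,
    scoring="neg_mean_squared_error",
)
grid.fit(X, y)
```

After fitting, `grid.best_params_` holds the selected degree and alpha, and `-grid.best_score_` is the cross-validated mean squared error of that combination.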

https://github.com/myahninsi/housing-price-prediction-ml

Final project for Big Data Visualization for Business Communications 01 (DSMM Group 1). Analyzes housing data, identifies key price factors, and builds predictive models using machine learning. Includes Power BI dashboards for interactive visualizations and Flask for deployment.

elastic-net lasso-regression linear-regression matplotlib numpy pandas ridge-regression scikit-learn seaborn

Last synced: 30 Dec 2024

https://github.com/adadalshabab/human-stress-analysis-greadsearch-classifier

The project leverages data from physiological signals, self-reported surveys, behavioral observations, or other relevant sources to infer and analyze stress levels.

classification knn-classification machine-learning machine-learning-algorithms matplotlib pandas scikit-learn

Last synced: 21 Jan 2025

https://github.com/thmslmr/scikitlearn-examples

💡Scikit Learn examples - Python

python scikit-learn tutorials

Last synced: 29 Dec 2024

https://github.com/dmschauer/aws-sagemaker-deployment-test

I did a simple test to see how deploying a machine learning model on AWS SageMaker, and thus turning it into an API, works. Since scikit-learn models require fewer dependencies than, for example, TensorFlow models, I went with them for this test. To do so I followed a tutorial.

aws boto3 python sagemaker scikit-learn

Last synced: 08 Jan 2025

https://github.com/yuanx749/mlp-pytorch

PyTorch MLP with scikit-learn API.

pytorch scikit-learn

Last synced: 08 Jan 2025

https://github.com/anras5/criteo-search-data

EDA and statistical tests on CriteoSearchData dataset

data-science pandas scikit-learn statistics

Last synced: 21 Jan 2025

https://github.com/rakibhhridoy/appliedmachinelearninghousing-regression

Let's take the Housing dataset which contains information about different houses in Boston. This data was originally a part of UCI Machine Learning Repository and has been removed now. We can also access this data from the scikit-learn library. The objective is to predict the value of prices of the house using the given features.

deep-learning housing-market housing-prices machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 25 Dec 2024

https://github.com/adamouization/superconductors-critical-temperature-predictions-regression

Machine Learning & Data Visualisation/Processing techniques for predicting the critical temperatures required for different superconductors to conduct electrical current with no resistance.

jupyter jupyter-lab jupyter-notebook machine-learning matplotlib matplotlib-pyplot numpy pandas prediction python python-3 regression regression-algorithms scikit-learn scikitlearn-machine-learning scipy

Last synced: 05 Jan 2025

https://github.com/ericmaddox/crime-analyst-ai

AI-powered Python project that analyzes crime data, predicts trends using advanced models, and visualizes insights on interactive maps with validation for accuracy and reliability.

ai artifical-intelligence crime-analysis crime-data crime-data-analysis crime-prediction crime-statistics ollama ollama-api python python3 scikit-learn

Last synced: 08 Jan 2025

https://github.com/tanaybhadula/ml-preprocessing-cli

A Python CLI tool that preprocesses datasets for supervised learning, saving users time. Input data can be preprocessed using simple commands, and the preprocessed dataset can be downloaded later.

cli data-cleaning data-preprocessing machine-learning pandas python scikit-learn

Last synced: 10 Jan 2025

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 14 Jan 2025

https://github.com/darkdk123/customer-churn-prediction-innobytes

Predicting Customer churns as an Internship project at Innobytes services.

data-science python scikit-learn streamlit xgboost-classifier

Last synced: 28 Dec 2024

https://github.com/karimosman89/resume-screening

Screen resumes to identify the best candidates. Build a machine learning model that screens resumes and ranks candidates based on job descriptions. Streamline the hiring process for HR departments by automating candidate screening.

machine-learning-algorithms nlp-machine-learning nltk-python python scikit-learn spacy text-processing

Last synced: 25 Dec 2024

https://github.com/akshaya13/recommendation-system

Content Based Recommendation system using tags!

nltk scikit-learn similarity-search tmdb-database

Last synced: 08 Jan 2025

https://github.com/elifirinci/mushrooms-plants-classification

This project features AI models for identifying mushrooms and plants as poisonous or edible using image-based predictions. Both models are tested through an interactive Gradio interface, ensuring user-friendly and accurate identification for foragers and researchers.

classification cnn cnn-classification gradio image-classification machine-learning mushroom-classification plant-classification scikit-learn

Last synced: 08 Jan 2025

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the Scikit library.

numpy scikit-learn

Last synced: 14 Jan 2025

https://github.com/srosalino/prediction_of_seoul_bikes_demand

The objective of this project is to predict the number of bicycles needed to be made available each hour in order to make the service as efficient as possible

cross-validation data-exploration-and-preprocessing hyperparameter-tuning machine-learning regularization-methods scikit-learn

Last synced: 08 Jan 2025

https://github.com/aysenurcftc/breast_cancer_streamlit

Breast Cancer Wisconsin Dataset Classifier with Scikit-learn and Streamlit

breast-cancer classification gridsearch scikit-learn streamlit

Last synced: 14 Jan 2025

https://github.com/dummumounika/ecommerce-sales-categorization

This repository contains Python code for text classification and analysis of e-commerce sales data. The script processes textual descriptions of products and categorizes them into predefined categories using a Naive Bayes classifier. It also includes various analysis and visualization methods to explore the dataset.

machine-learning matplotlib-pyplot ntlk numpy pandas python scikit-learn

Last synced: 02 Jan 2025
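
The Naive Bayes product categorization described for ecommerce-sales-categorization can be approximated with a small bag-of-words pipeline. The product descriptions and the two category labels below are invented for illustration; the repository's actual data and categories are not shown in this listing.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical product descriptions with hand-assigned categories.
descriptions = [
    "wireless bluetooth speaker with usb charging",
    "noise cancelling headphones with microphone",
    "usb keyboard and wireless mouse",
    "cotton t-shirt for men",
    "women's denim jacket with pockets",
    "wool winter scarf and gloves",
]
labels = ["electronics"] * 3 + ["clothing"] * 3

# Bag-of-words counts feed a multinomial Naive Bayes classifier.
clf = make_pipeline(CountVectorizer(), MultinomialNB())
clf.fit(descriptions, labels)

prediction = clf.predict(["wireless bluetooth headphones"])[0]
```

Words the model has never seen are ignored by the vectorizer at prediction time, so a description made only of unknown words falls back to the class priors.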

https://github.com/mastermindromii/car-price-prediction-model

Here is My Regression Project based on Predicting Price of Car using Linear Regression.

linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 08 Jan 2025

https://github.com/iamwatchdogs/cardiovascular-risk-prediction

This mini-project uses machine learning algorithms to predict possible risks of heart disease by analyzing given data.

jupyter-notebook machine-learning-algorithms matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Jan 2025

https://github.com/thekartikeyamishra/predictive-sales-analytics

The Predictive Sales Analytics tool aims to help MSMEs forecast future sales using historical data. This advanced version leverages Machine Learning for accurate predictions and provides a dashboard to visualize sales trends, seasonality, and predictions.

joblib machine-learning matplotlib pandas python scikit-learn streamlit

Last synced: 26 Dec 2024

https://github.com/taeefnajib/predicting-heart-disease-using-logistic-regression

In this project, we trained a model that predicts whether a person will have heart disease or not. We also created a web app using Flask and deployed our application on Render.

classification deployment disease flask heart logistic-regression render scikit-learn sklearn

Last synced: 21 Jan 2025

https://github.com/drtey/tryingml

Hands-on with machine learning fundamentals

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/chanmeng666/mnist-handwritten-digit-recognition-project

A comprehensive implementation and analysis of handwritten digit recognition using multiple neural network architectures on the MNIST dataset. Features basic MLP, optimized feature-selected model, and deep CNN approaches with detailed performance comparisons and visualizations.

cnn computer-vision data-analysis data-visualization deep-learning feature-analysis handwritten-digit-recognition keras machine-learning mlp mnist model-optimization neural-networks python scikit-learn tensorflow

Last synced: 02 Jan 2025

https://github.com/armahdavi/code-data-analytics-pipelines-statistics-ml-pm-psd_residential-qff

Sharing all the data pipelines and processing code, statistical models, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air). Project milestone: 2017-2020. Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 10 Jan 2025

https://github.com/eco786786/spotify-playlist-generator

This project uses machine learning to cluster songs by features like tempo, genre and mood with K-Means. It then creates personalised Spotify playlists based on these clusters, providing dynamic, genre specific track collections. Integrating the Spotify API, it enables users to explore new music within custom groupings.

flask matplotlib pandas python3 scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/arizdn234/spotify-api-with-colab

Crawling, Analyzing, Clustering music data from Spotify API

machile-learning scikit-learn spotify-api spotipy-library

Last synced: 30 Dec 2024

https://github.com/aravind-selvam/student_exam_performance_predictor

Sample machine learning project. The project uses scikit-learn-compatible regression techniques such as XGBoost and Random Forests to train and test the model on student data. Deployed on Heroku as a Flask application.

flask-application heroku machine-learning python scikit-learn

Last synced: 14 Jan 2025

https://github.com/dyarleniber/hands-on-machine-learning

This repository contains code examples, exercises, and projects related to the concepts covered in the book "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition" by Aurélien Géron.

ai artificial-intelligence keras machine-learning matpolotlib numpy pandas scikit-learn tensorflow

Last synced: 13 Jan 2025