Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 21 Dec 2024

https://github.com/thiagohrcosta/movieapp-ml

The Movie APP is a project created to apply some of the concepts learned throughout the post-graduation degree at XP Educação in Artificial Intelligence with an emphasis on Machine Learning. While this project is not integrated into the curriculum of the course, some of the concepts used were learned during the program.

docker flask-api machine-learning mysql-database postgresql python scikit-learn

Last synced: 03 Feb 2025

https://github.com/davgiles/ut-austin-data-science-program

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 21 Dec 2024

https://github.com/5hraddha/optimize-oil-well-locations

In the quest for harnessing valuable energy resources, the OilyGiant mining company wants to expand its operations by discovering new oil well locations. To achieve this, a data-driven approach is adopted, leveraging geological exploration data from three distinct regions and employing techniques in data analysis and modeling.

linear-regression numpy pandas scikit-learn supervised-learning

Last synced: 07 Jan 2025

https://github.com/5hraddha/megaline-plan-recommendations

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate.Megaline has found out that many of their subscribers use legacy plans. They want to develop a model that would analyze subscribers' behavior and recommend one of Megaline's newer plans: Smart or Ultra.

decision-tree-classifier logistic-regression random-forest-classifier scikit-learn supervised-learning

Last synced: 07 Jan 2025

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 10 Oct 2024

https://github.com/jingjing515/purchase-behavior-analysis

Purchase Behavior Analysis for Targeted Customer Segmentation

clustering-algorithm data-mining machine-learning python scikit-learn

Last synced: 03 Nov 2024

https://github.com/akash-nath29/mnist_test

Handwritten digit recognition API using Scikit-Learn and FastAPI

fastapi machine-learning scikit-learn

Last synced: 10 Oct 2024

https://github.com/alejoduarte23/si_bayesianmixturemodel

Implementation of a two-stage fast Bayesian system identification for separated Modes. This repository expands the usage of this technique by adding a mixture model fit to obtain modal parameters from the posterior distribution.

matplotlib numpy scikit-learn scipy

Last synced: 10 Oct 2024

https://github.com/joekakone/prostate-cancer-prediction

Predict Prostate Cancer Diagnosis using ML

machine-learning scikit-learn

Last synced: 14 Jan 2025

https://github.com/mkdirer/depression-data-analysis

This project analyzes a Kaggle depression dataset using data preprocessing, clustering, classification, and outlier detection techniques. Python libraries like pandas, numpy, matplotlib, seaborn, and scikit-learn are used to extract insights.

classification clustering matplotlib numpy pandas scikit-learn seaborn vizualization

Last synced: 10 Oct 2024

https://github.com/bniladridas/churnprediction

Customers who'll likely stop using subscriptions.

dataset keras numpy pandas scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/bilgenurbekar/turkishcyberbullying

Contains fine-tuned BERT models and results in the text classification category using Turkish social media data

bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers

Last synced: 10 Oct 2024

https://github.com/bniladridas/sentiment-analysis

A sentiment analysis project using Naive Bayes in Python.

jupyter-notebook pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/sergeimakarovv/energy-data-analytics-ml

Analyzing global data on sustainable energy, predicting CO2 emissions per capita

machine-learning pandas plotly python scikit-learn streamlit

Last synced: 10 Oct 2024

https://github.com/ronylpatil/customer-churn-prediction

This is Machine Learning based project where different Ensemble Techniques are used to train our model. The whole project is created using Scikit-learn Pipeline.

machine-learning scikit-learn scikit-learn-pipelines

Last synced: 21 Jan 2025

https://github.com/shaharband/calcofi-oceanographic-analysis

This repository contains an analysis of the CalCOFI (California Cooperative Oceanic Fisheries Investigations) dataset, which represents one of the longest and most complete time series of oceanographic and larval fish data in the world.

pandas regression scikit-learn

Last synced: 27 Dec 2024

https://github.com/mahdi-meyghani/movie-recommendation-system

A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.

data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning

Last synced: 10 Oct 2024

https://github.com/brenofariasdasilva/dagster-education-model

Dagster Education Model using Dagster 1.3.11 and Python 3.7.17.

dagster makefile matplotlib pandas pyenv python3 scikit-learn seaborn shellscript

Last synced: 01 Feb 2025

https://github.com/dionixius7/titanic-disaster-ml-model

This project predicts the survival of passengers on the Titanic by using Kaggle Titanic Disaster Dataset. The dataset contains information related to passengers, such as age, gender, and class. Different machine learning algorithms have been applied for this predictive model to accomplish an accurate prediction that will define the survival chances

data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction

Last synced: 18 Jan 2025

https://github.com/rickcontreras/modelos1

Modelo de clasificación para predecir el desempeño de estudiantes en las Pruebas Saber Pro en Colombia. Incluye análisis exploratorio de datos, preprocesamiento y modelos de machine learning.

classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance

Last synced: 10 Oct 2024

https://github.com/sarowarahmed/advertising-sales-app

📈 Advertising Sales Predictor: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.

advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit

Last synced: 21 Dec 2024

https://github.com/pranav-tank/heart-disease-prediction-model

I have created this project as my Python term assignment. In this project I have trained a ML model to predict the heart disease using Scikit-learn library in python.

google-colaboratory jupyter-notebook machine-learning prediction-model python scikit-learn

Last synced: 21 Dec 2024

https://github.com/alaazameldev/text-based-search-engine

Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval

chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer

Last synced: 10 Oct 2024

https://github.com/akhil888binoy/intelligent-supplychain-management-system

Blockchain-powered supply chain management system with ML-driven sales prediction. Streamlines supplier-employee transactions and inventory management. Built with MERN stack, Solidity, and Flask.

blockchain decentralized-payments ethereum express flask foundry hackathon-project inventory-management machine-learning mern-stack mongodb nodejs python react sales-prediction scikit-learn smart-contracts solidity supply-chain-management wagmi

Last synced: 10 Oct 2024

https://github.com/ismaelvr1999/air-quality-clustering

This project focuses on analyzing air quality data and categorizing it into clusters using the K-Means algorithm.

jupyter-notebook machine-learning matplotlib pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/pejpero/machine_learning

This repository contains two comprehensive machine learning projects using scikit-learn, demonstrating ensemble learning with a Voting Classifier and the comparison of linear and polynomial regression models on different datasets.

ensemble-learning linear-regression logistic-regression machine-learning polynomial-regression random-forest scikit-learn svm

Last synced: 21 Dec 2024

https://github.com/kavyachouhan/fake-news-detection-dravidian-language

This repository contains the code and resources for a machine learning project focused on detecting fake news in the Malayalam language, developed as part of the IITM-PAN BS AI-ML Challenge.

jupyter-notebook machine-learning numy pandas python scikit-learn

Last synced: 21 Dec 2024

https://github.com/bilalm04/email-spam-classifier

A machine learning project that classifies emails as spam or not spam using Logistic Regression, with a deployable Flask API for real-time classification.

api flask jupyter-notebook machine-learning matplotlib nlp numpy pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/mohammad95labbaf/churn-prediction

This project aims to predict customer churn using machine learning algorithms. The project includes data preprocessing, feature engineering, and model evaluation.

adaboost bagging churn churn-analysis churn-prediction decisiontree ensemble-learning knn randomforest scikit-learn sklearn svm voting

Last synced: 10 Oct 2024

https://github.com/chengetanaim/beatrecommendersystembackend

A system for music producers and rappers/singers. I was trying to implement the product recommendation feature for music uploaded by producers. I used the collaborative filtering algorithm to be able to recommend songs to users.

fastapi scikit-learn sqlalchemy unsupervised-learning

Last synced: 10 Oct 2024

https://github.com/harris-giki/cancerdetectionmodel_ml

Simple Logistic Regression and Neural Network powered Machine Learning models that predicts whether a breast tumor is malignant or benign based on input features extracted from a breast cancer dataset.

cancer-detection development keras keras-tensorflow logistic-regression machine-learning neural-network scikit-learn streamlit tensorflow

Last synced: 03 Feb 2025

https://github.com/cmdecker95/learning-ml

My first foray into machine learning, comprised of my early undergrad work (circa 2021-2022) 🤓

jupyter-notebook keras machine-learning scikit-learn tensorflow2

Last synced: 21 Dec 2024

https://github.com/lucasfranklinsilva/rnn-lstm

Modelo de Prevenção de Falhas em Turbinas Simuladas Utilizando Redes Neurais Recorrentes

data-visualization deep-learning jupyter-notebook keras machine-learning neural-networks python scikit-learn

Last synced: 14 Jan 2025

https://github.com/jawwad-fida/data-science-salary-estimator

A tool that estimates data science salaries (MAE ~ $ 11K) to help data scientists negotiate their income when they get a job.

data-science machine-learning project scikit-learn

Last synced: 07 Jan 2025

https://github.com/xprithvi/random-forest-regressor

This Jupyter notebook serves as a machine learning template to quickly make predictions and analyse feature importance in a dataset.

data-science feature-extraction machine-learning random-forest random-forest-regression scikit-learn

Last synced: 20 Jan 2025

https://github.com/messierandromeda/sentiment-analysis

Sentiment analysis with the IMDB movie review dataset.

imdb-dataset python scikit-learn sentiment-analysis

Last synced: 10 Oct 2024

https://github.com/kristishqau/sentimentanalysis_nlp

A project for sentiment analysis of tweets using various NLP techniques and machine learning models.

datascience jupyter-notebook machine-learning nlp nltk python scikit-learn sentiment-analysis xgboost

Last synced: 20 Dec 2024

https://github.com/presizhai/iris-predictor-fastapi

A web application for predicting the species of Iris flowers using a machine learning model trained with the Iris dataset, with FastAPI, a modern web framework for building APIs.

essemblelearning fastapi python random-forest-classifier scikit-learn uvicorn

Last synced: 01 Nov 2024

https://github.com/johanneswiesner/skplot

A python package for extracting, plotting and reporting information from one or multiple sklearn classification & prediction pipelines.

plotting python scikit-learn sklearn visualization

Last synced: 11 Jan 2025

https://github.com/abdel-17/facial-recognition

Facial recognition using Machine Learning in Python

machine-learning pca python scikit-learn

Last synced: 25 Dec 2024

https://github.com/dragonscypher/feastfinderai

Discover the best dining spots with FeastFinderAI!

folium pandas python scikit-learn sql

Last synced: 07 Jan 2025

https://github.com/jeffandyalltogether/mlrecommendationsystem

project code for a recommendation system for Amazon using collaborative filtering, ranking, and matrix factorization to enhance customer satisfaction and product discovery.

eda matplotlib pandas python scikit-learn seaborn tensorflow

Last synced: 02 Feb 2025

https://github.com/javi-cc/python-ml-portcanto

Portcanto és un projecte de simulació d'un trajecte en bicicleta. S'ha definit 4 tipus de ciclistes que es diferencien en el temps que tarda a fer el trajecte. L'objectiu és descobrir els 4 patrons amb l'algoritme de clustering KMeans.

clustering docker docker-compose kmeans machine-learning mlfow pydoc pylint python scikit-learn testing venv

Last synced: 02 Feb 2025

https://github.com/zenitsu272/fault-detection-ml

Machine Learning based Fault Detection in machines using sensor data

artificial-intelligence decsion-tree machine-learning pandas pandas-dataframe pandas-python scikit-learn

Last synced: 02 Feb 2025

https://github.com/djdhairya/football-match-prediction

In this project, we'll predict the winner of football matches in the English Premier League (EPL).

jupyter-notebook machine-learning pandas python3 requests scikit-learn vscode

Last synced: 07 Jan 2025

https://github.com/mohammedhaq/safestream

SafeStream is a machine learning project that utilizes machine learning to predict the potability of water. By analyzing various water quality parameters, SafeStream helps in determining whether a water source is safe for consumption. This project leverages Python, PyTorch, and scikit-learn.

logistic-regression machine-learning neural-network python pytorch scikit-learn

Last synced: 02 Feb 2025

https://github.com/shamikaredkar/nextbinge-old

NextBinge is a personalized movie recommender system designed to suggest movies similar to the ones you've watched and liked. It leverages content-based filtering, recommending movies by analyzing genres, keywords, cast, and crew information.

machine-learning nlp-machine-learning python scikit-learn scikitlearn-machine-learning streamlit streamlit-webapp

Last synced: 20 Jan 2025

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 19 Dec 2024

https://github.com/adamouization/superconductors-critical-temperature-predictions-regression

:chart_with_upwards_trend: Machine Learning & Data Visualisation/Processing techniques for predicting the critical temperatures required for different superconductors to conduct electrical current with no resistance.

jupyter jupyter-lab jupyter-notebook machine-learning matplotlib matplotlib-pyplot numpy pandas prediction python python-3 regression regression-algorithms scikit-learn scikitlearn-machine-learning scipy

Last synced: 05 Jan 2025

https://github.com/andystmc/nextflownyc

Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting

Last synced: 07 Jan 2025

https://github.com/billy0402/python-machine-learning

A learning project from NTUB machine learning course.

ai course jupyter-notebook python scikit-learn tensorflow

Last synced: 14 Jan 2025

https://github.com/hmasdev/sklearn-protocols

sklearn-protocols: protocols for sklearn regressors, classifiers and transformers

protocol python python3 scikit-learn

Last synced: 29 Dec 2024

https://github.com/khaymanii/big_mart_prediction_model

This model was built using Python and Logistic Regression Algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/khaymanii/diabetes_prediction_model

This is a Machine learning model built using Python

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/ezeparziale/tweet-clasification

:bird: Tweet sentiment analysis

bootstrap flask nltk python scikit-learn

Last synced: 20 Jan 2025

https://github.com/githubuserpek/cnn-based_handwritten_digit_recognition

This repository contains a project focused on handwritten digit classification using a Convolutional Neural Network (CNN). The goal was to classify digits (0-9) from the widely-used MNIST dataset.

alphabet-recognition artificial-neural-networks cnn-keras datasets docker fastapi flask html machine-learning neural-networks pillow pooling-max-average pytorch scikit-learn

Last synced: 26 Jan 2025

https://github.com/khaymanii/calories-burnt-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/khaymanii/house-price-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/khaymanii/rock-or-mine-detection_model

This is a Machine learning model built using Python to detect between rock and mine

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/khaymanii/parkinsons-disease-detection-model

This model was built with Python and Support Vector Machine Algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/khaymanii/titanic_survival_prediction_-model

This Model was built using Python and Logistic Regression algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/samkazan/structural_discovery_of_macromolecules_data_analysis

This research project uses machine learning techniques and neural network to uncover key factors that contribute to successful protein structure discovery using Python and R

classification clustering ipython-notebook jupyter-notebook keras-neural-networks keras-tensorflow machine-learning neural-network numpy python r rmarkdown scikit-learn scipy tensorflow

Last synced: 12 Jan 2025

https://github.com/pierrekieffer/datapreprocessing

Custom data preprocessing library made for machine learning

data-preparation data-preprocessing machine-learning preprocessing scikit-learn

Last synced: 13 Dec 2024

https://github.com/bartventer/ann-loan-default-profiler

Artificial Neural Network built with Tesorflow to predict whether new customer application will repay or default on a new loan.

artificial-neural-network data-visualisation deep-learning exploratory-data-analysis numpy pandas pipelines scikit-learn tensorflow2

Last synced: 24 Dec 2024

https://github.com/bartventer/survival-predictor

A simple machine learning application developed with Django, AWS, Gunicorn and Nginx

aws django ec2 gunicorn linux nginx scikit-learn

Last synced: 24 Dec 2024

https://github.com/kumartusha/machine_learning_fundamentals

🤖 Machine Learning Journey 🚀 A repository showcasing my learning journey in Machine Learning with hands-on projects, algorithms, and practice notebooks.

matplotlib numpy pandas python3 scikit-learn seaborn sql sqlite

Last synced: 02 Feb 2025

https://github.com/nisch-mhrzn/house_prediction

This project predicts house prices using data exploration, feature engineering, and machine learning models like Linear Regression and Random Forest. It demonstrates how to optimize models and evaluate their performance to accurately forecast house prices.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Feb 2025

https://github.com/baponkar/scikit-logisticregression-application

A simple and detail application analysis of sci kit learn LogisticRegression model .

classification-algorithm logistic-regression machine-learning python3 scikit-learn

Last synced: 27 Dec 2024

https://github.com/chris-santiago/tsfeast

A collection of Scikit-Learn compatible time series transformers and tools.

data-science feature-engineering python scikit-learn time-series timeseries-features transformers

Last synced: 14 Dec 2024

https://github.com/vibhore-sagar/stock-movement-analysis-based-on-social-media-sentiment

This project aims to develop a machine learning model that predicts stock movements by analyzing sentiment in social media discussions (Reddit posts).

joblib numpy pandas praw python scikit-learn textblob-sentiment-analysis yfinance

Last synced: 02 Feb 2025

https://github.com/alphacrypto246/grape-quality-prediction

The Grape Quality Prediction project uses machine learning to predict the quality of grapes based on chemical properties like acidity, sugar content, and alcohol levels. It applies regression models to forecast the quality score, helping in wine production and quality assessment.

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 02 Feb 2025

https://github.com/pb319/california_house-price-prediction

This is going to be my first end to end ML project implementation covering all required stages taking guidence from book called "Hands On Machine Learning".

evaluation-metrics hyperparameter-tuning jupyter-notebook kfold-cross-validation machine-learning matplotlib numpy pandas python scikit-learn seaborn train-test-split

Last synced: 21 Jan 2025

https://github.com/luizassimoes/sklearn-kaggle-titanic

This repository was created to store all the code for tackling the Titanic challenge on Kaggle.

kaggle machine-learning scikit-learn

Last synced: 05 Jan 2025

https://github.com/subratamondal1/machine-learning

Machine Learning Notes with tools like Numpy, Pandas, Scikit-Learn.

machine-learning numpy pandas scikit-learn

Last synced: 11 Jan 2025

https://github.com/freakwill/dred

🔴 dred = dimension reducing for machine learning (suit to sklearn)

dimension-reduction scikit-learn sklearn sklearn-estimator

Last synced: 26 Jan 2025