Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/mtlh/fyp_prempredict

In PremPredict, players will predict all Premier League games. Compete against the algorithm and other users across a full season. Scoring points for every correct result/prediction.

django prediction premierleague python scikit-learn tailwindcss

Last synced: 03 Jan 2025

https://github.com/analitico-771/machine_learning_trading_bot

This is an Application that implements an algorithmic trading strategy that uses machine learning to automate the trade decisions

financial-analysis hvplot logistic-regression machine-learning moving-average pandas-dataframe predictive-modeling python scikit-learn stock-price-prediction support-vector-machine

Last synced: 03 Jan 2025

https://github.com/teja-1403/coursera-machine-learning-with-python-honors

This project involves building a classifier to predict rainfall for the next day based on weather data from the Australian Government's Bureau of Meteorology. Various machine learning techniques such as Linear Regression, KNN, Decision Trees, Logistic Regression, and SVM were implemented and evaluated.

classification hierarchical-clustering machine-learning regression scikit-learn scipy

Last synced: 03 Jan 2025

https://github.com/kengz/feature_transform

Build ColumnTransformers (Scikit or DaskML) for feature transformation by specifying configs.

column-transformer dask-ml dataset feature-engineering feature-transformation machine-learning scikit-learn

Last synced: 03 Jan 2025

https://github.com/armahdavi/code-data-processing-statistics-plotting-airborne-sampling-of-pm

All codes for the data pipelines processing, statistical modellings, descriptive statistics and plot visualizations from airborne phase of Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 10 Jan 2025

https://github.com/andrewsy1004/mask-detection

Mask detection system capable of identifying individuals with or without masks

kaggle keras python scikit-learn tensorflow

Last synced: 10 Jan 2025

https://github.com/ajxxxs/spotify-music-analysis

spotify Music (web scraped playlists ) analysis (over 3 states) , trends, features and a music recommendation system.

matplotlib numpy panda scikit-learn seaborn

Last synced: 10 Jan 2025

https://github.com/eusha425/housing-market-analysis

Implementation of supervised learning algorithms for real estate price prediction, featuring Ridge Regression optimization, IQR-based outlier detection, and extensive feature engineering. Includes detailed visualizations, statistical analysis, and model performance comparisons using various evaluation metrics.

data-preprocessing data-science exploratory-data-analysis house-price-prediction machine-learning python scikit-learn supervised-learning

Last synced: 04 Jan 2025

https://github.com/leftcoastnerdgirl/supervised_learning

This project demonstrates supervised machine learning using scikit-learn.

classification-reports confusion-matrix jupyter-notebook numpy pandas-python pathlib scikit-learn sklearn

Last synced: 04 Jan 2025

https://github.com/leftcoastnerdgirl/deep_learning

This project introduces neural networks, deep learning, and Tensorflow.

deep-learning jupyter-notebook neural-networks pandas-python scikit-learn tensorflow

Last synced: 04 Jan 2025

https://github.com/ghoumbadji/water-potability-checker

A machine learning model that takes some data on water and tells if this water is potable or not

kaggle machine-learning pandas scikit-learn

Last synced: 04 Jan 2025

https://github.com/bjpcjp/scikit-learn-v0.23

My Jupyter Lab notebooks on Scikit-Learn v0.23. Work in progress.

matplotlib-pyplot numpy python3 scikit-learn scipy

Last synced: 04 Jan 2025

https://github.com/bjpcjp/scikit-learn

Updates in progress. Jupyter workbooks will be added as time allows.

python python3 scikit-learn

Last synced: 04 Jan 2025

https://github.com/elazzouzihassan/si-fraud-detection-prototype

Système de Détection des Fraudes avec Python (Prototype).

googlecolab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 04 Jan 2025

https://github.com/hrolive/recommendation-systems-ibm

Analyze the interactions that users have with articles on the IBM Watson Studio platform and make recommendations to them about new articles, using various recommendation engines.

machine-learning natural-language-processing pandas python recomendation-system scikit-learn

Last synced: 04 Jan 2025

https://github.com/hrolive/deep-learning-nanodegree

As one of the top 3% students from the first phase, "PyTorch Scholarship Challenge" by Facebook AI, I have earned a full scholarship to Udacity’s Deep Learning Nanodegree program

api-gateway aws aws-lambda aws-sagemaker computer-vision convolutional-neural-networks deep-learning deployment machine-learning natural-language-processing numpy pandas python pytorch scikit-learn

Last synced: 04 Jan 2025

https://github.com/hrolive/disaster-response-pipeline

A machine learning pipeline that categorizes disaster related messages so that they can be sent to the appropriate disaster relief agency

flask machine-learning natural-language-processing nltk pandas plotly python scikit-learn sql sqlalchemy

Last synced: 04 Jan 2025

https://github.com/nickklos10/compressive-strenght-prediction

This project predicts concrete compressive strength using a neural network regression model built with Keras.

jupyter-notebook keras matplotlib numpy pandas python scikit-learn

Last synced: 04 Jan 2025

https://github.com/donmaruko/sentiment-analysis-api

Flask-based API for sentiment analysis using deep learning models and includes endpoints for text and file input, database storage, and integrated Swagger documentation.

api deep-learning deep-neural-networks flask keras lstm machine-learning neural-network rnn scikit-learn scikitlearn-machine-learning sklearn sqlite3 swagger swagger-ui tensorflow

Last synced: 04 Jan 2025

https://github.com/dan-niles/iris-ml

Machine learning on the Iris dataset

iris-dataset machine-learning scikit-learn

Last synced: 11 Jan 2025

https://github.com/johnnixon6972/cirrhosis-outcomes-prediction

This leverages advanced machine learning techniques to predict patient outcomes for those suffering from cirrhosis. Utilizing a comprehensive dataset from a Mayo Clinic study, this project explores various data imputation methods and class balancing techniques to enhance prediction accuracy.

ai algorithms analytics artificial-intelligence machine-learning ml pandas python3 scikit-learn

Last synced: 18 Jan 2025

https://github.com/moeeinaali/nlp-lsa

Applying Latent Semantic Analysis (LSA) to text data using scikit-learn.

lsa nlp scikit-learn

Last synced: 18 Jan 2025

https://github.com/leabrodyheine/ml-kaggle-cirrhosis-data

This project showcases skills in machine learning, data preprocessing, and model evaluation using Python libraries such as scikit-learn, XGBoost, and Optuna. It involves implementing various machine learning models, handling imbalanced data, and employing imputation techniques to enhance model performance for predicting cirrhosis outcomes.

data-analysis data-pre imbalanced-data imputation machine-learning optuna pipeline scikit-learn xgboost

Last synced: 11 Jan 2025

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 11 Jan 2025

https://github.com/sadmansakib93/mental-resilience-analysis-using-machine-learning

Utilized supervised and unsupervised ML techniques to analyze mental health and resilience levels of medical students [Project completed on December, 2019]

artificial-intelligence classification clustering correlation linear-regression machine-learning machine-learning-algorithms mental-health python regression resilience scikit-learn statistical-analysis

Last synced: 12 Jan 2025

https://github.com/kianoushamirpour/end_to_end_text_classification

Developing feature engineering pipelines, building packages, automating tests, and creating FastAPI endpoints.

apache-airflow ci docker-compose factory-design-pattern fastapi feast grafana hyperopt mlflow prometheus pytorch scikit-learn tox transformers xgboost-classifier

Last synced: 18 Jan 2025

https://github.com/mrmalik2512/catsvsdog.github.io

A CNN model integrated with flask backend the project is trained on image data of dogs and cats and integrated with a website predicts the given image is dog or a cat

deep-learning numpy python scikit-learn tensorflow

Last synced: 12 Jan 2025

https://github.com/sxv357/xtern-artificial-intelligence-work-based-assessment

This application takes in data regarding undergraduate college students in the state of Indiana such as their year, what major they're pursuing, which university they attend, and makes a prediction about their food order.

jupyter-notebook matplotlib pandas pickle scikit-learn seaborn

Last synced: 12 Jan 2025

https://github.com/medicharlakarthik/credit-card-fraud-detection

Credit Card Fraud Detection using machine learning to distinguish fraudulent transactions from legitimate ones. This project includes data analysis, model training, and evaluation to achieve high accuracy and recall, minimizing false negatives for better fraud detection

machine-learning python random-forest-classifier scikit-learn

Last synced: 12 Jan 2025

https://github.com/zenklinov/regression_logistic_-_sentiment_analysis_movie_data

This repository contains code for performing sentiment analysis using scikit-learn and logistic regression

llm natural-language-processing nlp nltk scikit-learn sentiment-analysis

Last synced: 12 Jan 2025

https://github.com/sanchariii/order_amt_prediction

Order Amount Prediction is a machine learning project that predicts customer order amounts based on past behavior. It includes milestones for data cleaning, exploratory data analysis, feature engineering, and model building. The framework can be customized to suit specific needs and provides insights for better decision-making.

jupyter-notebook machine-learning python scikit-learn

Last synced: 12 Jan 2025

https://github.com/sanchariii/multiple-disease-prediction-system-using-streamlit

This prediction system is a web based application using Streamlit framework which can predict multiple diseases like Heart Disease , Parkinson's Disease and Diabetes.

pickle scikit-learn spyder-python-ide streamlit-webapp

Last synced: 12 Jan 2025

https://github.com/r-gg/ml-37

Amazon Reviews ~ Sentiment analysis evaluation: fine-tuned BERT vs LSTM. (+ Extensive Data Mining & Visualization)

bert deep-learning ipynb-jupyter-notebook lstm machine-learning python scikit-learn uni-project

Last synced: 12 Jan 2025

https://github.com/simon2k/stock-price-prediction-evaluation

This project is indented to present a small evaluation of different types of regression models for predicting stock prices for AAPL.

evaluation machine-learning numpy pandas predicting-stock-prices scikit-learn

Last synced: 12 Jan 2025

https://github.com/mohd-faizy/preprocess_ml

This repository hosts Python code that utilizes the Scikit-learn preprocessing API for data preprocessing. The code presents a comprehensive range of tools that handle missing data, scale data, encode categorical variables, and perform other functions.

data-science feature-engineering feature-engineering-algorithm feature-extraction feature-selection machine-learning outlier-detection preprocessing-data preprocessor scikit-learn

Last synced: 12 Jan 2025

https://github.com/lc-rezende/eqx_boston_dataset

Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.

data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn

Last synced: 18 Jan 2025

https://github.com/aldotestino/word-freq-email-classification

Simple email classifier using word frequency and Logistic Regression

docker email-classification fastapi logistic-regression python react scikit-learn

Last synced: 18 Jan 2025

https://github.com/ledsouza/nlp-article-classification

This project aims to develop a machine learning model capable of classifying news articles into different categories based on their titles. Two different word embedding models (CBOW and Skip-gram) are trained and used to vectorize the article titles. These vectorized representations are then used to train a Logistic Regression classifier.

gensim-word2vec natural-language-processing nlp nlp-machine-learning pandas python scikit-learn spacy spacy-nlp

Last synced: 30 Jan 2025

https://github.com/rajan-bhateja/machine-learning-with-python

Machine learning algorithms implemented using Scikit-learn

classification clustering machine-learning regression scikit-learn sklearn

Last synced: 18 Jan 2025

https://github.com/atkaridarshan04/machine-learning-algorithms

Machine Learning Algorithms with Python and SciKit-Learn

machine-learning machine-learning-algorithms python scikit-learn

Last synced: 30 Jan 2025

https://github.com/rishi-sutar/healwise-ai-your-way-to-wellness

Healwise-AI is a health diagnostic tool that uses a Support Vector Classifier (SVC) model to predict diseases based on user-reported symptoms. After predicting, it offers detailed health advice, including descriptions, diets, medications, and workouts related to the diagnosis.

machine-learning scikit-learn support-vector-machine

Last synced: 18 Jan 2025

https://github.com/jianninapinto/bandersnatch

This project implements a machine learning model using Random Forest, XGBoost, and Support Vector Machines algorithms with oversampling and undersampling techniques to handle imbalanced classes for classification tasks in the context of predicting the rarity of monsters.

altair imbalanced-classification imblearn machine-learning mongodb oversampling pycharm-ide pymongo python random-forest-classifier scikit-learn smote support-vector-machines undersampling xgboost

Last synced: 19 Jan 2025

https://github.com/korpog/br_cancer

Binary classifier for Breast Cancer Wisconsin Data Set created with scikit-learn and xgboost.

classification data-science machine-learning pandas python scikit-learn xgboost

Last synced: 19 Jan 2025

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project is created using Machine Learning and Regression methods- a statistical technique to predict the outcome of event which is to verify the users’ admission eligibility level, considering the universities they have chosen. This is achieved based on the algorithms implemented, when is user feed the application with the required information

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 19 Jan 2025

https://github.com/lexxai/goit_python_ds_hw_06

Модуль 6. Навчання без вчителя.  Кластерізація. KMeans. Principal Component Analysis

dbscan-clustering hdbscan-clustering kmeans kmeans-clustering opentsne optics-clustering pca python scikit-learn tsne

Last synced: 24 Jan 2025

https://github.com/lexxai/goit_python_ds_hw_04

Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 24 Jan 2025

https://github.com/dllllb/ds-pipeline

Data Science model pipeline based on SciKit-Learn Estimator API

data-science machine-learning python scikit-learn

Last synced: 19 Jan 2025

https://github.com/prakharchoudhary/mlchallenge-2

My submission for machine learning challenge #2, organised by hackerEarth.

adaboost gradient-boosting-classifier jupyter-notebook machine-learning python scikit-learn

Last synced: 19 Jan 2025

https://github.com/davipythonweb/price_api

API de Previsão de Preço de casa com python/Machine-Learn

flask machine-learning pickle python python-dotenv scikit-learn venv

Last synced: 19 Jan 2025

https://github.com/z-fran/walmart-store-sales-forecasting

Data analysis and machine learning solution in Python for the Kaggle competition Walmart Recruiting - Store Sales Forecasting.

machine-learning sales-analysis sales-forecasting sales-prediction scikit-learn walmart-sales-forecasting

Last synced: 19 Jan 2025

https://github.com/lren-chuv/sklearn_to_pfa

Convert Scikit Learn models to PFA

pfa-standard scikit-learn

Last synced: 19 Jan 2025

https://github.com/amirdora/python_ml_supervisedlearning_example

Building Classification Models with scikit-learn

machine-learning python3 scikit-learn

Last synced: 19 Jan 2025

https://github.com/ax-va/python-machine-learning-recipes-gallatin-albon-2023

Machine learning recipes in Python with scikit-learn, OpenCV, PyTorch, and other libraries, including classical machine learning and neural networks, based on the book "Machine Learning with Python Cookbook: Practical Solutions from Preprocessing to Deep Learning", Second Edition, by Kyle Gallatin and Chris Albon published by O'Reilly Media in 2023

ax-va data-science deep-learning image-processing machine-learning neural-networks opencv opencv-python python pytorch scikit-learn

Last synced: 19 Jan 2025

https://github.com/enricobolzonello/ml_homeworks

Homeworks for the Machine Learning Course 2022/23 @ Unipd

linear-regression machine-learning neural-network scikit-learn svm

Last synced: 24 Jan 2025

https://github.com/balajig-24/titanic_data_analysics-

Project Title: Titanic Survival Prediction Project Overview The Titanic Survival Prediction project is a classic machine learning problem that aims to predict whether a passenger survived the Titanic disaster based on various features such as age, gender, passenger class, and more. This project demonstrates my ability to clean, analyze, and model.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 31 Jan 2025

https://github.com/edgar-mendonca/resvox-resume-ats-analyser

ResuVox is a Flask-based web application designed to help job seekers optimize their resumes for Applicant Tracking Systems (ATS).

bootstrap5 css3 html5 nltk python scikit-learn

Last synced: 19 Jan 2025

https://github.com/timothyjan/intro-machine-learning-classifiers

We will use the scikit-learn library, which is a higher-level machine learning library that will work with NumPy data, and Pandas, a library that makes it easier to manipulate data. We will explore a variety of classification algorithms, and compare their performance on a “real-world” dataset, which will introduce its own set of challenges.

numpy pandas python scikit-learn

Last synced: 31 Jan 2025

https://github.com/katiebristol/epsilon_fe2o3_controls

Exploratory Data Analysis using machine learning techniques as an exercise for GLY6932 (Data Science and Machine Learning Methods in the Geosciences) at the University of Florida.

biplot exploratory-data-analysis k-means-clustering machine-learning one-hot-encoding paleomagnetism principal-component-analysis random-forest rock-magnetism scikit-learn

Last synced: 24 Jan 2025

https://github.com/martolen1/data-science

Comprehensive repository of Data Science projects spanning Machine Learning, Deep Learning, and Natural Language Processing. Demonstrates practical applications of algorithms and tools on real-world datasets.

cnn-model data-analysis data-science data-visualization deep-learning gans-models keras machine-learning-algorithms natural-language-processing python3 rnn-lstm scikit-learn tensorflow transfer-learning transformers

Last synced: 20 Jan 2025

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 24 Jan 2025

https://github.com/eljandoubi/disasterresponsepipeline

Project aim is to build a Natural Language Processing (NLP) model to categorize messages on a real time basis.

flask nltk numpy pandas plotly scikit-learn scipy sqlalchemy

Last synced: 24 Jan 2025

https://github.com/szymon-budziak/ai_football_game_analysis

Football game analysis using YOLOv8 for object detection, Optical Flow for motion tracking, speed and distance calculations, perspective transformation, and K-Means clustering for pixel segmentation.

ai computer-vision kmeans object-detection optical-flow python3 pytorch roboflow scikit-learn segmentation supervision ultralytics yolov8

Last synced: 25 Jan 2025

https://github.com/apfirebolt/titanic_survival_prediction

Titanic survival prediction GUI application using scikit-learn and PyQT5

jupyter-notebook pandas prediction pyqt5 python scikit-learn titanic-kaggle

Last synced: 25 Jan 2025

https://github.com/loudji971/chatbot-intent-classifier

Chatbot based on natural language processing (NLP) and deep learning for accurate intent classification in conversations. - Artificial Inteligence Tecniques

ai atis-dataset bert chat-bot-deep-learning deep-neural-networks fastapi intent-classification keras nlp nltk nltk-keras-python nlu-engine scikit-learn tridib-samanta

Last synced: 25 Jan 2025

https://github.com/gamowy/urbansounds-classification

Classification of urban sounds using Tensorflow Keras

keras machine-learning python scikit-learn tensorflow

Last synced: 25 Jan 2025

https://github.com/hvalfangst/azure-functions-pandas

Azure Functions for ETL operations using Pandas. Uploaded CSV files trigger data processing, calculating correlations and storing results in a JSON file. Automated deployment via GitHub Actions and Terraform.

az-204 azure azure-functions azure-functions-python pandas python scikit-learn terraform

Last synced: 25 Jan 2025

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 25 Jan 2025

https://github.com/ipascrlet/pakistan-infant-mortality-analysis

Explore the factors affecting infant mortality rates in Pakistan through this comprehensive analysis project. Dive into the data to uncover patterns and insights that could potentially inform healthcare policies and interventions.

api correlation-matrix data-analysis data-science data-visualisation machine-learning numpy pakistan ridge-regression scikit-learn seaborn team-project unicef wdi

Last synced: 25 Jan 2025

https://github.com/mhmudfzli/loan-approval-prediction

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

automl catboost numpy pandas python scikit-learn seaborn

Last synced: 25 Jan 2025

https://github.com/hafidaso/predicting-industrial-machine-downtime-level-3

This project aims to develop a predictive model using machine learning techniques to forecast machine failures based on historical operational data.

imbalanced-learning numpy pandas python scikit-learn seaborn xgboost

Last synced: 25 Jan 2025

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 25 Jan 2025

https://github.com/jbizzlefoshizzle/linear-and-ridge-regression

The purpose of this project was to analyze and predict housing prices using attributes or features such as square footage, number of bedrooms, number of floors, and so on.

linear-regression machine-learning machine-learning-algorithms regression-analysis regression-models ridge-regression scikit-learn scikitlearn-machine-learning train-test-split train-test-using-sklearn

Last synced: 25 Jan 2025

https://github.com/jbizzlefoshizzle/ibm_capstone_project

Used K-means clustering and mapping libraries to determine best cities in San Diego to open a Mexican restaurant

beautifulsoup4 folium-maps geopy pandas-python scikit-learn

Last synced: 25 Jan 2025

https://github.com/priyanshul28/ml_regression_eda_waiterstip

An EDA and Machine Learning Regression exercise on the Waiter's Tip dataset demonstrating the use of Linear Regression, Neural Network Regressors, Decision Trees, Random Forests, Linear SVR, XGBoost, etc. The models are optimized using hyperparameter tuning through GridSearchCV.

eda machine-learning regression scikit-learn seaborn

Last synced: 25 Jan 2025

https://github.com/chris-seoul/github-handson-ml

ML Research using Hands-on, Hugging Face, CrewAI, Gemini, Langchain etc

deep-learning ml neural-networks pandas scikit-learn tensorflow

Last synced: 25 Jan 2025