Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/pejpero/machine_learning

This repository contains two comprehensive machine learning projects using scikit-learn, demonstrating ensemble learning with a Voting Classifier and the comparison of linear and polynomial regression models on different datasets.

ensemble-learning linear-regression logistic-regression machine-learning polynomial-regression random-forest scikit-learn svm

Last synced: 21 Dec 2024

https://github.com/kavyachouhan/fake-news-detection-dravidian-language

This repository contains the code and resources for a machine learning project focused on detecting fake news in the Malayalam language, developed as part of the IITM-PAN BS AI-ML Challenge.

jupyter-notebook machine-learning numy pandas python scikit-learn

Last synced: 21 Dec 2024

https://github.com/cmdecker95/learning-ml

My first foray into machine learning, comprised of my early undergrad work (circa 2021-2022) 🤓

jupyter-notebook keras machine-learning scikit-learn tensorflow2

Last synced: 21 Dec 2024

https://github.com/divinenaman/color-extraction-api

Extract colours from images using K-means, along with FastAPI pipeline.

fastapi k-means-clustering scikit-learn

Last synced: 17 Dec 2024

https://github.com/joewlos/fantasy_football_monte_carlo_draft_simulator

Monte Carlo Fantasy Football Draft Simulator Featuring FastAPI, NextUI, and ODMantic

fantasy-football monte-carlo nextjs nextui odmantic pydantic python scikit-learn

Last synced: 15 Jan 2025

https://github.com/jpc963/previsao-de-vendas

Previsão de vendas que uma determinada empresa faria com base nas mídias utilizadas

jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 13 Jan 2025

https://github.com/lucasfranklinsilva/rnn-lstm

Modelo de Prevenção de Falhas em Turbinas Simuladas Utilizando Redes Neurais Recorrentes

data-visualization deep-learning jupyter-notebook keras machine-learning neural-networks python scikit-learn

Last synced: 14 Jan 2025

https://github.com/jawwad-fida/data-science-salary-estimator

A tool that estimates data science salaries (MAE ~ $ 11K) to help data scientists negotiate their income when they get a job.

data-science machine-learning project scikit-learn

Last synced: 07 Jan 2025

https://github.com/h-sarhan/hate-speech-classifier

Automatic Detection of Hate Speech and Offensive Content

nlp python scikit-learn

Last synced: 17 Jan 2025

https://github.com/vancenceho/college_projects

🎓 House of school/college homework, problem sets & mini-projects which consists mainly of data structures & algorithms

data-structures matlab pandas python queue scikit-learn seaborn stack

Last synced: 07 Jan 2025

https://github.com/abdel-17/facial-recognition

Facial recognition using Machine Learning in Python

machine-learning pca python scikit-learn

Last synced: 25 Dec 2024

https://github.com/dragonscypher/feastfinderai

Discover the best dining spots with FeastFinderAI!

folium pandas python scikit-learn sql

Last synced: 07 Jan 2025

https://github.com/adhadse/hands-on-machine-learning-book-notes-and-practice

This repo holds the Jupyter notebooks and datasets containing notes/comments on things I learned from this book. Feel free to use and learned from them.

data-science deep-learning jupyter-notebooks keras machine-learning python scikit-learn tensorflow

Last synced: 17 Dec 2024

https://github.com/alisonmitchell/titanic

Exploration of a subset of the Titanic passenger manifest to create a predictive classification model to determine which passengers were more likely to survive.

deep-learning keras machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn tensorflow

Last synced: 15 Jan 2025

https://github.com/heyitsjoealongi/fantasy-football-qbwr-model

Fantasy Football: Quarterback / Wide Receiver - Gaussian Process Regression (GPR) Machine Learning Model

machine-learning matplotlib model numpy python scikit-learn

Last synced: 14 Dec 2024

https://github.com/djdhairya/football-match-prediction

In this project, we'll predict the winner of football matches in the English Premier League (EPL).

jupyter-notebook machine-learning pandas python3 requests scikit-learn vscode

Last synced: 07 Jan 2025

https://github.com/jihoonerd/restricted-discriminant-analysis

RDA implementation compatible with Scikit-learn API

discriminant-analysis rda scikit-learn

Last synced: 15 Jan 2025

https://github.com/andystmc/nextflownyc

Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting

Last synced: 07 Jan 2025

https://github.com/billy0402/python-machine-learning

A learning project from NTUB machine learning course.

ai course jupyter-notebook python scikit-learn tensorflow

Last synced: 14 Jan 2025

https://github.com/hmasdev/sklearn-protocols

sklearn-protocols: protocols for sklearn regressors, classifiers and transformers

protocol python python3 scikit-learn

Last synced: 29 Dec 2024

https://github.com/netcodez/climate-prediction-pipeline

Predicting London's climate using machine learning techniques. This project aims to forecast mean temperature in Celsius (°C) using various regression models and logging experiments with MLflow

huggingface machine-learning mlflow mlflow-tracking mlflow-tracking-server mlops python scikit-learn streamlit

Last synced: 15 Jan 2025

https://github.com/simon2k/stock-price-prediction-evaluation

This project is indented to present a small evaluation of different types of regression models for predicting stock prices for AAPL.

evaluation machine-learning numpy pandas predicting-stock-prices scikit-learn

Last synced: 12 Jan 2025

https://github.com/mohd-faizy/preprocess_ml

This repository hosts Python code that utilizes the Scikit-learn preprocessing API for data preprocessing. The code presents a comprehensive range of tools that handle missing data, scale data, encode categorical variables, and perform other functions.

data-science feature-engineering feature-engineering-algorithm feature-extraction feature-selection machine-learning outlier-detection preprocessing-data preprocessor scikit-learn

Last synced: 12 Jan 2025

https://github.com/hassanislam463/nyc_airbnb_eda

This project is a comprehensive data analysis of Airbnb listings in New York City, exploring pricing trends, seasonality effects, host market dynamics, rental preferences, and revenue estimation. It provides valuable insights for hosts, investors, and policymakers to optimize Airbnb operations and understand the short-term rental landscape in NYC.

exploratory-data-analysis matplotlib python scikit-learn seaborn

Last synced: 25 Jan 2025

https://github.com/1adore1/deadlock-match-tracker-bot

Telegram bot for tracking real-time Deadlock matches of top 250 players. Fetches match data and predicts winners using a machine learning model.

aiogram api deadlock pandas python scikit-learn

Last synced: 25 Jan 2025

https://github.com/yelamankarassay/personal-health-wellness-dashboard

A Streamlit-based dashboard for visualizing and analyzing personal daily data—weight, mood, meals, sleep, and more. This project uses pandas, plotly, matplotlib, seaborn, scikit-learn, and wordcloud to present insights about your health and daily habits.

matplotlib pandas plotly scikit-learn seaborn wordcloud

Last synced: 25 Jan 2025

https://github.com/lc-rezende/eqx_boston_dataset

Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.

data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn

Last synced: 18 Jan 2025

https://github.com/aldotestino/word-freq-email-classification

Simple email classifier using word frequency and Logistic Regression

docker email-classification fastapi logistic-regression python react scikit-learn

Last synced: 18 Jan 2025

https://github.com/priyanshul28/ml_classification_eda_parkinsonsdisease

A guided Machine Learning Classification exercise on the Parkinson's Disease dataset demonstrating the use of Logistic Regression, Neural Network Classifiers, Decision Trees, Random Forests and XGBoost algorithms, as well as Data Preprocessing and Exploratory Data Analysis.

classification machine-learning pandas python scikit-learn statistics

Last synced: 25 Jan 2025

https://github.com/darshhv/fraud-detection-system

A machine learning project for detecting fraudulent transactions using Random Forest and XGBoost models, with data preprocessing and model evaluation.

data-preprocessing-and-cleaning fraud-detection-using-machine-learning model-evaluation pandas random-forest scikit-learn xgboost

Last synced: 25 Jan 2025

https://github.com/abhivur/connections-ai---tamu-datathon-2024

Contributors: Meet Gamdha, Gaurav Nimmagadda

bert python scikit-learn

Last synced: 25 Jan 2025

https://github.com/aryar-06/linear-regression

A Python project demonstrating basic linear regression with gradient descent and matrix operations, alongside scikit-learn comparison.

data-analysis data-preprocessing educational-project gradient-descent linear-regression machine-learning python regression-algorithms scikit-learn

Last synced: 25 Jan 2025

https://github.com/avtorgenii/ml-playground

A repository for exploring and experimenting with datasets, building machine learning models, and testing various techniques in data preprocessing, feature engineering, and model evaluation.

matplotlib ml pandas scikit-learn

Last synced: 25 Jan 2025

https://github.com/ledsouza/nlp-article-classification

This project aims to develop a machine learning model capable of classifying news articles into different categories based on their titles. Two different word embedding models (CBOW and Skip-gram) are trained and used to vectorize the article titles. These vectorized representations are then used to train a Logistic Regression classifier.

gensim-word2vec natural-language-processing nlp nlp-machine-learning pandas python scikit-learn spacy spacy-nlp

Last synced: 30 Jan 2025

https://github.com/ki3mono/naive_bayes_classifier

This project implements Naive Bayes Classifiers for two data types: Multinomial Naive Bayes Classifier and Gaussian Naive Bayes Classifier

iris-dataset mushroom-dataset naive-bayes-classifier numpy python scikit-learn

Last synced: 25 Jan 2025

https://github.com/rajan-bhateja/machine-learning-with-python

Machine learning algorithms implemented using Scikit-learn

classification clustering machine-learning regression scikit-learn sklearn

Last synced: 18 Jan 2025

https://github.com/atkaridarshan04/machine-learning-algorithms

Machine Learning Algorithms with Python and SciKit-Learn

machine-learning machine-learning-algorithms python scikit-learn

Last synced: 30 Jan 2025

https://github.com/rishi-sutar/healwise-ai-your-way-to-wellness

Healwise-AI is a health diagnostic tool that uses a Support Vector Classifier (SVC) model to predict diseases based on user-reported symptoms. After predicting, it offers detailed health advice, including descriptions, diets, medications, and workouts related to the diagnosis.

machine-learning scikit-learn support-vector-machine

Last synced: 18 Jan 2025

https://github.com/jianninapinto/bandersnatch

This project implements a machine learning model using Random Forest, XGBoost, and Support Vector Machines algorithms with oversampling and undersampling techniques to handle imbalanced classes for classification tasks in the context of predicting the rarity of monsters.

altair imbalanced-classification imblearn machine-learning mongodb oversampling pycharm-ide pymongo python random-forest-classifier scikit-learn smote support-vector-machines undersampling xgboost

Last synced: 19 Jan 2025

https://github.com/anshvaid4/ml_practice

This is the new repository, where I have added all the notebooks demonstrating the usage of various transformers and models for Supervised and Unsupervised algorithms

anaconda jupyter-notebook machine-learning machine-learning-algorithms python scikit-learn

Last synced: 25 Jan 2025

https://github.com/korpog/br_cancer

Binary classifier for Breast Cancer Wisconsin Data Set created with scikit-learn and xgboost.

classification data-science machine-learning pandas python scikit-learn xgboost

Last synced: 19 Jan 2025

https://github.com/parthapray/nlp_pipeline_openai

This repo contains nlp pipeline and openai API integration

gradio matplotlib networkx nltk openai rake-nltk scikit-learn seaborn spacy textblob textstat wordcloud

Last synced: 26 Dec 2024

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project is created using Machine Learning and Regression methods- a statistical technique to predict the outcome of event which is to verify the users’ admission eligibility level, considering the universities they have chosen. This is achieved based on the algorithms implemented, when is user feed the application with the required information

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 19 Jan 2025

https://github.com/lexxai/goit_python_ds_hw_06

Модуль 6. Навчання без вчителя.  Кластерізація. KMeans. Principal Component Analysis

dbscan-clustering hdbscan-clustering kmeans kmeans-clustering opentsne optics-clustering pca python scikit-learn tsne

Last synced: 24 Jan 2025

https://github.com/lexxai/goit_python_ds_hw_04

Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 24 Jan 2025

https://github.com/dllllb/ds-pipeline

Data Science model pipeline based on SciKit-Learn Estimator API

data-science machine-learning python scikit-learn

Last synced: 19 Jan 2025

https://github.com/kengz/feature_transform

Build ColumnTransformers (Scikit or DaskML) for feature transformation by specifying configs.

column-transformer dask-ml dataset feature-engineering feature-transformation machine-learning scikit-learn

Last synced: 03 Jan 2025

https://github.com/prakharchoudhary/mlchallenge-2

My submission for machine learning challenge #2, organised by hackerEarth.

adaboost gradient-boosting-classifier jupyter-notebook machine-learning python scikit-learn

Last synced: 19 Jan 2025

https://github.com/teja-1403/coursera-machine-learning-with-python-honors

This project involves building a classifier to predict rainfall for the next day based on weather data from the Australian Government's Bureau of Meteorology. Various machine learning techniques such as Linear Regression, KNN, Decision Trees, Logistic Regression, and SVM were implemented and evaluated.

classification hierarchical-clustering machine-learning regression scikit-learn scipy

Last synced: 03 Jan 2025

https://github.com/donmaruko/sentiment-analysis-api

Flask-based API for sentiment analysis using deep learning models and includes endpoints for text and file input, database storage, and integrated Swagger documentation.

api deep-learning deep-neural-networks flask keras lstm machine-learning neural-network rnn scikit-learn scikitlearn-machine-learning sklearn sqlite3 swagger swagger-ui tensorflow

Last synced: 04 Jan 2025

https://github.com/analitico-771/machine_learning_trading_bot

This is an Application that implements an algorithmic trading strategy that uses machine learning to automate the trade decisions

financial-analysis hvplot logistic-regression machine-learning moving-average pandas-dataframe predictive-modeling python scikit-learn stock-price-prediction support-vector-machine

Last synced: 03 Jan 2025

https://github.com/davipythonweb/price_api

API de Previsão de Preço de casa com python/Machine-Learn

flask machine-learning pickle python python-dotenv scikit-learn venv

Last synced: 19 Jan 2025

https://github.com/mtlh/fyp_prempredict

In PremPredict, players will predict all Premier League games. Compete against the algorithm and other users across a full season. Scoring points for every correct result/prediction.

django prediction premierleague python scikit-learn tailwindcss

Last synced: 03 Jan 2025

https://github.com/z-fran/walmart-store-sales-forecasting

Data analysis and machine learning solution in Python for the Kaggle competition Walmart Recruiting - Store Sales Forecasting.

machine-learning sales-analysis sales-forecasting sales-prediction scikit-learn walmart-sales-forecasting

Last synced: 19 Jan 2025

https://github.com/vhnegrisoli/machine-learning-linguagens-programacao

Projeto de Data Science e Machine Learning de análise de linguagens de programação de 2004 a 2021

data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 10 Jan 2025

https://github.com/muhdhammad/machine-learning

Crafted for hands-on learning and implementation of machine learning concepts and techniques.

data-science jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 17 Jan 2025

https://github.com/luliatuccu/weather_analysis

This project highlights a combination of data science techniques and Python programming to explore real-world weather data.

data-preprocessing eda feature-engineering machine-learning matplotlib numpy pandas regex scikit-learn seab seaborn weather weather-patterns

Last synced: 25 Jan 2025

https://github.com/lren-chuv/sklearn_to_pfa

Convert Scikit Learn models to PFA

pfa-standard scikit-learn

Last synced: 19 Jan 2025

https://github.com/vishant007/annadataa

A Website For Farmers To Guide Them Regarding Crop Prouction In Their Native Language

django flask-application google-collab kaggle machine-learning-algorithms numpy pandas python3 scikit-learn

Last synced: 10 Jan 2025

https://github.com/taquynhnga2001/regression-calories-burnt-prediction

Develop regression models which can predict the total calories a person has burnt during a workout based on some biological measures.

machine-learning python regression-analysis scikit-learn

Last synced: 25 Jan 2025

https://github.com/amirdora/python_ml_supervisedlearning_example

Building Classification Models with scikit-learn

machine-learning python3 scikit-learn

Last synced: 19 Jan 2025

https://github.com/ax-va/python-machine-learning-recipes-gallatin-albon-2023

Machine learning recipes in Python with scikit-learn, OpenCV, PyTorch, and other libraries, including classical machine learning and neural networks, based on the book "Machine Learning with Python Cookbook: Practical Solutions from Preprocessing to Deep Learning", Second Edition, by Kyle Gallatin and Chris Albon published by O'Reilly Media in 2023

ax-va data-science deep-learning image-processing machine-learning neural-networks opencv opencv-python python pytorch scikit-learn

Last synced: 19 Jan 2025

https://github.com/enricobolzonello/ml_homeworks

Homeworks for the Machine Learning Course 2022/23 @ Unipd

linear-regression machine-learning neural-network scikit-learn svm

Last synced: 24 Jan 2025

https://github.com/edgar-mendonca/resvox-resume-ats-analyser

ResuVox is a Flask-based web application designed to help job seekers optimize their resumes for Applicant Tracking Systems (ATS).

bootstrap5 css3 html5 nltk python scikit-learn

Last synced: 19 Jan 2025

https://github.com/abhivur/graduate-income-forecaster---aggie-data-science-club-2024

Contributors: Abdussalam Raheem, Chiara Su, and Joseph Botros

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 25 Jan 2025

https://github.com/himanshkr03/loan_default_prediction_using_machine_learning

This repository contains a Python-based project that uses machine learning to predict loan defaults. It explores data preprocessing, feature engineering, and model training techniques to build a predictive model for assessing loan risk.

data-science finance loan-default-prediction machine-learning pandas prediction-model python risk-assessment scikit-learn

Last synced: 25 Jan 2025

https://github.com/thaisgarcia/scikit-learn

Utilizei aprendizado supervisionado, mais especificamente regressão linear, para prever salários com base no tempo dedicado aos estudos mensais. O modelo treinado estabeleceu uma relação matemática entre salário e horas de estudo, ajustando parâmetros durante o treinamento.

pandas scikit-learn seaborn

Last synced: 10 Jan 2025

https://github.com/waikato-datamining/shallowflow-sklearn

scikit-learn support for the shallowflow Python workflow system.

python3 scikit-learn sklearn workflow-engine

Last synced: 02 Jan 2025

https://github.com/ranimeshehata/softmax-regression-on-mnist

A PyTorch-based project for classifying the MNIST dataset using Softmax Regression, including training, validation, results and visualization.

matplotlib mnist python3 pytorch scikit-learn softmax-regression torchvision

Last synced: 25 Jan 2025

https://github.com/katiebristol/epsilon_fe2o3_controls

Exploratory Data Analysis using machine learning techniques as an exercise for GLY6932 (Data Science and Machine Learning Methods in the Geosciences) at the University of Florida.

biplot exploratory-data-analysis k-means-clustering machine-learning one-hot-encoding paleomagnetism principal-component-analysis random-forest rock-magnetism scikit-learn

Last synced: 24 Jan 2025

https://github.com/brianlesko/maze-runner

Developed a Python-based maze-crawling application using a PS5 controller interface. This project highlights skills in software-hardware integration and low-code UI design, demonstrating expertise ideal for advanced software engineering.

communication dualsense engineer engineering hacking hardware hardware-hacking interface low-code-ui mechanical-engineer mechanical-engineering protocol ps5 python robotics-engineer scikit-learn software sony streamlit ui

Last synced: 26 Dec 2024

https://github.com/martolen1/data-science

Comprehensive repository of Data Science projects spanning Machine Learning, Deep Learning, and Natural Language Processing. Demonstrates practical applications of algorithms and tools on real-world datasets.

cnn-model data-analysis data-science data-visualization deep-learning gans-models keras machine-learning-algorithms natural-language-processing python3 rnn-lstm scikit-learn tensorflow transfer-learning transformers

Last synced: 20 Jan 2025

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 25 Jan 2025

https://github.com/pkini2002/hpe_cty

Repository to maintain the learnings of the technologies used for the CTY'23 Project Work provided by HPE

computer-networks docker docker-container linux python scikit-learn swarm-learning ubuntu

Last synced: 02 Jan 2025

https://github.com/selcia25/iris-dataset-classification

☘This repository contains a Python script for classifying the Iris dataset using the Random Forest algorithm.

data-processing iris-classification pandas random-forest-classifier scikit-learn

Last synced: 02 Jan 2025

https://github.com/nirmaldeepponnada/codeclauseinternshipproject2

Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.

nltk numpy pandas pickle python scikit-learn scipy

Last synced: 02 Jan 2025