Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/udityamerit/breast-cancer-prediction-using-different-ml-models

The `all_MLpackage` is a Python tool for breast cancer detection using scikit-learn's machine learning algorithms. It automates model training, generates accuracy comparisons, and helps identify the best-performing algorithm for diagnosis.

classification knn-algorithm logistic-regression machine-learning numpy pandas scikit-learn svm-model

Last synced: 15 Dec 2024

https://github.com/nazchanel/fake-news-detection-algorithm

A fake news detection algorithm. This repository contains the various variations of my original project. WIP.

dataset deep-learning fake-news-detection machine-learning-algorithms natural-language-processing scikit-learn work-in-progress

Last synced: 09 Nov 2024

https://github.com/shourya1997/programming-language-detector

The Software will enable the user to detect the programming language without the help of file extension.

machine-learning python scikit-learn scikitlearn-machine-learning

Last synced: 11 Nov 2024

https://github.com/bhavik-jikadara/ai-ml-roadmap

Welcome to the ultimate guide for starting your journey in Artificial Intelligence and Machine Learning in 2024! This roadmap provides a step-by-step approach to mastering AI and ML, from fundamentals to advanced topics.

artificial-intelligence computer-vision deep-learning deployment fundamentals-of-programming keras libraries machine-learning mathematics mlops natural-language-processing production-code pytorch reinforcement-learning roadmap scikit-learn tensorflow tools

Last synced: 09 Nov 2024

https://github.com/kohlerhector/dpdt-py

Implementation of Dynamic Programming Decision Tree algorithm (Kohler et. al. 2024).

decision-tree-classifier decision-trees dynamic-programming scikit-learn scikitlearn-machine-learning sklearn sklearn-classifier

Last synced: 08 Nov 2024

https://github.com/alextanhongpin/spam-api

Microservices for spam filtering system

python scikit-learn

Last synced: 02 Dec 2024

https://github.com/shreyansh055/time-series-forecasting_055

The Time Series Forecasting Project predicts future trends using historical data with Python, Pandas, and models like ARIMA, LSTM, and Prophet, focusing on scalable, accurate forecasting for business and finance.

lstm matplotlib numpy pandas python scikit-learn seaborn

Last synced: 31 Oct 2024

https://github.com/sn1027/hand-gesture-recognition-using-machine-learning-and-mediapipe

This Hand gesture recognition project using mediapipe is developed to recognize various hand gestures. The user can custom train any number of various hand gestures to train a model.

computer-vision customization custommodels handgesture-recognition machine-learning mediapipe modeltraining scikit-learn scikitlearn-machine-learning

Last synced: 23 Nov 2024

https://github.com/aayushker/devfoolio

Uncover originality, empower authenticity

django-rest-framework keybert nextjs nltk plagiarism-detection scikit-learn

Last synced: 23 Nov 2024

https://github.com/h-fuzzy-logic/python-finding-nsf-award-themes

Using NLP to find themes and concepts in NSF Awards

nltk pandas python scikit-learn

Last synced: 15 Dec 2024

https://github.com/anans9/ham-spam-analysis

A significant aspect of this application is its advanced ham and spam analysis system. This feature is crucial in maintaining the integrity and quality of the email content.

flask matplotlib pandas python scikit-learn

Last synced: 28 Dec 2024

https://github.com/troublem1/mle

MultiLabel-Transformer(MLE) is an extended version of a LabelEncoder, such that, it encodes multiple categorical columns to numeric in any workflow or pipeline

packages python3 scikit-learn sklearn

Last synced: 01 Dec 2024

https://github.com/jasper-koops/easy-gscv

This library allows you to quickly train machine learning classifiers by automatically splitting the data set and using both grid search and cross validation in the training process.

classification machine-learning python3 scikit-learn

Last synced: 20 Dec 2024

https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction

Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.

breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn

Last synced: 22 Nov 2024

https://github.com/yvesemmanuel/machine_learning

Implements data problems solved with machine learning algorithms.

data-science keras keras-tensorflow linear-algebra machine-learning neural-network python scikit-learn

Last synced: 16 Nov 2024

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 01 Dec 2024

https://github.com/zazi2002/machine-learning-project

Introduction to Machine Learning project with the goal of improving the classification performance on a dataset by optimizing the number of features and weak learners.

dimentionality-reduction ensemble-learning numpy pca random-forest scikit-learn

Last synced: 28 Dec 2024

https://github.com/spamfromaditya/drugs-consumption-prediction-model-eda-bagging-classifier

Drug consumption prediction models are like crystal balls for public health. By analyzing vast amounts of data, these models can identify individuals or communities at higher risk of drug use. They consider factors like demographics, social media activity, prescription history, and even economic indicators.

bagging-classifier machine-learning matplotlib numpy python scikit-learn

Last synced: 31 Dec 2024

https://github.com/francescopaolol/logisticregression

About predicting survival on the Titanic and get familiar with ML basics

jupyter-notebook kaggle logistic-regression machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/francescopaolol/sentimentanalysis

About sentiment analysis on IMDB Dataset of 50K Movie Reviews

jupyter-notebook kaggle machine-learning ml pandas scikit-learn sentiment-analysis

Last synced: 22 Dec 2024

https://github.com/magnuss0/movie-rec-system

The project extracts movie data using TheMovieDB API, processes it using TF-IDF and cosine similarity for generating recommendations, and stores the data in a DuckDB database. The system is encapsulated within a FastAPI web application and can be deployed using Docker. It provides movie recommendations in JSON format.

cosine-similarity docker duckdb movies-recommendation moviesdb-api ploomber poetry-python scikit-learn streamlit tf-idf

Last synced: 25 Nov 2024

https://github.com/francescopaolol/favoritatimeseriesforecasting

See: https://www.kaggle.com/competitions/store-sales-time-series-forecasting

jupyter-notebook kaggle-competition machine-learning pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/francescopaolol/titaniccompetition

It's my first kaggle competition about predict survival on the Titanic and get familiar with ML basics

jupyter-notebook kaggle-competition machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighboursClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 01 Dec 2024

https://github.com/shaadclt/data-preprocessing-pipeline

This project contains a data preprocessing pipeline implemented in Python using the pandas and numpy libraries. The pipeline handles missing values, outliers, and normalizes numeric features in a dataset.

numpy pandas scikit-learn

Last synced: 07 Dec 2024

https://github.com/francescopaolol/decisiontree

About classify iris plants into three species in this classic dataset

decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/aryank1511/wattwise

WattWise is an innovative energy-saving app that uses an Arduino-powered device to monitor and predict household electricity usage and bills in real-time.

arduino docker flask machine-learning mqtt nextjs scikit-learn

Last synced: 10 Oct 2024

https://github.com/oneapi-src/product-recommendations

AI Starter Kit for product recommendation system using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/ax-va/numpy-pandas-matplotlib-scikit-learn-vanderplas-2023

These examples provide an introduction to Data Science and classic Machine Learning using NumPy, Pandas, Matplotlib, and scikit-learn. They are taken, with some changes, from the book "Python Data Science Handbook: Essential Tools for Working with Data", Second Edition, written by Jake VanderPlas and published by O'Reilly Media in 2023.

ax-va classic-machine-learning data-science machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 18 Nov 2024

https://github.com/jdavydovportfolio/careerpredictor

A project leveraging AI and Machine Learning (Logistic Regression) to predict graduate job placements. Includes data preprocessing, exploratory analysis, and predictive modeling.

artificial-intelligence exploratory-data-analysis jupyter-notebook linear-regression logistic-regression machine-learning machine-learning-algorithms machine-learning-models matplotlib ml numpy pandas pandas-dataframe predictive-modeling programming python scikit-learn

Last synced: 07 Dec 2024

https://github.com/mehmoodulhaq570/machine-learning-models

A repository consisting of machine learning models for predicting the future instance. More specifically this repository is a Machine Learning course for those who are interested in learning the basics of machine learning algorithms.

decision-trees gradient-descent gradient-descent-algorithm knn-algorithm linear-regression linear-regression-models logistic-regression-algorithm machine-learning-algorithms machine-learning-models ml naive-bayes-algorithm one-hot-encoding pca python random-forest-classifier scikit-learn svm-model

Last synced: 22 Dec 2024

https://github.com/colinwu0403/heartbpmusic

Music discovery platform that recommends you a song based on your heart's BPM and your mood using Machine Learning.

django neurokit2 scikit-learn spotify-web-api vuejs

Last synced: 30 Dec 2024

https://github.com/ladityagogoi/shadowguard

The ShadowGuard Browser Extension is a powerful tool designed to enhance user experience by identifying and highlighting potential dark patterns on websites. Our extension employs a combination of machine learning algorithms and natural language processing (NLP) models to detect and classify various deceptive design practices

css flask html javascript joblib numpy pandas python scikit-learn

Last synced: 13 Nov 2024

https://github.com/gmontamat/quora-question-pairs

Code for the Kaggle competition "Quora Question Pairs"

kaggle-competition quora-question-pairs scikit-learn spell-checker xgboost

Last synced: 30 Oct 2024

https://github.com/ewertondrigues02/previsao-de-vendas

Previsão de vendas de uma empresa fictícia onde foi feita análise com ferramentas como Jupyter Notebook, Google Colab, Python e bibliotecas de Machine Learn como: regressão linear, arvore de decisão, scikit-learn

analise-de-dados analise-exploratoria arvore-de-decisao ciencia-de-dados colab excel google-colab jupyter jupyter-notebook machine-learning previsao previsao-de-vendas python3 regressao-linear scikit-learn

Last synced: 18 Nov 2024

https://github.com/m-rishab/credbet

A loan prediction web app which tells You that you are eligible for loan or not!

decision-tree-classifier matplotlib numpy pandas python scikit-learn

Last synced: 21 Nov 2024

https://github.com/aditya172926/text_summarization

Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.

glove machine-learning nlp python scikit-learn spacy

Last synced: 24 Nov 2024

https://github.com/grampers-dev/co2oracle

The CO2 Oracle project uses machine learning and AI to analyze and predict CO2 emissions for environmental management. Using a Kaggle dataset, it demonstrates predictive analytics to understand and forecast emissions. Written in Python, it employs libraries like Pandas, NumPy, and Scikit-Learn.

artificial-intelligence machine-learning numpy pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/2003harsh/house-price-prediction-using-machine-learning

This project features a web app that predicts house prices using a linear regression model. Users can input details like location, square footage, bathrooms, and bedrooms through an HTML form. I've added a CI/CD pipeline with GitHub Actions, unit testing with pytest, and automated Docker containerization to improve deployment and robustness.

ci-cd data-analysis docker-image flask linear-regression machine-learning matplotlib mlops-workflow requests scikit-learn

Last synced: 10 Oct 2024

https://github.com/upul/chocolate-quality-analysis

This repository contains a Jupiter notebook which describes how to use basic machine learning tools such Scikit-Learning, Pandas, and Numpy for buiding models.

machine-learning numpy pandas predictive-analytics scikit-learn

Last synced: 18 Nov 2024

https://github.com/sethios-notebook/__ia_learnig__

Formation Python spéciale Machine Learning francais. Apprendre Python en 30 fichiers qui contiennent une formation sur Numpy, Pandas, Matplotlib, Scipy, Sklearn, Seaborn, H5py, et bien d'autres techniques. Python est le langage d'excellence pour le machine learning, le deep learning, et la data science.

iac machine-learning matplotlib numpy python scikit-learn

Last synced: 21 Dec 2024

https://github.com/aakanksha1406/fake-news-classifier

to identify when an article might be fake news

keras lstm lstm-neural-networks nltk python scikit-learn tensorflow

Last synced: 10 Oct 2024

https://github.com/guoshijiang/scikit-learn

带你一起学习scikit-learn

nlp-machine-learning scikit-learn

Last synced: 24 Nov 2024

https://github.com/soumya6tiwari/customer-segmentation-using-rfm-analysis

This project focuses on customer segmentation using RFM (Recency, Frequency, Monetary) analysis and K-Means clustering. It enables businesses to identify high-value customers, optimize marketing strategies, and improve customer retention through data-driven insights.

backend clustering flask frontend kmeans-clustering matplotlib numpy pandas python rfm-analysis scikit-learn unsupervised-learning

Last synced: 21 Dec 2024

https://github.com/bhuvaneshwarguttula/student-performance-indicator

To understand and predict how the student's performance (test scores) is affected by the other variables (Gender, Ethnicity, Parental level of education, Lunch, Test preparation course).

exploratory-data-analysis machine-learning pandas python scikit-learn student-performance-analysis

Last synced: 10 Oct 2024

https://github.com/prithivsakthiur/data-board

Data Boards - Visualization of various plots ( Analysis )

data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces

Last synced: 21 Dec 2024

https://github.com/zen204/airbnb_availability

A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.

binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning

Last synced: 03 Nov 2024

https://github.com/gappeah/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 10 Oct 2024

https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis

Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.

data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn

Last synced: 21 Dec 2024

https://github.com/khaymanii/multiple-disease-prediction-system

This system predicts if a patient has heart, parkinsons and diabetes disease

matplotlib numpy pandas python scikit-learn

Last synced: 20 Nov 2024

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal of predicting the sale price of bulldozers. How well can we predict the future sale price of a bulldozer, given its characteristics previous examples of how much similar bulldozers have been sold for? (Archive kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 01 Dec 2024

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 10 Oct 2024

https://github.com/dinhanhx/determination

Scripts to set global random for some machine learning framework

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 30 Nov 2024

https://github.com/offchan42/thai-thesis-classification

Classify each document inside the corpus using Python machine learning module: scikit-learn

nlp python python2 scikit-learn segment thai thai-language thai-thesis-classification

Last synced: 29 Oct 2024

https://github.com/tharindanimnajith/deep-learning-spam-detection

Deep Learning classifiers to detect spam SMS messages - LSTM Model, DenseNet CNN Models - NLP, Python, Jupyter Notebook, Tensorflow, Keras, Numpy, Pandas, Matplotlib, Scikit-Learn

deep-learning densenet keras lstm nlp python3 scikit-learn tensorflow

Last synced: 25 Nov 2024

https://github.com/farrajota/kaggle_titanic

My solutions to the "Titanic: Machine Learning from Disaster" kaggle competition

docker docker-compose kaggle kaggle-competition kaggle-titanic notebook pyspark python scikit-learn

Last synced: 17 Nov 2024

https://github.com/md-emon-hasan/ml-project-car-price-prediction

🚗 End-to-end ML project for predicting car prices based on various features. Includes data preprocessing, model training, and a Flask web for predictions.

car-price-prediction car-price-predictor data-science feature-engineering ml predictive-modeling scikit-learn

Last synced: 10 Oct 2024

https://github.com/lfenzo/ml-solar-sao-paulo

Implementation of scientific project regarding the use of Machine Learning in Solar Radiation Prediction

forecasting machine-learning python scikit-learn

Last synced: 17 Nov 2024

https://github.com/f-aguzzi/chemfusekit

Chemometrics library for data fusion, model training and prediction of data from multiple sensor sources.

chemometrics datafusion knn lda pca plsda scikit-learn svm

Last synced: 21 Dec 2024

https://github.com/rickiepark/ml-ko

머신러닝, 딥러닝 한글 번역 저장소

deep-learning keras machine-learning python scikit-learn tensorflow

Last synced: 21 Dec 2024

https://github.com/rickiepark/ml-with-python-cookbook-2nd

<실무로 통하는 ML 문제 해결 with 파이썬>

deep-learning machie-learning pytorch scikit-learn

Last synced: 21 Dec 2024

https://github.com/subhangisati/identification-of-gemstone-using-mobilenetv2-and-transfer-learning

The project focuses on Identification of various Gemstone. The dataset consists of 87 classes.It shows the whole progress and model used to achieve final accuracy. You will gain knowledge of Computer Vision, The model used are CNN(Convolutional Neural Network), MobileNetV2 and VGGNet,The final model used was transfer learning with model MobileNetV2

artificial-intelligence classification computer-vision convolutional-neural-networks cv2-library data-science deep-learning deep-neural-networks keras-neural-networks keras-tensorflow machine-learning multiclass-image-classification numpy-library pandas-dataframe python scikit-learn seaborn-python tensorflow

Last synced: 08 Nov 2024

https://github.com/texnoforge/texnomagic

TexnoMagic library for digital Magic

gmm magic numpy python recognition scikit-learn scipy

Last synced: 12 Nov 2024

https://github.com/dustinmichels/bayesian-values-guesser

Uses some user input, data from the World Values Survey <www.worldvaluessurvey.org>, and Bayes Rule to guess a number of beliefs the user might have. STATUS: In progress.

bayes-rule bayesian-values-guesser naive-bayes-classifier pandas python scikit-learn values-survey

Last synced: 14 Dec 2024

https://github.com/aahnik/gdsc-ml-ds-bootcamp-2023

This repo contains files given by my seniors as well as assignments and final project done by me during the bootcamp.

data-science machine-learning ml numpy pandas python3 scikit-learn

Last synced: 11 Oct 2024

https://github.com/swimshahriar/heart-attack-prediction

Heart attack prediction from 13 features.

jupyter-notebook pandas python3 scikit-learn

Last synced: 20 Dec 2024

https://github.com/kingabzpro/github-actions-for-machine-learning-beginners

A project on automating ML workflow using scikit-learn pipelines, CML, and GitHub actions.

cml github-actions machine-learning mlops scikit-learn

Last synced: 17 Nov 2024

https://github.com/kingabzpro/ml-workflow-orchestration-with-prefect

An introductory project to streamline the machine learning pipeline using Prefect and Discord Notifications, from data ingestion to model saving

discord mlops prefect scikit-learn

Last synced: 17 Nov 2024

https://github.com/sralter/happy_customers

Predicting whether a customer is happy based on the results from a survey.

eda ensemble-classifier hyperopt lazypredict ml scikit-learn

Last synced: 17 Nov 2024

https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning

This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.

decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc

Last synced: 19 Dec 2024

https://github.com/sralter/classifire

Wildfire Prediction Model: Samuel Alter's BrainStation 2023 Data Science Capstone Project

qgis scikit-learn tensorflow

Last synced: 17 Nov 2024

https://github.com/sarthak-1408/rain-fall-prediction

This repository represents the End to End Machine Learning Project (Rain Fall Prediction in Australia).

heroku heroku-deployment machine-learning numpy pandas rain-fall rain-fall-prediction scikit-learn xgboost-algorithm

Last synced: 16 Nov 2024