Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/moritzkoerber/text_analysis_app

A web app that classifies the content of messages that are usually sent during disasters such as earthquakes.

flask machine-learning nltk python scikit-learn

Last synced: 09 Jan 2025

https://github.com/moritzkoerber/tune_preprocessing_algos

Files for this blog post: https://moritzkoerber.github.io/python/tutorial/2019/11/18/blogpost/

cross-validation hyperparameter-tuning machine-learning python scikit-learn

Last synced: 09 Jan 2025

https://github.com/moritzkoerber/data_science_posts

This repository hosts the code for my data science related blog posts.

hyperparameter-tuning machine-learning pipeline python scikit-learn

Last synced: 09 Jan 2025

https://github.com/sk-g/mnist_beginners

Model search across traditional (non-deep-learning) machine learning algorithms, plus deep learning starter code, on the MNIST dataset. Good starter code for beginners learning about the curse of dimensionality, overfitting, and related concepts.

keras machine-learning machine-learning-algorithms mnist mnist-beginners mnist-classification mnist-dataset numpy overfitting python pytorch pytorch-implmention resnet resnet-50 scikit-learn scikitlearn-machine-learning sklearn tensorflow

Last synced: 16 Dec 2024

https://github.com/s-matke/eco-forecast

Machine learning model for predicting which European country generates the largest surplus of green energy.

data-science green-energy machine-learning scikit-learn supervised-learning

Last synced: 09 Dec 2024

https://github.com/daniil-leshchev/spotify_ml

Track Popularity Prediction based on Spotify Data

eda keras ml pandas scikit-learn

Last synced: 05 Dec 2024

https://github.com/andrewsy1004/linear-regression-model-for-house-price-prediction

A linear regression model to predict house prices based on features like size, location, and number of rooms. This project demonstrates the application of machine learning in real estate price estimation

linear-regression python scikit-learn xgbregressor

Last synced: 04 Dec 2024

https://github.com/nahom32/mlp-assignment

This repository implements a machine learning assignment that demonstrates the machine learning process.

eda logistic-regression machine-learning scikit-learn

Last synced: 05 Dec 2024

https://github.com/vijaykumarr1452/black_friday_sales_analysis

A Python machine learning project for Black Friday sales analysis, using pandas and scikit-learn for data preprocessing, model training, and performance evaluation.

confusion-matrix jupyter-notebook machine-learning pandas python random-forest-classifier sales-analysis scikit-learn

Last synced: 05 Dec 2024

https://github.com/giatraskon/machine_learning_assignments

Machine learning assignments covering regression, classification, neural networks, adversarial examples, and real-time emotion detection using Python. Includes theoretical insights and practical implementations.

adversarial-examples bayesian-inference bias-variance-tradeoff cifar10 classification deep-learning emotion-recognition iris-dataset k-nearest-neighbours keras machine-learning mnist neural-networks opencv pima-indians-diabetes python regression ridge-regression scikit-learn tensorflow

Last synced: 05 Dec 2024

https://github.com/dadvaiahpavan/ai-data-scientist-

AI-powered tool for dataset analysis, featuring data preprocessing, classification, regression, anomaly detection, and text analysis. Built with scikit-learn, pandas, and Plotly for visualization. Includes an interactive Streamlit web interface for real-time data analysis.

ai anomaly-detection classification data-analysis data-science machine-learning panda plotu regression scikit-learn sentiment-analysis streamlit

Last synced: 05 Dec 2024

https://github.com/bhaveshbhakta/diabetes-prediction

Note* The hosted website link might take some time to load. Please be patient while the application initializes.

diabetes-prediction flask machine-learning python scikit-learn svm web-development

Last synced: 05 Dec 2024

https://github.com/galaxy092/samsung-innovation-campus-big-data-capstone-project

Samsung Innovation Campus Big Data Capstone Project - Weather Prediction

hadoop jupyter-notebook pandas pyspark scikit-learn sparksql

Last synced: 05 Dec 2024

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of multiple linear regression to predict the profits of startups based on their investment in R&D, administration, and marketing, using the 50_Startups.csv dataset.

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 05 Dec 2024
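
To illustrate the technique named in the entry above, here is a minimal sketch of multiple linear regression with scikit-learn. It is not code from that repository: the data is synthetic, and the three columns, coefficients, and noise level are assumptions chosen only for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score

# Synthetic stand-in data (NOT the 50_Startups.csv dataset).
# Columns play the role of R&D, administration, and marketing spend.
rng = np.random.default_rng(0)
X = rng.uniform(0, 100, size=(50, 3))

# Assumed "true" coefficients and intercept, for illustration only.
true_coef = np.array([0.8, 0.1, 0.3])
y = X @ true_coef + 5.0 + rng.normal(0, 1.0, size=50)

# Fit one linear model on all three predictors at once.
model = LinearRegression().fit(X, y)

# R-squared on the training data measures goodness of fit.
r2 = r2_score(y, model.predict(X))
print("coefficients:", model.coef_)
print("intercept:", model.intercept_)
print("R^2:", r2)
```

On real data, the score would normally be computed on a held-out test split rather than on the training rows.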

https://github.com/gdapriana/clickbait-detector-backend

This repository contains the backend logic for the “Clickbait Detector” app. Built using Python, it employs an Artificial Neural Network (ANN) to predict the likelihood of a news headline being clickbait. It provides REST API endpoints to interact with the model.

flask python scikit-learn tensorflow

Last synced: 05 Dec 2024

https://github.com/dukebw/ml-model-selection

Machine learning model selection using Dlib and scikit-learn.

dlib machine-learning ranking scikit-learn

Last synced: 05 Dec 2024

https://github.com/dahsie/spam_classification

This was my first NLP project, in which I performed spam detection using embedding algorithms to encode my texts. I used Random Forest and Multi-Layer Perceptrons for the classification phase, which yielded accuracies of 97% and 98%, respectively. I also learned to document my code with Sphinx.

doc2vec fasttext-embeddings gensim glove-embeddings python scikit-learn sphinx-doc word2vec-algorithm

Last synced: 23 Dec 2024

https://github.com/viniciusds2020/ml_pycaret_classificacao

A system for preprocessing and training machine learning models using PyCaret. A low-code methodology for MLOps processes.

machine-learning mlops preprocessing pycaret python scikit-learn

Last synced: 10 Dec 2024

https://github.com/aryan-coder-student/smart-agricultural-monitering-aryanpahari037-

Increase agricultural productivity and sustainability by offering a smart farming assistant that helps farmers make data-driven decisions to optimize resource use and crop management.

hackathon-project machine-learning numpy pandas python scikit-learn

Last synced: 10 Dec 2024

https://github.com/vivekky57/car-price-prediction

Get car price predictions with this end-to-end project.

flask machine-learning machine-learning-algorithms python python3 random-forest-classifier scikit-learn

Last synced: 23 Dec 2024

https://github.com/justsecret123/nba-players-stats-analysis

A quick interactive Notebook to visualize some NBA players stats (points, assists, steals, blocks...) and totals, rankings and comparisons. Feel free to add any player in the .csv data files. 🏀

csv ipython-notebook ipywidgets jupyter-notebook jupyterlab matplotlib pandas python scikit-learn seaborn

Last synced: 23 Dec 2024

https://github.com/crispengari/ml-web-applications

✔ This repository contains a series of machine learning web applications built with Python.

artificial-intelligence deeplearning flask javascript machinelearning nueral-networks python scikit-learn sentiment-analysis webapplication

Last synced: 16 Dec 2024

https://github.com/saswatamcode/datascienceapi

This is a RESTful API built using Flask and Scikit-Learn. It provides a host of Classification and Regression algorithms that can be used readily and returns results in the form of predictions, confusion matrices, accuracy scores and more.

api flask ml python3 scikit-learn

Last synced: 09 Jan 2025

https://github.com/abdullahashfaq-ds/sms-spam-detection

A machine learning application designed to classify SMS messages as spam or non-spam, offering real-time analysis to identify potentially harmful content.

css3 docker flask html5 javascript matplotlib nltk numpy pandas python scikit-learn seaborn tailwindcss xgboost

Last synced: 17 Dec 2024

https://github.com/jol79/python_exercises

Solving interesting Python exercises on different topics

matplotlib-pyplot numpy pandas python3 pythonexercises scikit-learn seaborn

Last synced: 11 Dec 2024

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn

Last synced: 17 Dec 2024
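
The workflow the entry above describes (PCA, KMeans with an elbow check, DBSCAN) can be sketched roughly as follows. This is an assumption-laden illustration, not the repository's code: the data comes from `make_blobs` rather than the Country Aid Assessment dataset, and the cluster count, `eps`, and `min_samples` values are placeholders.

```python
from sklearn.cluster import DBSCAN, KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data: 300 samples, 6 features, 3 latent groups.
X, _ = make_blobs(n_samples=300, centers=3, n_features=6, random_state=0)

# Standardize, then project onto the first two principal components.
X_scaled = StandardScaler().fit_transform(X)
X_2d = PCA(n_components=2).fit_transform(X_scaled)

# Elbow method: record the within-cluster sum of squares (inertia) per k.
inertias = {}
for k in range(1, 7):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X_2d)
    inertias[k] = km.inertia_

# Cluster with the k suggested by the elbow (assumed to be 3 here).
labels_km = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X_2d)

# Density-based alternative; label -1 marks points treated as noise.
labels_db = DBSCAN(eps=0.5, min_samples=5).fit_predict(X_2d)

print(inertias)
```

Plotting `inertias` against k and looking for the bend in the curve is the usual way to read the elbow.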

https://github.com/albarji/teachingcontainer

A Docker container I use for my lectures

docker keras machine-learning scikit-learn

Last synced: 17 Dec 2024

https://github.com/scikit-learn/pairwise-distances-reductions-asv-suite

A dedicated asv suite for scikit-learn private PairwiseDistancesReductions

asv benchmarks cython scikit-learn

Last synced: 17 Dec 2024

https://github.com/pratishtha-abrol/sentimentanalysis

Logistic Regression: A sentiment analysis case study

logistic-regression nltk-python scikit-learn sentiment-analysis

Last synced: 12 Dec 2024

https://github.com/dmarks84/coursework_project_ml-classification

Project for IBM Data Science course on Machine Learning -- Trained ML models for classification, evaluating based on a variety of metrics

classification communication data-modeling dataframes numpy pandas python scikit-learn supervised-ml

Last synced: 23 Dec 2024

https://github.com/dmarks84/coursework_project_ml-classifier-eval-selection

Project for University of Michigan Applied Data Science Specialization -- Predicted viewer engagement based on features related to video metrics; evaluated a large set of classifiers under different scoring metrics to select the "optimal" one.

classification cross-validation data-modeling data-reporting data-visualization databases dataframes eda grid-search matplotlib numpy pandas python scikit-learn statistics supervised-ml

Last synced: 23 Dec 2024

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 23 Dec 2024

https://github.com/dmarks84/coursework_project_ml-model-eval-refine

Project for IBM Data Science course on ML Models & Analysis -- Read in large dataset of home sales and utilized polynomial linear regression analysis to make predictions of future home sales prices

classification communication data-modeling dataframes machine-learning matplotlib numpy pandas programming python regression scikit-learn scipy seaborn supervised-ml visualization

Last synced: 23 Dec 2024
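
As a rough sketch of the polynomial regression approach mentioned in the entry above (not the course project's code), the example below fits a degree-2 polynomial with a scikit-learn pipeline. The data is synthetic, and the quadratic relationship and the degree are assumptions for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

# Synthetic data following an assumed quadratic: y = 2x^2 - x + 1 + noise.
rng = np.random.default_rng(0)
x = rng.uniform(-3, 3, size=(200, 1))
y = 2.0 * x[:, 0] ** 2 - x[:, 0] + 1.0 + rng.normal(0, 0.1, size=200)

# PolynomialFeatures expands x into [1, x, x^2]; LinearRegression then
# fits ordinary least squares on the expanded features.
model = make_pipeline(PolynomialFeatures(degree=2), LinearRegression())
model.fit(x, y)

score = model.score(x, y)  # R^2 on the training data
pred = model.predict(np.array([[2.0]]))  # noise-free value at x=2 is 7
print(score, pred)
```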

https://github.com/saahilanande/naivebayes

Implementing a Naive Bayes classifier from scratch for sentiment analysis of the IMDB dataset

machine-learning naive-bayes-classifier python-3 scikit-learn

Last synced: 10 Jan 2025

https://github.com/martinkersner/kmeans-meetup

Presentation about k-Means for Seoul AI Meetup on July 22, 2017.

kmeans numpy python scikit-learn

Last synced: 01 Jan 2025

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 23 Dec 2024

https://github.com/rishi035/advanced-house-price-predictions

This is my first project, with which I also participated in a Kaggle competition.

linear-regression machine-learning python random random-forest regressor-models scikit-learn

Last synced: 01 Jan 2025

https://github.com/themihirmathur/machine-failure-predictor

The 'Machine Failure Predictor' is a web application designed to enable predictive maintenance for industrial equipment. Leveraging machine learning techniques, this application predicts whether equipment will fail and identifies the type of failure based on key process parameters.

cicd digitalocean docker fastapi flask github github-actions machine-learning matplotlib mlops numpy plotly python randomforest scikit-learn seaborn streamlit

Last synced: 01 Jan 2025

https://github.com/themihirmathur/soiligator

Soiligator is an advanced machine learning project designed to optimize irrigation management by predicting whether irrigation is necessary based on environmental and soil-related data.

auc-score logistic-regression machine-learning matplotlib numpy pandas python random-forest-classifier roc-curve scikit-learn seaborn standardscaler support-vector-machine

Last synced: 01 Jan 2025

https://github.com/tedim52/discjockey

a content-based recommender system for your party playlist preferences

jupyter-notebook matplotlib pandas scikit-learn spotify-web-api

Last synced: 12 Dec 2024

https://github.com/raphael-ufrj/analise_algodao

Historical analysis of cotton planting, examining planting based on climate and historical data.

analysis data-science data-visualization dataset docker pandas provenance python python3 scikit-learn seaborn streamlit

Last synced: 12 Dec 2024

https://github.com/emilyfelker/ieee_cis_fraud_detection

Which online transactions are fraudulent? Program that uses machine learning to detect fraud.

kaggle logistic-regression machine-learning pandas poetry python scikit-learn sklearn xgboost

Last synced: 13 Dec 2024

https://github.com/toscdom/spam_detection

This repository contains a project focused on analyzing and classifying emails to detect SPAM. It includes training a machine learning classifier for SPAM detection, identifying key topics in SPAM emails using NLP techniques, and calculating semantic distances to evaluate topic similarity. Tools used include Python NLP libraries.

classifier nlp nltk scikit-learn semantic-analysis spam-detection

Last synced: 13 Dec 2024

https://github.com/thekartikeyamishra/ai-customer-feedback-summarizer

The AI Customer Feedback Summarizer is a Python-based application that processes customer feedback, extracts insights, and summarizes reviews. This basic version uses extractive summarization techniques, and the advanced version integrates advanced sentiment analysis, visualization, and industry-specific fine-tuning.

ai chatbot gpt machine-learning matplotlib nltk pandas python scikit-learn streamlit

Last synced: 13 Dec 2024

https://github.com/sundanc/movierecommendation

Simple movie recommendation system built with Streamlit

movie-recommendation-app python scikit-learn scikitlearn-machine-learning streamlib

Last synced: 13 Dec 2024

https://github.com/infinitode/scikit-learn-decisiontreeclassifier-updater

An open-source tool to convert older Scikit-learn DecisionTreeClassifier models to the newer version.

ai classifier cli converter decisiontree python scikit-learn sklearn tools

Last synced: 13 Dec 2024

https://github.com/sunilvarma-l/liver-disease-prediction

This project focuses on predicting liver disease using machine learning techniques. The dataset includes 615 records with 13 features, such as blood and urine analysis measurements. The target variable is categorical, with five classes: No Disease, Suspect Disease, Hepatitis C, Fibrosis, and Cirrhosis.

machine-learning numpy pandas python scikit-learn statistics streamlit tensorflow

Last synced: 13 Dec 2024

https://github.com/arnavk-09/phishing-detection

🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI

csv data fastapi flask python scikit-learn

Last synced: 13 Dec 2024

https://github.com/virajbhutada/house-price-prediction

This repository houses the implementation of a linear regression model designed to predict house prices based on key factors such as square footage, number of bedrooms, and number of bathrooms. The project delves into fundamental concepts in data analysis, machine learning, and model evaluation.

google-colab house-price-prediction matplotlib pandas python scikit-learn seaborn

Last synced: 10 Jan 2025

https://github.com/abhishekbagdiya01/movies-recommendation-system

This repository contains the code for a movie recommendation system built using Jupyter Notebook.

aiml jupyter-notebook numpy pandas python scikit-learn

Last synced: 23 Dec 2024

https://github.com/mangesh-balkawade/pythonautomationsscripts

This repository contains the Python automation scripts, machine learning case studies, and Python projects that I wrote to learn automation and ML with Python.

automation data-science machine-learning-algorithms matplotlib mongodb pandas python3 scikit-learn seaborn webscraping

Last synced: 01 Jan 2025

https://github.com/debjyotisaha/web-application-projects

Web applications developed with the help of Python libraries and ML algorithms

algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit

Last synced: 17 Dec 2024

https://github.com/prakharchoudhary/mlchallenge-2

My submission for Machine Learning Challenge #2, organised by HackerEarth.

adaboost gradient-boosting-classifier jupyter-notebook machine-learning python scikit-learn

Last synced: 01 Jan 2025

https://github.com/divinenaman/color-extraction-api

Extract colours from images using K-means, served through a FastAPI pipeline.

fastapi k-means-clustering scikit-learn

Last synced: 17 Dec 2024
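
The K-means colour extraction idea in the entry above can be sketched as below. This is an illustration under stated assumptions, not the repository's implementation: the image is a synthetic two-colour array, the cluster count is fixed at 2, and the FastAPI layer is omitted.

```python
import numpy as np
from sklearn.cluster import KMeans

# Synthetic 20x20 RGB image: left half pure red, right half pure blue.
image = np.zeros((20, 20, 3), dtype=np.uint8)
image[:, :10] = (255, 0, 0)
image[:, 10:] = (0, 0, 255)

# Flatten to a list of pixels; each row is one (R, G, B) sample.
pixels = image.reshape(-1, 3).astype(float)

# Cluster the pixels; each cluster centre is one dominant colour.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(pixels)
palette = kmeans.cluster_centers_.round().astype(int)

print(palette)
```

For a real photograph, the pixel array would come from an image loader, and the number of clusters sets how many palette colours are returned.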

https://github.com/adhadse/hands-on-machine-learning-book-notes-and-practice

This repo holds the Jupyter notebooks and datasets containing notes/comments on things I learned from this book. Feel free to use and learn from them.

data-science deep-learning jupyter-notebooks keras machine-learning python scikit-learn tensorflow

Last synced: 17 Dec 2024

https://github.com/sonu275981/flight-fare-prediction

End-to-end implementation of machine learning airline flight fare prediction using Python

flight-price-prediction gradient-boosting machine-learning predicts-flight-fares python3 regression-models scikit-learn

Last synced: 24 Dec 2024

https://github.com/cs50victor/cspaint

handwritten-text recognition application

ai machine-learning python scikit-learn

Last synced: 18 Dec 2024

https://github.com/henriqueotogami/imersao-dados-3-alura

Third edition of Alura's Imersão Dados (May 3 to 7, 2021). This edition's project was inspired by a challenge from the Laboratory for Innovation Science at Harvard, made available on Kaggle.

alura bioinformatics data-science drug-discovery google-collab harvard-university imersaodados jupyter-notebook kaggle-challenge laboratory-innovation-science matplotlib pandas python3 scikit-learn seaborn

Last synced: 24 Dec 2024

https://github.com/sanikamal/deep-learning-atoz

A collection of deep learning architectures, models, code snippets, tips, and mini projects.

computer-vision deep-learning nlp scikit-learn skimage tensorflow

Last synced: 10 Jan 2025

https://github.com/pspanoudakis/machine-learning-nlp

NLP 🤖 📖 projects on Vaccine Sentiment Classification 💉 and Question Answering 💬

bert-fine-tuning glove-embeddings neural-networks pytorch question-answering rnn scikit-learn sentiment-classification softmax-regression squad

Last synced: 18 Dec 2024

https://github.com/alyssonmach/machine-learning-com-python

Machine learning applications using the Python programming language.

ia keras-tensorflow machine-learning matplotlib numpy pandas programming python scikit-learn scipy

Last synced: 24 Dec 2024

https://github.com/gayathri2200/car-price-prediction---machine-learning

A machine learning project that predicts the price of used cars based on their features.

data-science machine-learning modeldeployment pandas price-prediction python regression scikit-learn streamlit visual-studio visualization

Last synced: 18 Dec 2024

https://github.com/jayadavv/dynamic-ml-model-selector

An interactive web application that allows users to upload their datasets and dynamically select, train, and evaluate various machine learning models. The app provides comprehensive performance metrics and visualizations, making it easy for users to analyze their data effectively.

decision-trees linear-regression logistic-regression matplotlib-pyplot plotly python random-forest scikit-learn streamlit

Last synced: 18 Dec 2024

https://github.com/danishtalpur/sentiview-website

SentiView is a sentiment analysis tool designed to analyze and interpret the emotions behind tweets on Twitter. The platform processes textual data from user-generated tweets to determine the sentiment behind them—whether they are positive, negative, or neutral.

css flask html java naive-bayes-classifier scikit-learn twitter-sentiment-analysis

Last synced: 18 Dec 2024

https://github.com/afkewolczyk/data_science_bootcamp

A data science project for learning data science essentials such as pandas, Matplotlib, and scikit-learn

ai data-science machine-learning pandas scikit-learn

Last synced: 18 Dec 2024

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 18 Dec 2024

https://github.com/aneeshmurali-n/ann-diabetes-prediction

Predicting diabetes progression using an Artificial Neural Network (ANN). This project leverages the scikit-learn diabetes dataset for training and evaluation. Includes data preprocessing, model building, and performance visualization.

ann data-preprocessing data-visualization deep-learning diabetes-prediction exploratory-data-analysis keras machine-learning matplotlib neural-network numpy pandas regression scikit-learn seaborn tensorflow visualization

Last synced: 18 Dec 2024

https://github.com/prosenjitjoy/webscraper---the-numbers

A basic data science project I made to apply my basic knowledge of different libraries in a single project. This project will serve as an introduction to the data science workflow for anyone interested.

data-science matplotlib python scikit-learn webscraper

Last synced: 19 Dec 2024

https://github.com/haekalsetiawan/banana-quality-ml-analysis

The Banana Quality Analysis ML project classifies the quality of bananas using machine learning. By analyzing banana images and relevant features, the model categorizes bananas into quality tiers. Key stages include data preprocessing, feature extraction, model training, and evaluation, resulting in high accuracy.

pandas python scikit-learn

Last synced: 19 Dec 2024

https://github.com/ngangawairimu/linear-regression-

This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.

data-analysis linear-regression machine-learning predictive-modeling python scikit-learn

Last synced: 19 Dec 2024

https://github.com/jupitvq/simple-uib-assistant

A simple scikit-learn-based chatbot that helps students by providing information about academics and administration at UIB.

chatbot machine-learning scikit-learn virtual-assistant

Last synced: 19 Dec 2024

https://github.com/mayankyadav23/shipment-pricing-prediction

Shipment Pricing Prediction 📦🔍 is a machine learning project that forecasts shipment prices based on various supply chain factors. Using advanced regression models, it provides valuable insights 📊 to optimize pricing strategies in the supply chain analytics domain.

data-visulization flask ineuron-ai machine-learning python scikit-learn shipment-and-pricing

Last synced: 19 Dec 2024

https://github.com/gurramankit/censusproject_ml_randomforest

The objective of this project is to build a classification model using the Census Income dataset from the UCI Machine Learning Repository. The model predicts whether an individual's income exceeds $50,000 per year, based on their demographic and employment-related attributes.

matplotlib numpy pandas python random-forest scikit-learn seaborn

Last synced: 19 Dec 2024

https://github.com/musty-ess/shopping-prediction-ai-using-machine-learning

This project is an AI model designed to predict whether customers on an online shopping website will complete a purchase. It uses machine learning, specifically a k-nearest neighbors (KNN) classifier, to analyze user behavior and predict their purchasing intent.

ai-prediction classification-model k-nearest-neighbors knn-classifier machine-learning online-shopping python scikit-learn sensitivity-specificity user-behavior-prediction

Last synced: 19 Dec 2024
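
A minimal sketch of the k-nearest neighbors approach from the entry above, with sensitivity and specificity as the tags suggest. This is not the repository's code: the data is generated with `make_classification` instead of real shopping sessions, and `n_neighbors=5` and the 70/30 split are assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic binary classification data standing in for user sessions.
X, y = make_classification(n_samples=500, n_features=8, random_state=0)

# Stratified split keeps both classes present in train and test sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

# Fit KNN; the choice of 5 neighbours is an illustrative assumption.
clf = KNeighborsClassifier(n_neighbors=5).fit(X_train, y_train)

# For binary labels, ravel() yields tn, fp, fn, tp in that order.
tn, fp, fn, tp = confusion_matrix(y_test, clf.predict(X_test)).ravel()

sensitivity = tp / (tp + fn)  # true positive rate
specificity = tn / (tn + fp)  # true negative rate
print(sensitivity, specificity)
```

Reporting both rates matters when classes are imbalanced, since plain accuracy can look high while most positives are missed.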

https://github.com/kheriberto/linear_regression_ecommerce

A simple project showcasing how to build a linear regression model with scikit-learn

data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn

Last synced: 19 Dec 2024