Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/ssreeramj/hackerearth-exhibit-art

This is my solution to the HackerEarth Exhibit Art Machine Learning Challenge

gradient-boosting machine-learning pandas regression scikit-learn

Last synced: 02 Jan 2025

https://github.com/epomatti/python-machine-learning

Simple examples of ML using Python

machine-learning python scikit-learn

Last synced: 17 Jan 2025

https://github.com/johnnixon6972/cirrhosis-outcomes-prediction

This leverages advanced machine learning techniques to predict patient outcomes for those suffering from cirrhosis. Utilizing a comprehensive dataset from a Mayo Clinic study, this project explores various data imputation methods and class balancing techniques to enhance prediction accuracy.

ai algorithms analytics artificial-intelligence machine-learning ml pandas python3 scikit-learn

Last synced: 18 Jan 2025

https://github.com/the-developer-306/fake-review-detector

This project is a machine learning-based review classification system that predicts whether a product review is GENUINE or FAKE. It preprocesses review text, analyzes sentiment, and uses numerical features like ratings and helpfulness to make predictions. The model is deployed via a Flask web application for user interaction.

classification flask logistic-regression machine-learning numpy pandas python renderdeploy scikit-learn sentiment-analysis

Last synced: 26 Dec 2024

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 24 Jan 2025

https://github.com/eljandoubi/disasterresponsepipeline

Project aim is to build a Natural Language Processing (NLP) model to categorize messages on a real time basis.

flask nltk numpy pandas plotly scikit-learn scipy sqlalchemy

Last synced: 24 Jan 2025

https://github.com/sayed-ashfaq/delhivery-dataanalysis

In this project, I conducted basic analysis, feature engineering, normalization, and outlier handling, along with statistical and non-parametric testing to extract insights.

feature-engineering normalization outlier-detection pandas python scikit-learn statistcal-tests statistical-analysis

Last synced: 26 Dec 2024

https://github.com/brianlesko/maze-runner

Developed a Python-based maze-crawling application using a PS5 controller interface. This project highlights skills in software-hardware integration and low-code UI design, demonstrating expertise ideal for advanced software engineering.

communication dualsense engineer engineering hacking hardware hardware-hacking interface low-code-ui mechanical-engineer mechanical-engineering protocol ps5 python robotics-engineer scikit-learn software sony streamlit ui

Last synced: 26 Dec 2024

https://github.com/szymon-budziak/ai_football_game_analysis

Football game analysis using YOLOv8 for object detection, Optical Flow for motion tracking, speed and distance calculations, perspective transformation, and K-Means clustering for pixel segmentation.

ai computer-vision kmeans object-detection optical-flow python3 pytorch roboflow scikit-learn segmentation supervision ultralytics yolov8

Last synced: 25 Jan 2025

https://github.com/apfirebolt/titanic_survival_prediction

Titanic survival prediction GUI application using scikit-learn and PyQT5

jupyter-notebook pandas prediction pyqt5 python scikit-learn titanic-kaggle

Last synced: 25 Jan 2025

https://github.com/loudji971/chatbot-intent-classifier

Chatbot based on natural language processing (NLP) and deep learning for accurate intent classification in conversations. - Artificial Inteligence Tecniques

ai atis-dataset bert chat-bot-deep-learning deep-neural-networks fastapi intent-classification keras nlp nltk nltk-keras-python nlu-engine scikit-learn tridib-samanta

Last synced: 25 Jan 2025

https://github.com/gamowy/urbansounds-classification

Classification of urban sounds using Tensorflow Keras

keras machine-learning python scikit-learn tensorflow

Last synced: 25 Jan 2025

https://github.com/afonsojramos/feup-iart

Projects developed for Artificial Intelligence class.

feup feup-iart iart neural-network python scikit-learn tensorflow

Last synced: 25 Jan 2025

https://github.com/gurpreet0022/nlp_exploration

This repository explores various Natural Language Processing (NLP) techniques using the NLTK library in Python. It demonstrates these techniques on a sample dataset and performs sentiment analysis on movie reviews.

beginner-friendly nlp nlp-machine-learning nltk scikit-learn

Last synced: 26 Dec 2024

https://github.com/ranimeshehata/feed-forward-neural-network-on-mnist

A PyTorch-based project for classifying the MNIST dataset using Feed Forward Neural Networks, including training, validation, results and visualization.

feedforward-neural-network matplotlib mnist python3 pytorch scikit-learn torchvision

Last synced: 25 Jan 2025

https://github.com/hvalfangst/azure-functions-pandas

Azure Functions for ETL operations using Pandas. Uploaded CSV files trigger data processing, calculating correlations and storing results in a JSON file. Automated deployment via GitHub Actions and Terraform.

az-204 azure azure-functions azure-functions-python pandas python scikit-learn terraform

Last synced: 25 Jan 2025

https://github.com/coder5omkar/logistic-regression-customer-churn-prediction

This project uses Logistic Regression to predict customer churn in the telecom industry. To run, clone the repository, install dependencies, and run the Jupyter notebook for full analysis and predictions.

logistic-regression ml pandas scikit-learn seaborn statistics

Last synced: 26 Dec 2024

https://github.com/thekartikeyamishra/customer-retention-predictor

The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.

joblib machine-learning numpy pandas python scikit-learn tinker

Last synced: 26 Dec 2024

https://github.com/aksoni07/movie-recommendation

A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.

clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions

Last synced: 26 Dec 2024

https://github.com/hrolive/deep-learning-nanodegree

As one of the top 3% students from the first phase, "PyTorch Scholarship Challenge" by Facebook AI, I have earned a full scholarship to Udacity’s Deep Learning Nanodegree program

api-gateway aws aws-lambda aws-sagemaker computer-vision convolutional-neural-networks deep-learning deployment machine-learning natural-language-processing numpy pandas python pytorch scikit-learn

Last synced: 04 Jan 2025

https://github.com/adarshpheonix2810/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter

Last synced: 25 Jan 2025

https://github.com/ipascrlet/pakistan-infant-mortality-analysis

Explore the factors affecting infant mortality rates in Pakistan through this comprehensive analysis project. Dive into the data to uncover patterns and insights that could potentially inform healthcare policies and interventions.

api correlation-matrix data-analysis data-science data-visualisation machine-learning numpy pakistan ridge-regression scikit-learn seaborn team-project unicef wdi

Last synced: 25 Jan 2025

https://github.com/mhmudfzli/loan-approval-prediction

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

automl catboost numpy pandas python scikit-learn seaborn

Last synced: 25 Jan 2025

https://github.com/hrolive/disaster-response-pipeline

A machine learning pipeline that categorizes disaster related messages so that they can be sent to the appropriate disaster relief agency

flask machine-learning natural-language-processing nltk pandas plotly python scikit-learn sql sqlalchemy

Last synced: 04 Jan 2025

https://github.com/hrolive/recommendation-systems-ibm

Analyze the interactions that users have with articles on the IBM Watson Studio platform and make recommendations to them about new articles, using various recommendation engines.

machine-learning natural-language-processing pandas python recomendation-system scikit-learn

Last synced: 04 Jan 2025

https://github.com/hafidaso/predicting-industrial-machine-downtime-level-3

This project aims to develop a predictive model using machine learning techniques to forecast machine failures based on historical operational data.

imbalanced-learning numpy pandas python scikit-learn seaborn xgboost

Last synced: 25 Jan 2025

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 25 Jan 2025

https://github.com/nickklos10/compressive-strenght-prediction

This project predicts concrete compressive strength using a neural network regression model built with Keras.

jupyter-notebook keras matplotlib numpy pandas python scikit-learn

Last synced: 04 Jan 2025

https://github.com/elazzouzihassan/si-fraud-detection-prototype

Système de Détection des Fraudes avec Python (Prototype).

googlecolab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 04 Jan 2025

https://github.com/jbizzlefoshizzle/linear-and-ridge-regression

The purpose of this project was to analyze and predict housing prices using attributes or features such as square footage, number of bedrooms, number of floors, and so on.

linear-regression machine-learning machine-learning-algorithms regression-analysis regression-models ridge-regression scikit-learn scikitlearn-machine-learning train-test-split train-test-using-sklearn

Last synced: 25 Jan 2025

https://github.com/moeeinaali/nlp-lsa

Applying Latent Semantic Analysis (LSA) to text data using scikit-learn.

lsa nlp scikit-learn

Last synced: 18 Jan 2025

https://github.com/bjpcjp/scikit-learn

Updates in progress. Jupyter workbooks will be added as time allows.

python python3 scikit-learn

Last synced: 04 Jan 2025

https://github.com/bjpcjp/scikit-learn-v0.23

My Jupyter Lab notebooks on Scikit-Learn v0.23. Work in progress.

matplotlib-pyplot numpy python3 scikit-learn scipy

Last synced: 04 Jan 2025

https://github.com/ghoumbadji/water-potability-checker

A machine learning model that takes some data on water and tells if this water is potable or not

kaggle machine-learning pandas scikit-learn

Last synced: 04 Jan 2025

https://github.com/leftcoastnerdgirl/deep_learning

This project introduces neural networks, deep learning, and Tensorflow.

deep-learning jupyter-notebook neural-networks pandas-python scikit-learn tensorflow

Last synced: 04 Jan 2025

https://github.com/leftcoastnerdgirl/supervised_learning

This project demonstrates supervised machine learning using scikit-learn.

classification-reports confusion-matrix jupyter-notebook numpy pandas-python pathlib scikit-learn sklearn

Last synced: 04 Jan 2025

https://github.com/jbizzlefoshizzle/ibm_capstone_project

Used K-means clustering and mapping libraries to determine best cities in San Diego to open a Mexican restaurant

beautifulsoup4 folium-maps geopy pandas-python scikit-learn

Last synced: 25 Jan 2025

https://github.com/thariniselvakumar/kidney-disease-prediction

This project is about the kidney disease prediction using machine learning algorithms

machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 26 Dec 2024

https://github.com/kanika300393/loan_prediction

This project implements a Loan Prediction system using Support Vector Machine (SVM). It includes data preprocessing, visualization of features like income and education, and model evaluation. The goal is to predict loan approval based on the dataset. Clone the repo to explore the code and improve the model.

data-science machine-learning numpy pandas python scikit-learn svm-classifier

Last synced: 26 Dec 2024

https://github.com/eusha425/housing-market-analysis

Implementation of supervised learning algorithms for real estate price prediction, featuring Ridge Regression optimization, IQR-based outlier detection, and extensive feature engineering. Includes detailed visualizations, statistical analysis, and model performance comparisons using various evaluation metrics.

data-preprocessing data-science exploratory-data-analysis house-price-prediction machine-learning python scikit-learn supervised-learning

Last synced: 04 Jan 2025

https://github.com/ajxxxs/spotify-music-analysis

spotify Music (web scraped playlists ) analysis (over 3 states) , trends, features and a music recommendation system.

matplotlib numpy panda scikit-learn seaborn

Last synced: 10 Jan 2025

https://github.com/leabrodyheine/ml-kaggle-cirrhosis-data

This project showcases skills in machine learning, data preprocessing, and model evaluation using Python libraries such as scikit-learn, XGBoost, and Optuna. It involves implementing various machine learning models, handling imbalanced data, and employing imputation techniques to enhance model performance for predicting cirrhosis outcomes.

data-analysis data-pre imbalanced-data imputation machine-learning optuna pipeline scikit-learn xgboost

Last synced: 11 Jan 2025

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 11 Jan 2025

https://github.com/sadmansakib93/mental-resilience-analysis-using-machine-learning

Utilized supervised and unsupervised ML techniques to analyze mental health and resilience levels of medical students [Project completed on December, 2019]

artificial-intelligence classification clustering correlation linear-regression machine-learning machine-learning-algorithms mental-health python regression resilience scikit-learn statistical-analysis

Last synced: 12 Jan 2025

https://github.com/kianoushamirpour/end_to_end_text_classification

Developing feature engineering pipelines, building packages, automating tests, and creating FastAPI endpoints.

apache-airflow ci docker-compose factory-design-pattern fastapi feast grafana hyperopt mlflow prometheus pytorch scikit-learn tox transformers xgboost-classifier

Last synced: 18 Jan 2025

https://github.com/dan-niles/iris-ml

Machine learning on the Iris dataset

iris-dataset machine-learning scikit-learn

Last synced: 11 Jan 2025

https://github.com/mrmalik2512/catsvsdog.github.io

A CNN model integrated with flask backend the project is trained on image data of dogs and cats and integrated with a website predicts the given image is dog or a cat

deep-learning numpy python scikit-learn tensorflow

Last synced: 12 Jan 2025

https://github.com/priyanshul28/ml_regression_eda_waiterstip

An EDA and Machine Learning Regression exercise on the Waiter's Tip dataset demonstrating the use of Linear Regression, Neural Network Regressors, Decision Trees, Random Forests, Linear SVR, XGBoost, etc. The models are optimized using hyperparameter tuning through GridSearchCV.

eda machine-learning regression scikit-learn seaborn

Last synced: 25 Jan 2025

https://github.com/chris-seoul/github-handson-ml

ML Research using Hands-on, Hugging Face, CrewAI, Gemini, Langchain etc

deep-learning ml neural-networks pandas scikit-learn tensorflow

Last synced: 25 Jan 2025

https://github.com/usmana5809/quran-recitation-audio-classification

Quran Recitation Audio Classification project aims to classify different recitations of the Quran using machine learning techniques. It involves preprocessing audio data, extracting features, training models, and evaluating their performance

audio-classification classification-model islamic-studies librosa machine-learning python quran scikit-learn

Last synced: 25 Jan 2025

https://github.com/sxv357/xtern-artificial-intelligence-work-based-assessment

This application takes in data regarding undergraduate college students in the state of Indiana such as their year, what major they're pursuing, which university they attend, and makes a prediction about their food order.

jupyter-notebook matplotlib pandas pickle scikit-learn seaborn

Last synced: 12 Jan 2025

https://github.com/santoshn86/dlp-ev-system-for-pa-optimization

This system is a game-changer, enabling smarter energy management through predictive insights and personalized optimization strategies.

aiml django flask keras pytorch scikit-learn tensorflow typescript

Last synced: 25 Jan 2025

https://github.com/aravindnathan02/credit-card-fraud-detection

This is a Machine Learning project on classifying fraudulent credit card transactions.

classification-model fraud-detection logistic-regression machine-learning python random-forest scikit-learn

Last synced: 25 Jan 2025

https://github.com/medicharlakarthik/credit-card-fraud-detection

Credit Card Fraud Detection using machine learning to distinguish fraudulent transactions from legitimate ones. This project includes data analysis, model training, and evaluation to achieve high accuracy and recall, minimizing false negatives for better fraud detection

machine-learning python random-forest-classifier scikit-learn

Last synced: 12 Jan 2025

https://github.com/andrewsy1004/mask-detection

Mask detection system capable of identifying individuals with or without masks

kaggle keras python scikit-learn tensorflow

Last synced: 10 Jan 2025

https://github.com/zenklinov/regression_logistic_-_sentiment_analysis_movie_data

This repository contains code for performing sentiment analysis using scikit-learn and logistic regression

llm natural-language-processing nlp nltk scikit-learn sentiment-analysis

Last synced: 12 Jan 2025

https://github.com/sanchariii/order_amt_prediction

Order Amount Prediction is a machine learning project that predicts customer order amounts based on past behavior. It includes milestones for data cleaning, exploratory data analysis, feature engineering, and model building. The framework can be customized to suit specific needs and provides insights for better decision-making.

jupyter-notebook machine-learning python scikit-learn

Last synced: 12 Jan 2025

https://github.com/sanchariii/multiple-disease-prediction-system-using-streamlit

This prediction system is a web based application using Streamlit framework which can predict multiple diseases like Heart Disease , Parkinson's Disease and Diabetes.

pickle scikit-learn spyder-python-ide streamlit-webapp

Last synced: 12 Jan 2025

https://github.com/macdung123/fake-job-post-detection

This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.

classification data-analysis data-science deep-learning job-posting joblib machine-learning matplotlib-pyplot numpy pandas python scikit-learn tf-idf tkinter

Last synced: 25 Jan 2025

https://github.com/samsoumyajitd/food_ai

The AI Food Weather-Based Recommendation System provides personalized food and restaurant suggestions using AI. It uses GenAI and AI techniques like TF-IDF Vectorization, Cosine Similarity, and FuzzyWuzzy for tailored recommendations.

axios cosine-similarity flask flask-cors fuzzywuzzy generative-ai google-generativeai json nlp python-dotenv python3 reactjs scikit-learn sklearn tf-idf vanilla-css weather-api

Last synced: 25 Jan 2025

https://github.com/r-gg/ml-37

Amazon Reviews ~ Sentiment analysis evaluation: fine-tuned BERT vs LSTM. (+ Extensive Data Mining & Visualization)

bert deep-learning ipynb-jupyter-notebook lstm machine-learning python scikit-learn uni-project

Last synced: 12 Jan 2025

https://github.com/bjornmelin/ml-algorithm-playground

πŸ§ͺ Core ML algorithm implementations with GPU acceleration. Featuring optimized implementations across various libraries with comprehensive analysis. πŸ“ˆ

algorithms cuda gpu-computing lightgbm machine-learning python scikit-learn xgboost

Last synced: 25 Jan 2025

https://github.com/simon2k/stock-price-prediction-evaluation

This project is indented to present a small evaluation of different types of regression models for predicting stock prices for AAPL.

evaluation machine-learning numpy pandas predicting-stock-prices scikit-learn

Last synced: 12 Jan 2025

https://github.com/mohd-faizy/preprocess_ml

This repository hosts Python code that utilizes the Scikit-learn preprocessing API for data preprocessing. The code presents a comprehensive range of tools that handle missing data, scale data, encode categorical variables, and perform other functions.

data-science feature-engineering feature-engineering-algorithm feature-extraction feature-selection machine-learning outlier-detection preprocessing-data preprocessor scikit-learn

Last synced: 12 Jan 2025