Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/lakshitalearning/spamfortress

A machine learning-based project to detect SMS spam messages with high accuracy, using the SMS Spam Collection Dataset and techniques like supervised learning, text preprocessing, and modelย comparison.

data-science google-colab machine-learning nlp scikit-learn sms-spam-detection

Last synced: 16 Jan 2025

https://github.com/szymonrucinski/pippi-lang

Elegant ๐Ÿ“‘ text preprocessing pipeline ๐Ÿšฐ available as pip package ๐Ÿ based on scikit-learn pipeline. Combines Transformer and Column Transformer into a single object.

data-cleaning data-science nlp pipeline scikit-learn

Last synced: 02 Feb 2025

https://github.com/id-andyyy/alfahack

๐Ÿ“ˆ๐Ÿ’ต Investment propensity prediction model

catboost hackathon-project jupyter lightgbm numpy optuna pandas python scikit-learn

Last synced: 06 Feb 2025

https://github.com/m-rishab/job-recruitment-prediction-and-hr-dashboard-using-plotly

This project features make it ideal for dynamic HR dashboards, offering insights into candidate profiles and recruitment processes.

correlation-analysis flask kmeans-clustering numpy pandas plotly python scikit-learn seaborn standardscaler

Last synced: 22 Jan 2025

https://github.com/kr1shnasomani/covidxraynet

Detection of COVID-19 from chest X-ray images using CNN (Xception architecture)

computer-vision deep-learning keras matplotlib neural-network numpy opencv pandas scikit-learn seaborn tensorflow

Last synced: 19 Dec 2024

https://github.com/kr1shnasomani/sentimentscope

Sentiment analysis on movie review using TensorFlow and GloVe embeddings

deep-learning keras matplotlib natural-language-processing neural-networks numpy pandas scikit-learn tensorflow

Last synced: 19 Dec 2024

https://github.com/hariprasath-v/av-dataverse-hack---insurance-claim-prediction

Create a machine learning model to predict if the policyholder will file a claim in the next 6 months or not based on the set of car and policy features.

analyticsvidhya classification exploratory-data-analysis f1-score matplotlib numpy pandas python randomforest-classification scikit-learn seaborn shap

Last synced: 13 Jan 2025

https://github.com/messierandromeda/sentiment-analysis

Sentiment analysis with the IMDB movie review dataset.

imdb-dataset python scikit-learn sentiment-analysis

Last synced: 09 Feb 2025

https://github.com/samuele-lolli/data-analytics-techniques

A practical approach to data analytics pipeline.

numpy pandas pytorch scikit-learn

Last synced: 22 Jan 2025

https://github.com/vishnu-vamshii/fraud-detection-using-machine-learning

Developed a machine learning pipeline to detect fraudulent credit card transactions, handling imbalanced data with SMOTE and scaling. Trained models like Logistic Regression and Random Forest. Conducted EDA to identify fraud patterns.

pandas python scikit-learn tensorflow

Last synced: 23 Jan 2025

https://github.com/akimuddinshaikh/domain-application-of-predictive-analysis

Data-Driven House Price Prediction "Predicting house prices using Machine Learning techniques

feature-engineering pca python random-forest scikit-learn

Last synced: 10 Feb 2025

https://github.com/otuemre/obesity-classification

Machine learning project to classify obesity levels based on health metrics like age, sex, height, weight, and BMI.

classification data-science healthcare machine-learning obesity-classification scikit-learn

Last synced: 23 Jan 2025

https://github.com/pjj11005/ml_with_pytorch_study

[๋จธ์‹  ๋Ÿฌ๋‹ ๊ต๊ณผ์„œ: ํŒŒ์ดํ† ์น˜ ํŽธ] -> ํ•™์Šตํ•œ ์ฝ”๋“œ ์ €์žฅ์†Œ

deep-learning graph-neural-networks machine-learning neural-networks pytorch scikit-learn transformer

Last synced: 22 Jan 2025

https://github.com/karanyeole/bank-loan-default-risk-analysis-

This project aims to analyze the risk of default on bank loans using machine learning techniques. The dataset used for analysis contains information about loan applicants, including their demographics, financial history, and loan details.

feature-engineering matplotlib numpy pandas python scikit-learn seaborn

Last synced: 28 Jan 2025

https://github.com/akimuddinshaikh/machine-learning-project

A comparative study of regression models (Decision Tree, Random Forest, Ridge, Lasso, SVM) for predicting real estate prices in King County, NYC, and California using PCA & Pipeline techniques.

machine-learning pca-analysis python regression-models scikit-learn statsmodels

Last synced: 10 Feb 2025

https://github.com/siddhantborse/atmosviz

Atmos Viz is a Python-based project designed to analyze, visualize, and predict global temperature trends across various cities and countries using time-series analysis and advanced data science techniques. Leveraging historical climate data, this project integrates machine learning models, geospatial mapping, and interactive visualizations to unco

geopandas geospatial-analysis gis matplotlib numpy pandas plotly python scikit-learn seaborn shapefiles time timeseries-analysis timeseries-data

Last synced: 19 Dec 2024

https://github.com/karanyeole/dragon-real-estate-price-predictor

The project predicts the real estate prices in the mythical land of Dragons. It uses a dataset of historical real estate prices along with features such as location, size, and amenities to train a model for predicting prices of new properties.

matplotlib numpy pandas python scikit-learn

Last synced: 28 Jan 2025

https://github.com/sudothearkknight/movie-recommendation-system

The primary goal of this project is to provide personalized movie recommendations to users based on their preferences and the characteristics of the movies. This is achieved through a multi-step process involving data preprocessing, text vectorization, and recommendation generation.

anaconda-environment data-science jupyter-notebook machine-learning movie-recommendation movies pandas python3 recommendation-system recommender-system scikit-learn scikitlearn-machine-learning

Last synced: 11 Nov 2024

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 23 Jan 2025

https://github.com/hmasdev/ssbgm

Score Based Generative Model with scikit-learn

generative-model scikit-learn

Last synced: 23 Jan 2025

https://github.com/no-country-simulation/s16-21-n-data-bi

Analisis del COVID-19 - insights sobre la evoluciรณn de la pandemia - impacto en 5 paises sudamericanos.

eda etl machine-learning matplotlib pandas powerbi python scikit-learn seabron streamlit

Last synced: 11 Nov 2024

https://github.com/abhivur/connections-ai

Contributors: Meet Gamdha, Gaurav Nimmagadda

bert python scikit-learn word2vec

Last synced: 10 Feb 2025

https://github.com/kr1shnasomani/orthovision

Bone fracture detection from X-ray image using CNN (EfficientNetB3 architecture)

computer-vision deep-learning keras matplotlib neural-network numpy opencv scikit-learn seaborn tensorflow

Last synced: 19 Dec 2024

https://github.com/raju-2003/indiaai-cyberguard-ai-hackathon

An NLP-powered system to simplify cybercrime reporting by analyzing descriptions, categorizing incidents, and providing actionable insights.

matplotlib nltk numpy pandas python random-forest-classifier re scikit-learn seaborn shap spacy wordcloud

Last synced: 23 Jan 2025

https://github.com/purcellcjp/credit-risk-classification

This project utilized Python and scikit-learn libraries to train and evalute a Machinge Learning model based on loan risk.

machine-learning numpy pandas-dataframe python scikit-learn

Last synced: 23 Jan 2025

https://github.com/bastianlq/gym-prediccion-churn-y-agrupacion-clustering

Predicciรณn de churn, agrupaciรณn de clientes mediante clustering y recomendaciones de marketing para gym

aprendizaje-automatico clustering machine-learning scikit-learn

Last synced: 23 Jan 2025

https://github.com/colinwu0403/weatherpredictor

ML model that predicts future weather temperatures. Dataset taken from NOAA's Climate Data Online

pandas scikit-learn

Last synced: 22 Jan 2025

https://github.com/kunalpisolkar24/dsbda_lab

Collection of practical codes for Savitribai Phule Pune University's Data Science and Big Data Analytics Laboratory (310256).

data-analytics data-preprocessing data-science data-wrangling descriptive-statistics linear-regression logistic-regression mapreduce scala scikit-learn sppu-computer-engineering tf-idf

Last synced: 16 Jan 2025

https://github.com/aasjunior/machinelearningapp

O Machine Learning App รฉ um aplicativo desenvolvido com Kotlin, Android Studio e Jetpack Compose, para aplicaรงรฃo de algoritmos de aprendizado de mรกquina e exibiรงรฃo dos resultados. Realizado como tarefa da disciplina de Laboratรณrio Mobile/Computaรงรฃo Natural no 5ยบ Semestre de Desenvolvimento de Software Multiplataforma.

fastapi jetpack-compose kotlin-android machine-learning material-design scikit-learn

Last synced: 30 Dec 2024

https://github.com/miyajianimation/spam-filter

Spam-Filter is a powerful tool used to automatically detect and remove unwanted or unsolicited electronic messages that often flood email inboxes. It helps users to efficiently manage their emails by filtering out irrelevant or potentially harmful content, allowing them to focus on important messages.

anti-spam antispam blocklist cold-calls docker fritz-box fritzbox lua rspamd scikit-learn spam-classification spamd support-vector-machines zabbix

Last synced: 10 Feb 2025

https://github.com/dmarks84/coursework_project_ml-classification

Project for IBM Data Science course on Machine Learning -- Trained ML models for classification, evaluating based on a variety of metrics

classification communication data-modeling dataframes numpy pandas python scikit-learn supervised-ml

Last synced: 23 Dec 2024

https://github.com/aasjunior/mlapp-api

Esta API fornece endpoints para aplicar algoritmos de aprendizado de mรกquina, como K-Nearest Neighbors (KNN), รrvore de Decisรฃo e Algoritmo Genรฉtico. Realizado como tarefa da disciplina de Laboratรณrio Mobile/Computaรงรฃo Natural no 5ยบ Semestre de Desenvolvimento de Software Multiplataforma.

fastapi machine-learning python scikit-learn

Last synced: 30 Dec 2024

https://github.com/asuquoaa/predicting_viewer_engagement_with_educational_videos

This project uses machine learning to predict video engagement based on features such as transcript complexity, speaker speed, and silence periods. By understanding the factors influencing engagement, we can improve content recommendations and educational experiences.

data-visualization exploratory-data-analysis machine-learning scikit-learn

Last synced: 10 Feb 2025

https://github.com/msikorski93/heart-failure-prediction

The subject of this repository was to perform binary classification based on respondent's collected features (age, cholesterol level, fasting blood sugar, thallium stress test results, etc.).

classification knn-classifier logistic-regression random-forest-classifier roc-curves scikit-learn svm-classifier

Last synced: 09 Jan 2025

https://github.com/gt7o3/loan-prediction

Predict loan approval status using machine learning techniques. This project demonstrates data preprocessing, feature engineering, model training, and evaluation, along with an interactive Streamlit app for real-time predictions. Ideal for financial decision-making.

accuracy-analysis juypter lending-club loan-application loan-data loan-default-prediction loan-prediction logistic-regression machine-learning pca predictive-analytics python scikit-learn visualization

Last synced: 10 Feb 2025

https://github.com/martinkersner/kmeans-meetup

Presentation about k-Means for Seoul AI Meetup on July 22, 2017.

kmeans numpy python scikit-learn

Last synced: 01 Jan 2025

https://github.com/kr1shnasomani/firefinder

Fire detection from images using CNN (ResNet50 architecture)

computer-vision deep-learning keras matplotlib neural-network numpy opencv scikit-learn seaborn tensorflow

Last synced: 19 Dec 2024

https://github.com/hariprasath-v/hackerearth-amazon-business-research-analyst-hiring-challenge

Build a machine learning model that can calculate the time the delivery person takes to deliver the order.

exploratory-data-analysis hackerearth machine-learning pandas pycaret python scikit-learn seaborn

Last synced: 13 Jan 2025

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 04 Feb 2025

https://github.com/leticiamilan/formacao-python-developer-dio

Formaรงรฃo Python Developer - Digital Innovation One

django flask pandas pymongo python scikit-learn sqlalchemy sqlite

Last synced: 28 Jan 2025

https://github.com/alessandromonolo/fraud-detection-binary-classification-model

This project builds a machine learning model to classify fraudulent clients using a banking dataset. Data preprocessing, statistical analysis, and feature selection were performed before training KNN and Random Forest Classifier. Model performance was evaluated using accuracy, precision, recall, and F1-score.

classification-model fraud-detection knn-classification machine-learning pandas python random-forest scikit-learn statistical-analysis

Last synced: 10 Feb 2025

https://github.com/kr1shnasomani/tonesense

Speech emotion recognition from audio clips using CNN

deep-learning keras librosa matplotlib neural-network pandas scikit-learn seaborn tensorflow

Last synced: 19 Dec 2024

https://github.com/myahninsi/credit_card_fraud_detection

This repository is for the Neural Networks and Deep Learning Course - Assignment 1, focusing on credit card fraud detection. The project utilizes a machine learning model to predict whether a transaction is fraudulent using a synthetic credit card dataset.

matplotlib numpy pandas pickle python scikit-learn seaborn streamlit

Last synced: 10 Feb 2025

https://github.com/mpoojithavigneswari/bangalore-house-price-prediction

This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.

data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 16 Jan 2025

https://github.com/sachinh123/cognitive-customer-insights-with-watson-ai

This project analyzes customer data to provide insights for personalized services, behavior prediction, and improved support.

flask ibm-cloud ibm-watson-assistant ibm-watson-nlu nltk python scikit-learn

Last synced: 10 Feb 2025

https://github.com/samudraneel05/stanford-open-policing

The Stanford Open Policing Project (SOPP) aims to bring transparency to police interactions by collecting and analyzing data on traffic stops across the United States. It accumulates a vast dataset on traffic stops, encompassing details such as demographics, location, and outcomes.

clustering heirarchical-clustering k-means-clustering machine-learning matplotlib pandas python scikit-learn

Last synced: 26 Jan 2025

https://github.com/belsabbagh/employee-turnover-and-customer-churn-classification

A data science project that tests mutliple models on an employee tunronver and customer churn problem

machine-learning pandas python scikit-learn

Last synced: 09 Jan 2025

https://github.com/ebadshabbir/decision_tree_algorithm

Decision Tree Classifier for Social Network Ads A Python implementation of a Decision Tree Classifier to predict user purchasing behavior based on age and estimated salary. Includes feature scaling, model evaluation (confusion matrix and accuracy), and visualizations of decision boundaries for both training and test sets.

decision-tree-classifier jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 17 Jan 2025

https://github.com/samp1012/email_sms_spam_detector

An Email/SMS spam classifier that aims to identify and distinguish between spam and non-spam messages.

multinomial-naive-bayes naive-bayes-classifier natural-language-processing numpy pandas python scikit-learn spam-detection text-vectorization tokenization

Last synced: 19 Dec 2024

https://github.com/adi3042/thyroid-disease-detection

๐Ÿ”๐ŸŒŸ Discover Thyroid Disease Detection! Dive into our advanced system designed to identify and predict thyroid disorders using cutting-edge machine learning techniques. Leverage our comprehensive models and data analysis tools to make informed decisions about thyroid health. ๐Ÿฉบ๐Ÿ”ฌ๐Ÿš€ ThyroidHealthTech

classification css detection-model functools html ipykernel javascript jupyter-notebook machine-learning matplotlib numpy pandas python3 scikit-learn setuptools thyroid-dataset thyroid-disease thyroid-disease-detection venv

Last synced: 22 Jan 2025

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn

Last synced: 09 Feb 2025

https://github.com/icepanorama/internship-visualizations-and-demonstrations

A collection of some of the programs that I've written over the course of my internship.

artificial-intelligence machine-learning matplotlib numpy pandas python3 pytorch scikit-learn

Last synced: 01 Feb 2025

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 08 Jan 2025

https://github.com/thekartikeyamishra/ai-news-aggregator

This project will create an AI-powered News Aggregator that collects news from selected sources, categorizes it using NLP-based techniques, and displays the results in a user-friendly Tkinter-based GUI.

ai machine-learning nltk python python3 requests scikit-learn

Last synced: 19 Dec 2024

https://github.com/himanshugoyal77/shell-detection-frontend

Fraud detection of companies using Machine learning and django

django scikit-learn

Last synced: 19 Jan 2025

https://github.com/richardbmk/datascience_machinelearning

projects related with data science and machine learning projects.

data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 23 Jan 2025

https://github.com/pradipnp/decisiontree-iris

Machine learning project to classify iris flowers using a decision tree

classification decision-tree iris-dataset machine-learning python scikit-learn

Last synced: 10 Feb 2025

https://github.com/ahmedheakl/diabetes_classification_svm

Classifying patients to know if they have diabetes using Supporting Vector Machine Model.

machine-learning python scikit-learn

Last synced: 13 Jan 2025

https://github.com/sohv/bangalore-house-price

A tool to find house prices in Bangalore

jupyter-notebook scikit-learn streamlit

Last synced: 19 Dec 2024

https://github.com/drorata/mnist-examples

ML examples for the MNIST dataset

machine-learning ml mnist python scikit-learn torch

Last synced: 15 Jan 2025

https://github.com/lazarust/jupyternotebooks

Storage spot for all my Jupyter Notebooks. Check some of them out!!

jupyter-notebook jupyter-notebooks keras scikit-learn sklearn

Last synced: 08 Jan 2025

https://github.com/s-matke/eco-forecast

Machine learning model used for predicting European country with most green surplus energy generated

data-science green-energy machine-learning scikit-learn supervised-learning

Last synced: 04 Feb 2025

https://github.com/peterchain/titanic

Script for the Titanic dataset for evaluating which passengers survived

kaggle machine-learning pandas-dataframe python3 scikit-learn

Last synced: 02 Feb 2025

https://github.com/andrewsy1004/logistic-regression-spam-classifier

This project implements a spam email classifier using Logistic Regression.

numpy pandas scikit-learn

Last synced: 19 Dec 2024

https://github.com/nikitalpopov/evotor_champ

solution for evotor data challenge

data-analysis data-science python scikit-learn

Last synced: 25 Jan 2025

https://github.com/ahmed122000/ml_model_deployment

The HR Analytics: Job Change Predictor is a Flask-based web application that uses machine learning to predict whether an employee will stay with a company or leave. It allows users to train models, evaluate their performance, and make predictions based on employee data, providing valuable insights for HR decision-making.

classification flask machine-learning python3 rest-api scikit-learn

Last synced: 02 Feb 2025

https://github.com/s0fft/learning-lab

Code Notes & Test-Learn // Micro Pet-Projects: Python / Asynchrony / FastAPI / Django-Tastypie / Parsing / SQL / Docker / DS / ML / etc.

asynchrony data-science django docker fastapi jupyter-lab jupyter-notebook mashine-learning matplotlib notes numpy pandas parsing python3 scikit-learn seaborn sql sqlalchemy tastypie telegram-bot

Last synced: 15 Jan 2025

https://github.com/asut00/machine-learning-piscine_42ai

Comprehensive Machine Learning Bootcamp by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.

linear-regression machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Dec 2024

https://github.com/kr1shnasomani/currencyshield

Fake and real currency detection using ResNet50 and image classification techniques

computer-vision deep-learning matplotlib neural-network numpy scikit-learn tensorflow

Last synced: 19 Dec 2024

https://github.com/andrewsy1004/linear-regression-model-for-house-price-prediction

A linear regression model to predict house prices based on features like size, location, and number of rooms. This project demonstrates the application of machine learning in real estate price estimation

linear-regression python scikit-learn xgbregressor

Last synced: 01 Feb 2025

https://github.com/vikneshsrv24/customer-segmentation

Segregation of customers based on purchasing pattern for targeted marketing.

jupyter-notebook matplotlib pandas python scikit-learn

Last synced: 19 Dec 2024

https://github.com/ehsan-behzadi/predicting-diabetes-a-machine-learning-approach-using-the-pima-indians-diabetes-dataset

This repository contains a machine learning model developed using the Pima Indians Diabetes dataset. The goal of this project is to predict the likelihood of diabetes in patients based on various medical attributes.

classification data-preprocessing diabetes-prediction feature-engineering imbalanced-data imputation-methods machine-learning navies-bayes-classifer outlier-detection pima-indians-diabetes python scikit-learn

Last synced: 19 Dec 2024

https://github.com/akhileshthite/india-population

ML (simple linear regression) model for predicting India's population.

machine-learning numpy pandas python scikit-learn

Last synced: 22 Jan 2025

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Jan 2025

https://github.com/netcodez/climate-prediction-pipeline

Predicting London's climate using machine learning techniques. This project aims to forecast mean temperature in Celsius (ยฐC) using various regression models and logging experiments with MLflow

huggingface machine-learning mlflow mlflow-tracking mlflow-tracking-server mlops python scikit-learn streamlit

Last synced: 15 Jan 2025

https://github.com/aliy98/navigation-sensor-data-classification

Classification of a Navigation Robot Sensor Dataset Using SVM, Random Forest and Neural Network

artificial-neural-networks keras multiclass-classification random-forest scikit-learn scitos-g5 support-vector-machines

Last synced: 02 Feb 2025