Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/netcodez/climate-prediction-pipeline

Predicting London's climate using machine learning techniques. This project aims to forecast mean temperature in Celsius (°C) using various regression models and logging experiments with MLflow

huggingface machine-learning mlflow mlflow-tracking mlflow-tracking-server mlops python scikit-learn streamlit

Last synced: 15 Jan 2025

https://github.com/paragon279/cancercellclassificationbyscikitlearn

Classifies cancer cell using Scikit-learn

machine-learning python scikit-learn

Last synced: 20 Jan 2025

https://github.com/dwija12903/mentorness-internship

Developed and applied technical skills in areas such as programming languages, data analysis, and machine learning methodologies.

matplotlib numpy pandas python scikit-learn

Last synced: 08 Jan 2025

https://github.com/akhileshthite/india-population

ML (simple linear regression) model for predicting India's population.

machine-learning numpy pandas python scikit-learn

Last synced: 22 Jan 2025

https://github.com/enyaude/california_house_price_prediction

Developed a California house price prediction model utilizing linear regression and Random Forest, and applied machine learning techniques such as Ridge, and Lasso for optimization in Python.

jupyter-notebook linear-regression python random-forest scikit-learn streamlit

Last synced: 19 Dec 2024

https://github.com/dipa09/riot_imgclf

Multi-class image classifier for RIOT-OS

arduino-mega-2560 emlearn esp32-cam m2cgen micromlgen riot-os scikit-learn tinyml

Last synced: 30 Jan 2025

https://github.com/nits2612/data-science-projects

Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.

data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning

Last synced: 19 Dec 2024

https://github.com/andrewsy1004/linear-regression-model-for-house-price-prediction

A linear regression model to predict house prices based on features like size, location, and number of rooms. This project demonstrates the application of machine learning in real estate price estimation

linear-regression python scikit-learn xgbregressor

Last synced: 01 Feb 2025

https://github.com/mitchmedeiros/mlcompare

Quickly compare machine learning models across libraries and datasets

huggingface-datasets kaggle openml pytorch scikit-learn xgboost

Last synced: 10 Oct 2024

https://github.com/adriantomin/bulldozer-price-prediction

Predicting the Sale Price of Bulldozers Using Machine Learning 🚜💰 This project uses machine learning to predict bulldozer sale prices based on historical data from the Kaggle Bluebook for Bulldozers competition. The goal is to minimize the RMSLE between actual and predicted prices.

data-science jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/shahaba83/airplane-ticket-cancellation

In this project, we try to predict the possibility of canceling the plane ticket by the buyer

datatime numpy pandas python scikit-learn seaborn

Last synced: 19 Dec 2024

https://github.com/s0fft/learning-lab

Code Notes & Test-Learn // Micro Pet-Projects: Python / Asynchrony / FastAPI / Django-Tastypie / Parsing / SQL / Docker / DS / ML / etc.

asynchrony data-science django docker fastapi jupyter-lab jupyter-notebook mashine-learning matplotlib notes numpy pandas parsing python3 scikit-learn seaborn sql sqlalchemy tastypie telegram-bot

Last synced: 15 Jan 2025

https://github.com/chrispsang/customerchurnanalysis

Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.

customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau

Last synced: 10 Oct 2024

https://github.com/rishavp15/aivshuman_text

In this project make user to decide that the text which is entered in text box is a human generated or a computer generated text.

django pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/s-matke/eco-forecast

Machine learning model used for predicting European country with most green surplus energy generated

data-science green-energy machine-learning scikit-learn supervised-learning

Last synced: 04 Feb 2025

https://github.com/lazarust/jupyternotebooks

Storage spot for all my Jupyter Notebooks. Check some of them out!!

jupyter-notebook jupyter-notebooks keras scikit-learn sklearn

Last synced: 08 Jan 2025

https://github.com/davgiles/ut-austin-data-science-program

This repository contains my projects from the Data Science & Business Analytics Post-Graduate Program through UT Austin.

eda matplotlib numpy pandas python scikit-learn scipy seaborn visualization xgboost

Last synced: 21 Dec 2024

https://github.com/drorata/mnist-examples

ML examples for the MNIST dataset

machine-learning ml mnist python scikit-learn torch

Last synced: 15 Jan 2025

https://github.com/ahmedheakl/diabetes_classification_svm

Classifying patients to know if they have diabetes using Supporting Vector Machine Model.

machine-learning python scikit-learn

Last synced: 13 Jan 2025

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 10 Oct 2024

https://github.com/alejoduarte23/si_bayesianmixturemodel

Implementation of a two-stage fast Bayesian system identification for separated Modes. This repository expands the usage of this technique by adding a mixture model fit to obtain modal parameters from the posterior distribution.

matplotlib numpy scikit-learn scipy

Last synced: 10 Oct 2024

https://github.com/mkdirer/depression-data-analysis

This project analyzes a Kaggle depression dataset using data preprocessing, clustering, classification, and outlier detection techniques. Python libraries like pandas, numpy, matplotlib, seaborn, and scikit-learn are used to extract insights.

classification clustering matplotlib numpy pandas scikit-learn seaborn vizualization

Last synced: 10 Oct 2024

https://github.com/bilgenurbekar/turkishcyberbullying

Contains fine-tuned BERT models and results in the text classification category using Turkish social media data

bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers

Last synced: 10 Oct 2024

https://github.com/himanshugoyal77/shell-detection-frontend

Fraud detection of companies using Machine learning and django

django scikit-learn

Last synced: 19 Jan 2025

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 08 Jan 2025

https://github.com/sergeimakarovv/energy-data-analytics-ml

Analyzing global data on sustainable energy, predicting CO2 emissions per capita

machine-learning pandas plotly python scikit-learn streamlit

Last synced: 10 Oct 2024

https://github.com/ebadshabbir/decision_tree_algorithm

Decision Tree Classifier for Social Network Ads A Python implementation of a Decision Tree Classifier to predict user purchasing behavior based on age and estimated salary. Includes feature scaling, model evaluation (confusion matrix and accuracy), and visualizations of decision boundaries for both training and test sets.

decision-tree-classifier jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 17 Jan 2025

https://github.com/belsabbagh/employee-turnover-and-customer-churn-classification

A data science project that tests mutliple models on an employee tunronver and customer churn problem

machine-learning pandas python scikit-learn

Last synced: 09 Jan 2025

https://github.com/gsmafra/sklearn-dummies

Scikit-learn label binarizer with support for missing values

data-science machine-learning pandas python scikit-learn

Last synced: 15 Dec 2024

https://github.com/tapas-gope/telecommunication-customer-churn

This project involves predicting customer churn in a telecommunications company using machine learning techniques, exploring various features' impact, optimizing models, and identifying key factors influencing churn.

feature-engineering matplotlib-pyplot model-evaluation-and-validation numpy pandas python scikit-learn

Last synced: 19 Dec 2024

https://github.com/mpoojithavigneswari/bangalore-house-price-prediction

This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.

data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 16 Jan 2025

https://github.com/rickcontreras/modelos1

Modelo de clasificación para predecir el desempeño de estudiantes en las Pruebas Saber Pro en Colombia. Incluye análisis exploratorio de datos, preprocesamiento y modelos de machine learning.

classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance

Last synced: 10 Oct 2024

https://github.com/alaazameldev/text-based-search-engine

Implementation of a search engine using TF-IDF and Word Embedding-based vectorization techniques for efficient document retrieval

chromadb fastapi gensim-word2vec nltk numpy precision-recall python scikit-learn tf-idf-vectorizer

Last synced: 10 Oct 2024

https://github.com/ismaelvr1999/air-quality-clustering

This project focuses on analyzing air quality data and categorizing it into clusters using the K-Means algorithm.

jupyter-notebook machine-learning matplotlib pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/leticiamilan/formacao-python-developer-dio

Formação Python Developer - Digital Innovation One

django flask pandas pymongo python scikit-learn sqlalchemy sqlite

Last synced: 28 Jan 2025

https://github.com/aymen016/cosmic-mystery-challenge-2912

"Explore the depths of space and unravel cosmic mysteries in the year 2912 with our Cosmic Mystery Challenge repository. Dive into data science adventures as you predict the fate of passengers aboard the Spaceship Titanic after a collision with a spacetime anomaly. Join us in reshaping history and saving lives across the universe!"

kaggle matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python scikit-learn scikitlearn-machine-learning seaborn

Last synced: 04 Feb 2025

https://github.com/msikorski93/heart-failure-prediction

The subject of this repository was to perform binary classification based on respondent's collected features (age, cholesterol level, fasting blood sugar, thallium stress test results, etc.).

classification knn-classifier logistic-regression random-forest-classifier roc-curves scikit-learn svm-classifier

Last synced: 09 Jan 2025

https://github.com/bilalm04/email-spam-classifier

A machine learning project that classifies emails as spam or not spam using Logistic Regression, with a deployable Flask API for real-time classification.

api flask jupyter-notebook machine-learning matplotlib nlp numpy pandas python scikit-learn

Last synced: 10 Oct 2024

https://github.com/aasjunior/mlapp-api

Esta API fornece endpoints para aplicar algoritmos de aprendizado de máquina, como K-Nearest Neighbors (KNN), Árvore de Decisão e Algoritmo Genético. Realizado como tarefa da disciplina de Laboratório Mobile/Computação Natural no 5º Semestre de Desenvolvimento de Software Multiplataforma.

fastapi machine-learning python scikit-learn

Last synced: 30 Dec 2024

https://github.com/aasjunior/machinelearningapp

O Machine Learning App é um aplicativo desenvolvido com Kotlin, Android Studio e Jetpack Compose, para aplicação de algoritmos de aprendizado de máquina e exibição dos resultados. Realizado como tarefa da disciplina de Laboratório Mobile/Computação Natural no 5º Semestre de Desenvolvimento de Software Multiplataforma.

fastapi jetpack-compose kotlin-android machine-learning material-design scikit-learn

Last synced: 30 Dec 2024

https://github.com/kunalpisolkar24/dsbda_lab

Collection of practical codes for Savitribai Phule Pune University's Data Science and Big Data Analytics Laboratory (310256).

data-analytics data-preprocessing data-science data-wrangling descriptive-statistics linear-regression logistic-regression mapreduce scala scikit-learn sppu-computer-engineering tf-idf

Last synced: 16 Jan 2025

https://github.com/colinwu0403/weatherpredictor

ML model that predicts future weather temperatures. Dataset taken from NOAA's Climate Data Online

pandas scikit-learn

Last synced: 22 Jan 2025

https://github.com/no-country-simulation/s16-21-n-data-bi

Analisis del COVID-19 - insights sobre la evolución de la pandemia - impacto en 5 paises sudamericanos.

eda etl machine-learning matplotlib pandas powerbi python scikit-learn seabron streamlit

Last synced: 11 Nov 2024

https://github.com/xprithvi/random-forest-regressor

This Jupyter notebook serves as a machine learning template to quickly make predictions and analyse feature importance in a dataset.

data-science feature-extraction machine-learning random-forest random-forest-regression scikit-learn

Last synced: 20 Jan 2025

https://github.com/vijaykumarr1452/startup_success_predictor

This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing of dataset (50_Startups.csv)

machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn

Last synced: 01 Feb 2025

https://github.com/giatraskon/machine_learning_assignments

Machine learning assignments covering regression, classification, neural networks, adversarial examples, and real-time emotion detection using Python. Includes theoretical insights and practical implementations.

adversarial-examples bayesian-inference bias-variance-tradeoff cifar10 classification deep-learning emotion-recognition iris-dataset k-nearest-neighbours keras machine-learning mnist neural-networks opencv pima-indians-diabetes python regression ridge-regression scikit-learn tensorflow

Last synced: 01 Feb 2025

https://github.com/galaxy092/samsung-innovation-campus-big-data-capstone-project

Samsung Innovation Campus Big Data Capstone Project - Weather Prediction

hadoop jupyter-notebook pandas pyspark scikit-learn sparksql

Last synced: 01 Feb 2025

https://github.com/dukebw/ml-model-selection

Machine learning model selection using Dlib and scikit-learn.

dlib machine-learning ranking scikit-learn

Last synced: 01 Feb 2025

https://github.com/gregoritsch3/dl_cv_e2e_potatodiseaseclassification

A guided CodeBasics Deep Learning Project where a Convolutional Model is deployed onto a Website (FastAPI) and Mobile App (React Native, Google Cloud). It's purpose is the classification of potato plant images into "healthy", "Early Blight" and "Late Blight" categories.

cnn-classification gcp model-deployment scikit-learn tensorflow

Last synced: 01 Feb 2025

https://github.com/jaswanthv99/basic_ml-model_understanding

This project explains basic ML-Models(KNN, Naive bayes, Logistic Regression, SVM, A neural N/W)

matplotlib-python pandas-python scikit-learn tensorflow

Last synced: 01 Feb 2025

https://github.com/asier-ortiz/python-for-data-science-and-machine-learning-bootcamp

Python for Data Science and Machine Learning Bootcamp: NumPy, Pandas, Seaborn, Matplotlib, Plotly, Scikit-Learn, TensorFlow, and more

matplotlib numpy pandas plotty python scikit-learn seaborn tensorflow

Last synced: 01 Feb 2025

https://github.com/amandeep-gupta19/chatbot

Created a custom chatbot using Langchain. Here's a summary of what I did: Data Extraction: I gathered data about technical courses from the Brainlox website using Langchain’s URL loaders. Embedding Creation & Storage: I converted this data into embeddings and stored it in a vector store for efficient searching. API Development: I built a Flask

data-extraction faiss-vector-database flask-restful langchain numpy scikit-learn vector-database webbaseloader

Last synced: 01 Feb 2025

https://github.com/oadultradeepfield/galaxy10-anomaly-detection

A public API and experimental PyTorch pipeline for anomaly detection in the Galaxy10 DECals dataset using ResNet50, autoencoders, and clustering techniques

flask google-cloud-run kaggle pytorch scikit-learn

Last synced: 01 Feb 2025

https://github.com/gilevatanya/yandex-practicum-projects

Кейсы решенные на курсах Яндекс Практикума.

bert bootstrap catboost keras lightgbm matplotlib nltk numpy pandas postgresql python pytorch scikit-learn scipy seaborn sql

Last synced: 01 Feb 2025

https://github.com/swat1563/recommendation-system

This repository features a recommendation system and analytics engine using datasets on users, organizations, contents, contacts, events, and recommendations. It includes data preprocessing, building a recommendation system, and creating visual reports with Power BI.

analytics data-analysis data-visualization engine kaggle numpy pandas powerbi powerbi-dashboards powerbi-desktop powerbi-reports python recommendation-engine recommendation-system recommender-systems scikit-learn scipy

Last synced: 01 Feb 2025

https://github.com/eljandoubi/genre_classification

Create an ML pipeline for Genre Classification using MLflow.

hydra machine-learning mlflow numpy pandas pandas-profiling pytest scikit-learn scipy wandb

Last synced: 24 Jan 2025

https://github.com/rakibhhridoy/visualmachinelearning-yellowbrick

Yellowbrick wraps the scikit-learn and matplotlib to create publication-ready figures and interactive data explorations. It is a diagnostic visualization platform for machine learning that allows us to steer the model selection process by helping to evaluate the performance, stability, and predictive value of our models and further assist in diagnosing the problems in our workflow.

classification hyperparameter-tuning machine-learning model-evaluation model-view-presenter model-visualization python random-forest random-forest-classifier scikit-learn visualization xgboost xgboost-algorithm yellowbrick

Last synced: 25 Dec 2024

https://github.com/rakibhhridoy/differentprojects

Some of my learning projects that I practice to launch in data science. Not all, but some of few that was stored in my local repository. It can be useful for beginner data science enthusiast. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 25 Dec 2024

https://github.com/rakibhhridoy/breastcanceranalysis-classificationclustering

Breast cancer prediction both in classification and clustering method for better understanding the data. Though clustering is different from classification,to finding the key aspect the data have,sometimes we need every possible way to catch behavior of the data.

breast-cancer-prediction breastcancer-classification classification classification-algorithm clustering eda hyperparameter-optimization machine-learning python scikit-learn supervised-learning unsupervised-learning

Last synced: 25 Dec 2024

https://github.com/snghrsw/kikagaku-ml-learning

Pythonで単回帰分析と重回帰分析、ディープラーニングで回帰と分類

liner-regestion multiple-regression numpy pandas python scikit-learn

Last synced: 23 Jan 2025

https://github.com/thekartikeyamishra/predictive-sales-analytics

The Predictive Sales Analytics tool aims to help MSMEs forecast future sales using historical data. This advanced version leverages Machine Learning for accurate predictions and provides a dashboard to visualize sales trends, seasonality, and predictions.

joblib machine-learning matplotlib pandas python scikit-learn streamlit

Last synced: 26 Dec 2024

https://github.com/thekartikeyamishra/aipoweredmarketingassistant

AI-Powered Marketing Assistant, an advanced tool designed to enhance your digital marketing campaigns using the power of machine learning (ML) and large language models (LLMs). This project empowers small businesses and MSMEs to create compelling content, analyze campaigns, and strategize effectively.

artificial-intelligence llm matplotlib numpy openai pandas python scikit-learn streamlit

Last synced: 26 Dec 2024

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 26 Dec 2024

https://github.com/josugoar/digit-recognizer

Digit recognizer full stack web app and classifier

flask jquery opencv scikit-learn

Last synced: 01 Jan 2025

https://github.com/danielstankw/car_classification_and_regression

Regression & Classification model using a car data. Involves EDA, feature engineering, pipelines and more.

classification learning machine regression scikit-learn tensorflow trim

Last synced: 02 Jan 2025

https://github.com/lopez86/datascienceexamples

Examples of various data science & data analysis topics using various sources of data.

data-analysis data-science pandas scikit-learn tutorial visualization

Last synced: 26 Dec 2024

https://github.com/jalijuhola/amazon-textual-reviews-recommender-

predicting score and recommending using amazon textual reviews

numpy pandas python scikit-learn typescript

Last synced: 26 Dec 2024

https://github.com/mateusoliveira30/house-prices

This project was developed for the Kaggle competition "House Prices - Advanced Regression Techniques." The goal is to predict house sale prices using advanced regression techniques, including feature engineering, Random Forests, and Gradient Boosting.

kaggle-competition machine-learning scikit-learn

Last synced: 02 Jan 2025

https://github.com/yungshenglu/pokemonrecognition

Recognize Pokemon's image using scikit-learn in Python

knn-model python scikit-learn svm-model

Last synced: 26 Dec 2024

https://github.com/brianlesko/maze-runner

Developed a Python-based maze-crawling application using a PS5 controller interface. This project highlights skills in software-hardware integration and low-code UI design, demonstrating expertise ideal for advanced software engineering.

communication dualsense engineer engineering hacking hardware hardware-hacking interface low-code-ui mechanical-engineer mechanical-engineering protocol ps5 python robotics-engineer scikit-learn software sony streamlit ui

Last synced: 26 Dec 2024

https://github.com/vhnegrisoli/machine-learning-linguagens-programacao

Projeto de Data Science e Machine Learning de análise de linguagens de programação de 2004 a 2021

data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 10 Jan 2025