Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/junya737/weighted-pls-regression

A Python implementation of Weighted Partial Least Squares Regression with support for sample weights.

machine-learning partial-least-squares-regression scikit-learn

Last synced: 28 Dec 2024

https://github.com/virajbhutada/article-clustered-recommendation-system-ml

This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.

anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy

Last synced: 15 Oct 2024

https://github.com/andriamahay11master/machinelearning

This project is created to practice supervised ML using the libraries existing in Python

linear-regression multiple-linear-regression python scikit-learn scikitlearn-machine-learning

Last synced: 22 Jan 2025

https://github.com/atharvapathak/size_estimator_project

This project in Python aims to provide a tool for estimating the size of objects in images or videos. Using computer vision techniques, the project analyzes the input media, detects objects of interest, and provides an estimation of their size based on known reference points or objects.

cicd cnn opencv python pytorch rnn scikit-learn sql tensorflow

Last synced: 15 Jan 2025

https://github.com/jayadavv/dynamic-ml-model-selector

An interactive web application that allows users to upload their datasets and dynamically select, train, and evaluate various machine learning models. The app provides comprehensive performance metrics and visualizations, making it easy for users to analyze their data effectively.

decision-trees linear-regression logistic-regression matplotlib-pyplot plotly python random-forest scikit-learn streamlit

Last synced: 10 Feb 2025

https://github.com/lexxai/goit_python_ds_hw_03

Модуль 3. Класичне машинне навчання. Перенавчання. Лінійна регресія. LaTeX формули.

latex linear-regression matplotlib numpy pandas python scikit-learn

Last synced: 24 Jan 2025

https://github.com/lexxai/goit_python_ds_hw_05

Метод опорних векторів -Support Vector Machine, SVM. Дерева рішень - RandomForestClassifier, XGBClassifier

classification numpy pandas python random-forest-classifier reporting scikit-learn svm-classifier svm-model xgbclassifier

Last synced: 24 Jan 2025

https://github.com/mecha-aima/fake-bills-detection

This Python project implements a simple classification model comparison using scikit-learn to classify banknotes as either "Authentic" or "Counterfeit" based on four features

classification-model machine-learning model-selection scikit-learn

Last synced: 22 Jan 2025

https://github.com/achronus/data-exploration

A repository dedicated to interesting data exploration projects I've completed

data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 22 Jan 2025

https://github.com/gayathri2200/car-price-prediction---machine-learning

Car price prediction Machine Learning --- Which is used to predict the price of used cars based on the features.

data-science machine-learning modeldeployment pandas price-prediction python regression scikit-learn streamlit visual-studio visualization

Last synced: 10 Feb 2025

https://github.com/jmromer/warbler

A recommendation engine powered by scikit-learn and served by Flask.

flask flask-api numpy pandas python3 scikit-learn

Last synced: 13 Jan 2025

https://github.com/sreekar0101/-movie-recommendation-system-using-python

The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits.ultilizes python libraries numpy pandas and scikit learn.The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice

data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python

Last synced: 22 Jan 2025

https://github.com/filsan95/project-iot_malware_identification

This repository contains the code and data for a project that detects malware from IoT devices using a publish-subscribe model with Confluent and Databricks. The project streams IoT device data to Kafka, analyzes it, and detects malware using machine learning models such as Random Forest and Gradient Boosted Trees.

apache-kafka classification confluent databricks machine-learning-algorithms scikit-learn sql

Last synced: 22 Jan 2025

https://github.com/gamowy/systemysi

Zadania z systemów sztucznej inteligencji

artificial-intelligence python scikit-learn university-coursework

Last synced: 25 Jan 2025

https://github.com/lijesh010/ml_project_data_preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-cleaning data-exploration data-preprocessing machine-learning numpy pandas-python python scikit-learn

Last synced: 03 Feb 2025

https://github.com/venky-1710/superhero-recruitment

Superhero Recruitment System predicts hero selection using machine learning. Users input hero attributes through a web interface. A Random Forest model analyzes abilities, strengths, weaknesses, success rates, and missions completed. The Flask app displays results, showing if a hero is selected.

css flask html numpy pandas python scikit-learn

Last synced: 20 Jan 2025

https://github.com/kosmoswerner/scikitlearn.net

C# bindings for Scikit-Learn (WIP 🏗)

machine-learning scikit scikit-learn sklearn

Last synced: 15 Nov 2024

https://github.com/lakshitalearning/codsoft

Machine Learning Projects - CODSOFT Internship: This repository showcases my machine learning projects completed during my internship at Codsoft. It demonstrates my skills in developing innovative solutions using various ML techniques and tools.

churn-prediction codsoft codsoftinternship deep-learning handwritten-text-recognition internship-project keras machine-learning python rnn-tensorflow scikit-learn spam-detection

Last synced: 13 Feb 2025

https://github.com/vatshayan/pokemon-analysis

Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning

artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn

Last synced: 15 Jan 2025

https://github.com/mpolinowski/fisher-discriminant-analysis

LDA is a widely used dimensionality reduction technique built on Fisher’s linear discriminant.

linear-discriminant-analysis matplotlib-pyplot python scikit-learn

Last synced: 28 Jan 2025

https://github.com/mpolinowski/local-linear-embedding

Improve Data Quality by discarding non-correlating, noisy Dimensions

locally-linear-embedding pyplot python scikit-learn

Last synced: 28 Jan 2025

https://github.com/lau1944/coronavirus-world-prediction

Coronavirus Case Confirmed Trend Around The World

coronavirus pandas python scikit-learn

Last synced: 31 Dec 2024

https://github.com/pradeep31747/smartsuggest-personalized_product_recommendations

This project implements a personalized product recommendation system using machine learning techniques to enhance user experience and drive engagement.

jupyter-notebook keras numpy pandas pyhton scikit-learn sql tensorflow vscode

Last synced: 13 Feb 2025

https://github.com/mpolinowski/multi-dimensional-scaling

Multidimensional Scaling is a family of statistical methods that focus on creating mappings of items based on distance.

matplotlib-pyplot multi-dimensional-scaling python scikit-learn

Last synced: 28 Jan 2025

https://github.com/mpolinowski/scikit-wine-quality

Predicting Wine Quality with Several Classification Techniques using SciKit Learn.

feature-classifiers python scikit-learn

Last synced: 28 Jan 2025

https://github.com/arssite/dirty-cleanflooringimageprocessingusingyolov5

Uses YOLOv5 to classify floor cleanliness into five categories based on visual cues. It includes an annotated dataset, trained model,& evaluation outputs. Code covers data preprocessing, training, & testing. A comparative analysis highlights YOLOv5's advantages over traditional methods, providing an efficient solution automated floor cleanliness.

deep-neural-networks github google-colab jupyter-notebook labelimg matplotlib-pyplot numpy-library opencv-python pandas-python pytorch scikit-learn tensorflow yolov5

Last synced: 01 Feb 2025

https://github.com/mpolinowski/tstochastic-neighbor-embedding

Improve Data Quality by discarding non-correlating, noisy Dimensions

matplotlib-pyplot python scikit-learn t-sne

Last synced: 28 Jan 2025

https://github.com/bruceunx/ai-simulator

aiplayground 人工智能学习乐园

ai maching-learning scikit-learn

Last synced: 15 Jan 2025

https://github.com/namratha2301/bangalorehousepricepredictor

Predicting house price in Bangalore based on the key features of the house like number of rooms, size in square feet etc.

azure bashscript docker flake8 flask github-actions scikit-learn

Last synced: 28 Jan 2025

https://github.com/hallowshaw/text-emotion-classification-using-lstm-and-tokenization

This repository provides a machine learning and deep learning pipeline for text emotion detection. It includes a pretrained LSTM model, tokenizer, and preprocessing steps to classify emotions such as joy, sadness, and anger from text input. Easily deployable with provided resources and scripts.

emotion-classification emotion-detection feature-engineering lstm nltk nltk-python scikit-learn scikitlearn-machine-learning sentiment-analysis sequential-models text-classification text-classification-multi-label tokenization tokenizer

Last synced: 29 Dec 2024

https://github.com/smaddanki/data-science

Code blocks, algorithms, and research snippets in Data Science, Machine Learning, AI & Quant Finance.

deep-learning machine-learning pytorch scikit-learn spark

Last synced: 08 Feb 2025

https://github.com/callesjuan/ninjalprm

Protótipo de ferramenta de agrupamento de dispositivos Android por geolocalização (Server)

python scikit-learn xmpp

Last synced: 24 Oct 2024

https://github.com/aryan-coder-student/smart-agricultural-monitering-aryanpahari037-

Increase agricultural productivity and sustainability by offering a smart farming assistant that helps farmers make data-driven decisions to optimize resource use and crop management.

hackathon-project machine-learning numpy pandas python scikit-learn

Last synced: 05 Feb 2025

https://github.com/viniciusds2020/ml_pycaret_classificacao

Sistema de preprocessamento e treinamento de modelos de machine learning utilizando PyCaret. Uma metodologia low-code para processos de MLops

machine-learning mlops preprocessing pycaret python scikit-learn

Last synced: 05 Feb 2025

https://github.com/sarowarahmed/predicting-kolkata-house-price

🏠 Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.

app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit

Last synced: 13 Feb 2025

https://github.com/svetlanam/pycon-workshop

Pycon CZ workshop: Better data analyses and product recommendations with Instagram data

data-analysis data-science martinus matplotlib pandas pycon2016 pyconcz python scikit-learn workshop

Last synced: 13 Jan 2025

https://github.com/idaraabasiudoh/drug_prescribtion_decision_tree_model

This repository contains a machine learning project focused on classifying drugs based on patient characteristics using a Decision Tree classifier. The project uses Python and popular data science libraries such as scikit-learn, pandas, and matplotlib.

data-analysis jupyter-notebook machine-learning python3 scikit-learn

Last synced: 14 Jan 2025

https://github.com/sundanc/movierecommendation

Simple movie recommendation system built with Streamlit

movie-recommendation-app python scikit-learn scikitlearn-machine-learning streamlib

Last synced: 06 Feb 2025

https://github.com/thbaylson/datascience

All of my past data science assignments put into one singular notebook. The notebook begins with Intro to Data Visualization and ends with Machine Learning.

data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn

Last synced: 28 Jan 2025

https://github.com/jeus0522/7-explore-different-classifier-ml-app

A project exploring various classification algorithms, showcasing their implementation, comparison, and evaluation using Python and scikit-learn.

k-nearest-neighbours knn random-forest scikit-learn streamlit support-vector-machine svm

Last synced: 13 Feb 2025

https://github.com/kavaivaleri/linear-regression

Basic linear regression using data from Kaggle https://www.kaggle.com/adityadesai13/used-car-dataset-ford-and-mercedes/tasks?taskId=1258

linear-regression numpy pandas plots scikit-learn scipy seaborn-plots

Last synced: 20 Jan 2025

https://github.com/barrettotte/anilist-ml

Training a binary classifier model to predict if I would recommend an anime using my Anilist user data.

anilist binary-classification data-visualization machine-learning scikit-learn

Last synced: 03 Feb 2025

https://github.com/harshitwaldia/disease_detection

A disease detection system using Random Forest Classifier and GUI in Python, identifying illnesses based on user symptoms.

pandas-python python3 random-forest-classifier scikit-learn tkinter-gui

Last synced: 03 Feb 2025

https://github.com/kalelmartinho/imersao_dados_3

Terceira edição da Imersão Dados da Alura. O projeto foi inspirado em um desafio do Laboratory innovation science at Harvard disponibilizando os dados em uma competição no kaggle.

alura data-science imersaodados imersaodados3 pandas python3 scikit-learn

Last synced: 13 Jan 2025

https://github.com/marknature/machine-learning-intern

Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset

data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn

Last synced: 13 Jan 2025

https://github.com/cptanalatriste/copycat-detector

A Naive-Bayes classifier for detecting plagiarism.

amazon-sagemaker naive-bayes-classifier scikit-learn

Last synced: 12 Jan 2025

https://github.com/thekartikeyamishra/resumeevaluatorapp

The Automated Resume Evaluator is a Python-based application that helps evaluate resumes against job descriptions. It calculates an Applicant Tracking System (ATS) score, which is the percentage of keywords from the job description found in the resume.

flask machine-learning matplotlib nlp nltk pypdf python scikit-learn spacy textblob

Last synced: 03 Feb 2025

https://github.com/stefagnone/-air-france-sponsored-search-campaign-optimization

Optimized Air France's online advertising campaigns using advanced regression modeling and data analysis, uncovering actionable insights to maximize ROI and enhance ad performance.

advertising-analytics air-france data-science digital-marketing marketing-campaigns predictive-modeling python r regression-analysis roi-optimization scikit-learn sponsored-search

Last synced: 03 Feb 2025

https://github.com/lucs1590/commom_segmentations

The purpose of this repository is to document and expose code samples using common threading techniques.

computational-vision machine-learning open-source opencv python scikit-image scikit-learn segmentation sklearn

Last synced: 13 Jan 2025

https://github.com/joel-beck/airbnb-oslo

Price Prediction Models for Airbnb Apartments in Oslo | Winter Term 2021/22

prediction python pytorch scikit-learn

Last synced: 22 Dec 2024

https://github.com/tomgorb/ds-utils

pre-processing of a DataFrame into a sparse matrix for model input

machine-learning preprocessing scikit-learn

Last synced: 09 Jan 2025

https://github.com/radoslawregula/iris-classification

Jupyter notebook implementing an efficient machine learning method to classify flowers from the Iris data set.

classification iris-dataset jupyter-notebook machine-learning python scikit-learn softmax-classifier

Last synced: 21 Jan 2025

https://github.com/rririanto/thesis-projects

The computer science thesis project that I worked on when I was a student and was looking for a part time job

bag machine-learning python2 python27 scikit-learn surf

Last synced: 20 Jan 2025

https://github.com/omar-karimov/ai4renewals-gendes

This project adopts a modular Python architecture within an MLOps framework to enhance subscription renewal predictions, utilizing FastAPI and MongoDB with AWS integration (S3, ECR, EC2). Docker ensures seamless deployment, and GitHub Actions automate the CI/CD workflows. Evidently AI monitors drift to guarantee predictive accuracy and reliability.

api aws deployment docker drift-detection ec2 ecr evidently fastapi github-actions mangodb mlops modularization pipeline s3 scikit-learn workflow yaml-configuration

Last synced: 22 Jan 2025

https://github.com/bsamseth/triangular-regressor

A scikit-learn compatible implementation of a 2D triangular regressor.

scikit-learn triangulation

Last synced: 06 Feb 2025

https://github.com/boomerspine/selflearning_chatbot

Self learning chatbot using python

python scikit-learn

Last synced: 27 Jan 2025

https://github.com/sudothearkknight/15-machinelearningprojects

A curation of 15 Machine Learning projects in various fields that are helping me gain a better understanding of the different machine learning tools, techniques, algorithms and methodalogies.

classification-algorithm machine-learning machine-learning-algorithms natural-language-processing pycharm-ide python3 regression-models scikit-learn scikitlearn-machine-learning spam-detection

Last synced: 12 Feb 2025

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 12 Feb 2025

https://github.com/a-poor/sample-model-serve

Demo for using Flask to serve a scikit-learn model as an API

api data-science docker flask machine-learning scikit-learn

Last synced: 02 Feb 2025

https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer

Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.

breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm

Last synced: 12 Feb 2025

https://github.com/nika2811/new-york-city-taxi-fare-prediction

About In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 8 features along with GPS coordinates of pickup and dropoff

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 12 Feb 2025

https://github.com/kevin-patyk/mlobster

An educational project which aims to recreate the major algorithms and functionality of Scikit-learn in Rust.

educational-project machine-learning mlobster python rust scikit-learn

Last synced: 28 Jan 2025

https://github.com/kiapanahi/handson-machine-learning-book-playground

Sample codes and practices around the book "Hands-On Machine Learning with Scikit-Learn and TensorFlow"

machine-learning python scikit-learn tensorflow

Last synced: 16 Jan 2025

https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn

This repo desribes bulding a decision tree to predict customer churn in a given organisation

accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn

Last synced: 16 Jan 2025

https://github.com/billgewrgoulas/recommendation-systems

Algorithms for joke rating prediction using the joke data-set from Kaggle.

algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi

Last synced: 16 Jan 2025

https://github.com/somjit101/ds-logistic-regression

A simple implementation of the Logistic Regression Classifier on the Breast Cancer Dataset with L1 regularization and GridSearch for hyperparameter tuning.

breast-cancer-prediction breast-cancer-wisconsin grid-search grid-search-cross-validation hyperparameter-tuning logistic-regression machine-learning-algorithms regularization scikit-learn

Last synced: 16 Jan 2025

https://github.com/eesunmoon/aml_project

[Project] Analyzing Fitness Patterns & Building Predictive Models

matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow

Last synced: 03 Feb 2025