An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with sklearn-library

A curated list of projects in awesome lists tagged with sklearn-library .

https://github.com/transitive-bullshit/scikit-learn-ts

Powerful machine learning library for Node.js – uses Python's scikit-learn under the hood.

ai machine-learning ml scikit-learn sklearn sklearn-library

Last synced: 04 Apr 2025

https://github.com/heidelbergcement/hcrystalball

A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosystem.

cross-validation data-science fbprophet model-selection pmdarima sarimax sklearn sklearn-api sklearn-compatible sklearn-library sktime statsmodels tbats time-series time-series-forecasting transformer wrapper

Last synced: 05 Apr 2025

https://github.com/dadaloop82/MyHomeSmart-HASS-AppDeamon

I have a dream: to give HomeAssistant the ability to reason and perform actions logically and comprehensively, using AppDaemon, Sklearn, and Decision Tree.

appdeamon automation hassio history home-assistant home-automation homeassistant influxdb machine-learning pandas python sklearn sklearn-library

Last synced: 06 Apr 2025

https://github.com/ksachdeva/scikit-nni

AutoML - Hyper parameters search for scikit-learn pipelines using Microsoft NNI

automl hyperparameter-search hyperparameters neural-network-intelligence nni scikit-learn scikit-learn-api sklearn sklearn-library tool

Last synced: 23 Oct 2025

https://github.com/azaz9026/medicine-recommendation-system

A Medicine Recommendation System in machine learning (ML) is a software application designed to assist healthcare professionals and patients in selecting the most appropriate medication based on various factors such as medical history, symptoms, demographics, and drug interactions

api data-preprocessing eda encoding flask machine-learning render-template sklearn-library statistics

Last synced: 10 Apr 2025

https://github.com/cg1507/quickcnn

QuickCNN is high-level library written in Python, and backed by the Keras, TensorFlow, and Scikit-learn libraries. It was developed to exercise faster experimentation with Convolutional Neural Networks(CNN). Majorly, it is intended to use the Google-Colaboratory to quickly play with the ConvNet architectures. It also allow to train on your local system.

bottleneck-features cnn-training convolutional-neural-network deep-learning fine-tuning-cnns google-colaboratory image-classification keras sklearn-library tensorboard tensorflow transfer-learning

Last synced: 31 Oct 2025

https://github.com/syamkakarla98/linear-regression

Implementation of Linear regression on Boston House Pricing and Diabetes data sets using python.

linearregression pyhton3 sklearn-library

Last synced: 03 May 2025

https://github.com/ewenwan/kaggle

kaggle 比赛 使用sklearn进行kaggle数据竞赛基础及实践

kaggle sklearn-library

Last synced: 07 Apr 2025

https://github.com/jamesgeorge007/gender-classifier

A machine learning model which predicts the gender based on the data with which it is trained.

classifier machine-learning python3 sklearn-library

Last synced: 11 Aug 2025

https://github.com/jimmymugendi/email-sms-spam_classifier

Email SMS Spam Classifier is a cutting-edge machine learning solution designed to combat spam messages in email and SMS communications. Leveraging advanced Natural Language Processing (NLP) techniques, the system preprocesses text data by tokenizing, removing stopwords, and stemming, ensuring the most accurate classification results.

data-analysis data-visualization machine-learning-algorithms pandas seaborn-plots sklearn-library

Last synced: 25 Sep 2025

https://github.com/shridhar1504/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-cleaning data-science data-testing data-visualization forecasting-models machin model-evaluation model-fitting prediction predictive-modeling python3 regression-algorithms salesforecast sklearn-library supervised-learning

Last synced: 30 Oct 2025

https://github.com/shaik-sohail-72/iris-flower-type-prediction-and-classification-with-ml-and-mern

Iris flower type prediction and classification with machine learning and MERN web I/O system. This project predict the type of iris flower by using machine learning . K-NN algorithm is used for multiclass classification.

css ejs expressjs google-oauth2 heroku-deployment html iris-dataset javascript knn-algorithm knn-classification knn-classifier machine-learning nodejs pandas python sklearn-library

Last synced: 05 Oct 2025

https://github.com/ranjan2104/uber-rides_prediction-by-using-ml---flask

Uber Technologies, Inc., commonly known as Uber, is an American technology company. Its services include ride-hailing, food delivery (Uber Eats), package delivery, couriers, freight transportation, and, through a partnership with Lime, electric bicycle and motorized scooter rental. The company is based in San Francisco and has operations in over 900 metropolitan areas worldwide. It is one of the largest firms in the gig economy. so that i Make this Project so company can pred there weekly as well as monthly rides pred that can company help to mange there rides info correctly

flask gunicorn-web-server jinja2-templates machine-learning numpy pandas pickel python request sklearn-library

Last synced: 26 Feb 2025

https://github.com/ranjan2104/data-science-and-machine-learning-series

This is what is called by the much talked about term, Big Data. The basis to any attempt to answer the question of which to learn first between Data Science or Machine Learning should be Big Data. Why this is so is very simple. It is on Big Data that both Data Science and Machine Learning are built.

data-science jupyter-notebook machine-learning machine-learning-algorithms python3 sklearn-library

Last synced: 26 Feb 2025

https://github.com/sarthak-1408/water-potability

In this repo, I am doing data analysis in water potability and check each and every classification model's accuracy.

catboost-model classification classification-algorithm sklearn sklearn-classify sklearn-library water-quality

Last synced: 14 Apr 2025

https://github.com/abeed04/hotel-bookings-prediction-using-machine-learning

Random forest algorithm can be used to analyze hotel booking data and predict booking behavior. This allows hotels to optimize pricing strategies, staffing, and identify potential cancellations for proactive guest communication.

flask numpy pandas-dataframe pickle-file pycharm-ide python randomforestalgorithm sklearn-library sklearn-metrics

Last synced: 06 Apr 2025

https://github.com/j0fin/iris-says

A minimalist platform for learning, understanding and realising Iris Flower Classification.:cherry_blossom:

ai education educational-project educational-tool flask flask-application machine-learning plotly plotly-express pycharm-ide sklearn-library visualization website

Last synced: 06 Apr 2025

https://github.com/mohammadmoataz2/analysis-and-ml-projects-using-python

This repository contains a collection of analysis and machine learning projects implemented in Python. The projects cover various domains and utilize different techniques to gain insights from data and build predictive models.

anaylsis artificial-intelligence machine-learning pandas-python python seaborn sklearn sklearn-library

Last synced: 04 Oct 2025

https://github.com/camilajaviera91/bagging-with-kaggle

Code in which an initial approach to decision trees and bagging will be made, and an attempt will be made to ensure that the model can be trained with any dataset coming from Kaggle (for this, we will again use the 'connect with Kaggle' project).

accuracy-score bagging-classifier curses decision-tree-classifier kaggle labelencoder pandas python simpleimputer sklearn-library train-test-split

Last synced: 07 Sep 2025

https://github.com/rvigneshwaran/sklearn-implementations

This repository contains the implementation of various algorithms in Machine learning using sklearn library

datascience datascience-machinelearning machine-learning-algorithms machine-translation sklearn sklearn-library

Last synced: 11 Mar 2025

https://github.com/mbalatsko/mlmr

This library will help you easily parallelize your python code for all kind of data transformations in MapReduce fashion.

mapreduce ml parallel parallel-computing sklearn-library

Last synced: 14 Mar 2025

https://github.com/praju-1/machine_learning

Exploring Machine learning with its supervised and unsupervised algorithm and subtypes also.. All algorithm implemented in python With proper description of each Dataset used.

machine-learning machine-learning-algorithms matplotlib numpy pandas-library python sklearn-library statistics

Last synced: 02 Apr 2025

https://github.com/artzaragozagithub/nlp--p6_sentiment_analysis_and_summarization_of_stock_news

Natural Language Processing AI-model driven sentiment analysis system that will automatically process and analyze news articles to gauge market sentiment, and summarizing the news at a weekly level to enhance the accuracy of their stock price predictions and optimize investment strategies.

classifier-training confusion-matrix decisiontreeclassifier eda glove-embeddings gridsearchcv keyedvectors llama mistral-7b myplot nlp-keywords-extraction numpy-library pandas-library prompt-engineering sentiment-analysis sklearn-library text-processing text-summarization transformers-models word2vec

Last synced: 06 Apr 2025

https://github.com/gourab-sinha/machine_learning

This repository consists of Machine Learning Concepts and Projects.

classification machine-learning neural-network numpy pandas sklearn-library

Last synced: 22 Aug 2025

https://github.com/rickydoan/machine-learning-risk-model-prediction-classification

This project leverages machine learning to provide insights into loan and credit risk. By analyzing user-provided financial data, it predicts the likelihood of loan default, generates a credit score, and assigns a risk rating. Designed to assist financial institutions and individuals in making informed decisions

classification joblib machine-learning numpy pandas python sklearn-library streamlit

Last synced: 05 May 2025

https://github.com/vineet416/chronic-kidney-disease-prediction

This repository contain code of Chronic Kidney Disease Detection Prediction Project. The goal of this project is predict the chronic kidney disease using parameters like Diabetes Mellitus, Blood Urea, Sugar, Hypertension etc.. I used multiple machine learning algorithms with hyperparameter tuning which is having highest accuracy score of 97.5

data-visualization data-wrangling exploratory-data-analysis feature-engineering feature-selection hyperparameter-tuning machine-learning matplotlib numpy pandas plotly pre-processing python seaborn sklearn-library statsmodels

Last synced: 23 Mar 2025

https://github.com/jaybfn/semantic_segmentation_project

This project was build for segmenting different body parts of a mosquito for identifying various species and combat vector-borne diseases.

computer-vision convolutional-neural-networks keras-tensorflow machine-learning neural-network python3 segmentation-models sklearn-library tensorflow2

Last synced: 23 Feb 2025

https://github.com/fayzi-dev/machin_learning

Machin Learning Full Algorithm (Linear Regression, Decision tree, Random forest, Neural network ,Logistic regression ,Support vector machine ,Naive Bayes ,Clustering, XGBoost,DBscan,KMeans)

algorithms artificial-neural-networks logistic-regression machine-learning matplotlib matplotlib-pyplot naive-bayes-classifier numpy pandas python python3 seaborn seaborn-plots sklearn sklearn-knn sklearn-library sklearn-linear-model sklearn-linear-regression sklearn-metrics sklearn-svm

Last synced: 09 Apr 2025

https://github.com/hamidhosen42/a-graph-machine-learning

Graph-Machine-Learning. Graph Machine Learning provides a new set of tools for processing network data and leveraging the power of the relation between entities that can be used for predictive, modeling, and analytics tasks.

ipynb-jupyter-notebook machine-learning-algorithms mathplotlib numpy pandas python sklearn-library

Last synced: 02 Mar 2025

https://github.com/athari22/house_sales_in_king_count_usa

The idea of the project is to do a Data analysis in a Real Estate Investment Trust. The Trust would like to start investing in Residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 30 Oct 2025

https://github.com/subhadipsinha722133/bitcoin-price-prediction-web-app

A Streamlit-based web application for predicting Bitcoin price movements using machine learning models

data-visualization machine-learning-algorithms matplotlib-pyplot sklearn-library

Last synced: 09 Oct 2025

https://github.com/kalebers/economic_analysis_data_science

Data Analysis Python project using economic data base to predict percentage of good and bad payers

data-analysis data-science machine-learning pandas python scipy sklearn-library

Last synced: 15 Mar 2025

https://github.com/y-india/retail-sales-analysis-project

Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.

ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit

Last synced: 20 Nov 2025

https://github.com/pabvald/data-mining

Implementation of different Machine Learning algorithms within the Data Mining course (2019/2020) of the University of Valladolid

jupyter-notebook keras machine-learning python3 sklearn-library

Last synced: 24 Oct 2025

https://github.com/adityashinde716/liam-the-healthcare-chatbot

Made a healthcare chatbot Liam which will help you to detect your health problem and gives precautions.

csv numpy pandas python python3 random random-forest random-forest-classifier requests sklearn sklearn-library sklearn-model

Last synced: 19 Nov 2025

https://github.com/gauravsakure02/project_titanic_eda

This Project includes Exploratory Data Analysis of the Titanic Dataset

eda numpy pandas-dataframe sklearn-library

Last synced: 14 Oct 2025

https://github.com/harsh0713/sms-spam-classification

The "SMS Spam Classification" project aims to develop a machine learning model to automatically identify and classify SMS messages as either spam or legitimate (ham).

bernoulli gaussian-naive-bayes jupyter-notebook multinomial-naive-bayes nltk-python punkt python sklearn-library stopwords streamlit string

Last synced: 18 Oct 2025

https://github.com/sorna-fast/breast-cancer-diagnosis-neural-network

ANN-based breast cancer classifier using the Wisconsin Diagnostic Dataset. Implements advanced feature engineering and achieves 98.25% test accuracy. Includes comprehensive EDA, model training, and clinical impact analysis

keras-classification-models keras-neural-networks keras-tensorflow matplotlib-pyplot pandas-dataframe scikit-learn seaborn-plots sklearn-library tensorflow

Last synced: 06 Aug 2025

https://github.com/lijesh010/ml_project_car_price_prediction_using_linearregression

This repository presents a data-driven exploration into predicting car prices using a machine learning model based on linear regression, aimed at aiding a Chinese automobile company's entry into the competitive US market.

car-price-prediction-with-machine-learning data-science datapreprocessing jupyter-notebook linear-regression machine-learning-algorithms python sklearn-library

Last synced: 02 Aug 2025

https://github.com/mayankmittal29/vaccineclassifier

This machine learning project builds a classifier for 2 types of Vaccines by loading the dataset , doing the Exploratory Data Analysis and feature engineering and then using 5 different ML algos to predict and find best of them using ROC-AUC metrics

machine-learning-algorithms pandas python3 seaborn sklearn-library

Last synced: 24 Feb 2025

https://github.com/antonio-f/multilabel-classification

Predict tags on StackOverflow with linear models - Week 1 assignment of Coursera's Natural Language Processing course from the Advanced Machine Learning Specialization.

bag-of-words logistic-regression multilabel-classification nltk-library one-vs-rest sklearn-library tfidf tfidf-vectorizer

Last synced: 30 Mar 2025

https://github.com/anvesham/predicting_customer_churn_neural_network

A Comparative Study in Customer Churn Prediction through Multilayer Perceptrons and Support Vector Machines

multilayer-perceptron python pytorch roc-curve sklearn-library smoteenn support-vector-machine voting-classifier

Last synced: 05 Mar 2025

https://github.com/ahmedabdalkreem/breast-cancer

It help to know the patient have breast cancer or not and show some of diagram analysis data to know what depenent in this disease that lead to this disease.

matplotlib numpy pandas python sklearn-library

Last synced: 06 Mar 2025

https://github.com/margaretkhendre/crypto-clustering-vs-unsupervised-machine-learning-challenge

In this repository, Google Colaboratory is paired with Unsupervised Machine Learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.

colab-notebook holoviews hvplot jupyter-notebook pandas pathlib python sklearn-library unsupervised-machine-learning

Last synced: 26 Feb 2025

https://github.com/muhammedrahil/trained-healthcare-chat-bot

The field of healthcare is vast, complex, and ever-evolving. Providing accurate and accessible information to patients and professionals alike is crucial for improving healthcare outcomes and fostering a well-informed community. However, sifting through extensive medical literature and resources can take time and effort, even for experts in the fie

chatbot helthcare prediction prognosis python sklearn sklearn-library

Last synced: 30 Jun 2025

https://github.com/abhay-sinha-0/carpricepredictionproject

A machine learning project that predicts the selling price of a car based on its features such as year, mileage, fuel type, transmission, and more. This model can assist individuals and dealerships in estimating fair market prices for used cars.

artificial-intelligence data-analysis data-science data-visualization exploratory-data-analysis machine-learning-algorithms matplotlib-pyplot mysql-database numpy-library pandas-library python skit-learn sklearn-library

Last synced: 15 May 2025

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 07 Apr 2025

https://github.com/darkdk123/bulldozers-price-prediction

Utilizing bulldozer's auction sale price Dataset to predict the Price of Bulldozers!

data-science machine-learning matplotlib-pyplot pandas plotly python3 regression-models sklearn-library

Last synced: 17 Nov 2025

https://github.com/johannaschmidle/house-price-predictor

A machine learning model to accurately predict house prices based on various features such as quality, size, and location, utilizing Random Forest and XGBoost algorithms (Python)

anova-test cross-validation house-price-prediction machine-learning onehot-encoding ordinal-encoding python random-forest random-forest-regressors sklearn sklearn-library target-encoding visualization xgboost xgboost-model

Last synced: 07 Apr 2025

https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy

This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.

data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit

Last synced: 16 May 2025

https://github.com/balajimohan18/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-science data-testing data-visualization forecasting forecasting-models machine-learning model-evaluation predictive-modeling python regression-algorithms salesforecast scipy sklearn-library supervised-learning

Last synced: 04 Mar 2025

https://github.com/ahmedabdalkreem/skin-cancer

In this project we work to extraction features from Images using CNN and build the Neural Network to arrive the patient have skin-cancer Malignant or Benign.

cnn deep-learning matplotlib neural-network numpy pandas python3 sklearn-library

Last synced: 02 Sep 2025

https://github.com/alisson-t-bucchi/pizza_price_predictor_ml

Project using Machine Learning and LLM from .csv file to predict a pizza value based on each ingredient added.

llm-training machine-learning pandas-library personal-project python sklearn-library streamlit

Last synced: 24 Mar 2025

https://github.com/4702chahat/rock-vs-mine

This Project is based on Machine Learning which uses Logistic Regression model for predicting whether the object detected by Submarine is Rock or Mine

accuracy-score data-science deep-learning jupyter-notebook logestic-regression machine-learning numpy-arrays pandas-dataframe predicitve predictive-model python rock-vs-mine sckit-learn sklearn-classifier sklearn-library sklearn-metrics

Last synced: 24 Mar 2025

https://github.com/mustafadanabasi/python-onehotencoder-sample

OneHotEncoder : Kategorik verileri "binary" (0 ve 1) sütunlarına ayırır. Her benzersiz kategori için ayrı bir sütun oluşturur. Özellikle sıralı olmayan kategorik değişkenler için uygundur.

onehotencoder python sklearn-library

Last synced: 15 Mar 2025

https://github.com/neerajcodes888/data-science

This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science

data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit

Last synced: 02 Jul 2025

https://github.com/sahiltiwariiii/dssp

Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 30 Dec 2025

https://github.com/margaretkhendre/credit-risk-classification-vs-supervised-machine-learning-challenge

In this repository, Google Colaboratory is paired with Supervised Machine Learning to evaluate a model based on loan risk. A dataset of historical lending activity is used to build a model that can identify the creditworthiness of borrowers.

colab-notebook jupyter-notebook pandas python sklearn-library supervised-machine-learning

Last synced: 26 Feb 2025

https://github.com/nxhawk/mln

Machine learning is a branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

decision-tree-classifier genetic-algorithm jupyter-notebook machine-learning-algorithms numpy pandas sklearn-library

Last synced: 27 Nov 2025

https://github.com/dineshdhamodharan24/industrial-copper-modeling

It seems like you have a project that involves modeling industrial copper data using Python and several libraries such as pandas, numpy, and scikit-learn. This is a common and practical approach, as these libraries are widely used for data manipulation, analysis, and machine learning tasks.

bussiness-solution numpy pandas pickle pickle-file seaborn sklearn-library streamlit

Last synced: 23 Apr 2025

https://github.com/harshineeshree/machine-learning

About This Repository A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.

data-analysis data-engineering data-science database feature-engineering feature-extraction feature-selection machine-learning-algorithms python3 pytorch-implementation sklearn-library

Last synced: 23 Apr 2025

https://github.com/murilobellatini/newspapers-text-mining

Study case with end-to-end Data Science project for classifying Newspapers' articles. From raw Data Extraction up to deployed Text Classifier inside a containerized API.

api-rest data-science sklearn-library text-mining word2vec xgboost

Last synced: 07 Jan 2026

https://github.com/pranava007/ai_ml_knn_classification

K Nearest Neighbor Classification

knn python3 sklearn-library

Last synced: 20 Feb 2025

https://github.com/pranava007/ai_in_business_intelligence_and_analytics_ml

AI_in_Business_Intelligence_and_Analytics

pandas python3 sklearn-library

Last synced: 20 Feb 2025