Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/nirmalyabag20/breast-cancer-prediction-using-machine-learning

This project leverages machine learning to classify breast cancer as malignant or benign based on tumor characteristics. By applying and evaluating multiple algorithms, the model achieves high accuracy, demonstrating the practical application of data-driven solutions in medical diagnostics.

logistic-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 12 Feb 2025

https://github.com/m-rishab/research-paper-recommendation

This project aims to build a research paper recommendation system. Given a paper title as input, the system provides the top 5 recommended research papers.

deep-learning flask llm nlp numpy pandas python scikit-learn sentence-transformers tensorflow torch transformers

Last synced: 22 Jan 2025

https://github.com/aahnik/gdsc-ml-ds-bootcamp-2023

This repo contains files given by my seniors as well as assignments and final project done by me during the bootcamp.

data-science machine-learning ml numpy pandas python3 scikit-learn

Last synced: 13 Feb 2025

https://github.com/bacross/datamunger

python package for handling nan's and outliers

data data-frame datamunger knn nan outliers python scikit-learn

Last synced: 02 Feb 2025

https://github.com/ishutak/disease_prediction

An AI-powered disease prediction system that uses machine learning to predict diseases based on symptoms. The system employs an ensemble of models including Random Forest and Neural Networks to provide accurate predictions with confidence levels.

css3 htlm5 javascript jquery numpy pandas pytorch scikit-learn select2

Last synced: 12 Feb 2025

https://github.com/bistcuite/plainml

Painless Machine Learning Library for python based on scikit-learn

machine-learning ml plainml python scikit-learn

Last synced: 02 Feb 2025

https://github.com/ultrasage-danz/scikit-learn-ml

Machine Learning with scikit-learn by Data School

ai data data-school machine-learning macos ml scikit-learn ultrasage-dan

Last synced: 30 Jan 2025

https://github.com/amirjahantab/iris_classification

This project analyzes the famous Iris dataset using various machine learning techniques. The goal is to classify the iris flowers into three species: Setosa, Versicolor, and Virginica based on the features provided in the dataset.

classification data-science machine-learning scikit-learn

Last synced: 11 Jan 2025

https://github.com/vidhi1290/hr_employee_prediction

"Welcome to the HR Employee Promotion Prediction project! This repository contains the code and resources for a machine learning project that focuses on predicting employee promotions. By analyzing various employee attributes, this project aims to provide valuable insights for HR decision-making and talent recognition within organizations.

data-exploration data-science data-visualization docker hr-employee-prediction hyperparameter-tuning machine-learning matplot model-building numpy pandas scikit-learn seaborn streamlit streamlit-webapp

Last synced: 02 Feb 2025

https://github.com/udityamerit/all_mlpackages

The `all_MLpackage` is a Python tool for breast cancer detection using scikit-learn's machine learning algorithms. It automates model training, generates accuracy comparisons, and helps identify the best-performing algorithm for diagnosis.

classification knn-algorithm logistic-regression machine-learning numpy pandas scikit-learn svm-model

Last synced: 31 Oct 2024

https://github.com/harshita2234/breast-cancer-prediction

A machine learning project to predict breast cancer using logistic regression. This project includes data preprocessing, feature scaling, model training, and evaluation, based on a guided project from Coursera.

breast-cancer-prediction jupyter-notebook logistic-regression machine-learning matplotlib numpy pandas scikit-learn

Last synced: 30 Jan 2025

https://github.com/tsu2000/audit_risk

Machine learning web app in Streamlit about classifying fraudulent companies using various classification models.

machine-learning plotly python random-forest scikit-learn streamlit-webapp

Last synced: 23 Jan 2025

https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects

A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.

data-analysis data-science dataframes exploratory-data-analysis pandas python scikit-learn visualization

Last synced: 07 Nov 2024

https://github.com/canayter/unsupervised-machine-learning

Utilizing Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.

k-means-clustering python scikit-learn unsupervised-machine-learning

Last synced: 09 Jan 2025

https://github.com/spamfromaditya/drugs-consumption-prediction-model-eda-bagging-classifier

Drug consumption prediction models are like crystal balls for public health. By analyzing vast amounts of data, these models can identify individuals or communities at higher risk of drug use. They consider factors like demographics, social media activity, prescription history, and even economic indicators.

bagging-classifier machine-learning matplotlib numpy python scikit-learn

Last synced: 31 Dec 2024

https://github.com/vectominist/mednlp

Mandarin Medical Dialogue Analysis with Pytorch.

dialog huggingface mandarin medical pytorch scikit-learn transformers

Last synced: 29 Jan 2025

https://github.com/jdavydovportfolio/careerpredictor

A project leveraging AI and Machine Learning (Logistic Regression) to predict graduate job placements. Includes data preprocessing, exploratory analysis, and predictive modeling.

artificial-intelligence exploratory-data-analysis jupyter-notebook linear-regression logistic-regression machine-learning machine-learning-algorithms machine-learning-models matplotlib ml numpy pandas pandas-dataframe predictive-modeling programming python scikit-learn

Last synced: 02 Feb 2025

https://github.com/alisonmitchell/boston-housing

Investigation of the Boston housing dataset to evaluate, train and test a regression model to predict house prices.

data-science machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 15 Jan 2025

https://github.com/ascender1729/salarypredictionlinearreg

SalaryPredictionLinearReg is a Python-based project utilizing linear regression to predict salaries from years of experience. It covers data loading, model training, detailed statistical analysis, and visualization of results.

data-science linear-regression machine-learning python salary-prediction scikit-learn seaborn statsmodels

Last synced: 06 Feb 2025

https://github.com/rexsimiloluwah/fastapi-ml-apps

Machine learning apps built with FastAPI

docker fastapi machine-learning python scikit-learn tensorflow

Last synced: 05 Feb 2025

https://github.com/catlikeflyer/rsp-recognition

A computer vision project to recognize thumbs up

machine-learning mediapipe-hands python scikit-learn

Last synced: 21 Jan 2025

https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction

Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.

breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn

Last synced: 23 Jan 2025

https://github.com/mehuaniket/blog-classifier

blog classifier with scikit random forest.

bag-of-words blog-classifier python scikit-learn

Last synced: 04 Jan 2025

https://github.com/fohlen/stats-experiment

A tiny stats experiment with GENESIS data

matplotlib python3 scikit-learn

Last synced: 23 Jan 2025

https://github.com/abhiramdodda/rainfall_prediction

Machine Learning model built on Telangana dataset cropped from Indian weather dataset merged with average temperature dataset

numpy pandas python3 scikit-learn scikitlearn-machine-learning

Last synced: 12 Jan 2025

https://github.com/antim21/spamsense-ai

Classifying emails into Spam or Not Spam categories using Machine Learning techniques

machine-learning nlp python scikit-learn

Last synced: 10 Feb 2025

https://github.com/ax-va/numpy-pandas-matplotlib-scikit-learn-vanderplas-2023

These examples provide an introduction to Data Science and classic Machine Learning using NumPy, Pandas, Matplotlib, and scikit-learn. They are taken, with some changes, from the book "Python Data Science Handbook: Essential Tools for Working with Data", Second Edition, written by Jake VanderPlas and published by O'Reilly Media in 2023.

ax-va classic-machine-learning data-science machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 18 Nov 2024

https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning

This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.

decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc

Last synced: 11 Feb 2025

https://github.com/aarryasutar/hate_speech_detection

This project aims to detect hate speech on Twitter using advanced NLP and machine learning techniques, exploring feature extraction methods like TF-IDF and sentiment analysis, and evaluating models such as Logistic Regression and SVM.

confusion-matrix doc2vec gensim logistic-regression matplotlib naive-bayes nltk numpy pandas python random-forest scikit-learn seaborn stemming stopwords-removal svm tf-idf-vectorizer tokenization vader word-cloud

Last synced: 12 Jan 2025

https://github.com/cristhian-sdp/lenguaje-de-senas-con-python

๐Ÿ’ปProgramaciรณn del proyecto "Lenguaje de seรฑas con Python"๐Ÿ‘Œ.

machine-learning mediapipe numpy opencv proyectosuniversitarios python scikit-learn

Last synced: 31 Jan 2025

https://github.com/monzerdev/fake-news-detection

Project implementing machine learning models to detect fake news articles. Utilizes Deep Neural Networks, Support Vector Machines (SVM), and Ensemble methods (Random Forest). Developed using Python with scikit-learn, PyTorch, and nltk.

dnn fakenewsdetection machinelearning nlp nltk python pytorch random-forest scikit-learn svm

Last synced: 13 Feb 2025

https://github.com/pr38/tensorbspline

A scikit-learn compatible implementation of Polynomial Spline/B-Spline as well as a Multidimensional/Tensor spline variation.

basis-expansion feature-engineering feature-extraction machine-learning scikit-learn

Last synced: 04 Jan 2025

https://github.com/byigitt/smartmove

fake data generation and analysis for ankara metro station

ankara cv2 metro numpy pandas scikit-learn

Last synced: 14 Feb 2025

https://github.com/kr1shnasomani/greendetect

Detection of plant diseases using CNN (ResNet50 architecture)

computer-vision deep-learning keras neural-network numpy opencv pandas scikit-learn tensorflow

Last synced: 12 Feb 2025

https://github.com/sethios-notebook/__ia_learnig__

Formation Python spรฉciale Machine Learning francais. Apprendre Python en 30 fichiers qui contiennent une formation sur Numpy, Pandas, Matplotlib, Scipy, Sklearn, Seaborn, H5py, et bien d'autres techniques. Python est le langage d'excellence pour le machine learning, le deep learning, et la data science.

iac machine-learning matplotlib numpy python scikit-learn

Last synced: 13 Feb 2025

https://github.com/3rd-son/knn-breast-cancer-prediction-model-

A breast cancer prediction model using KNN with an accuracy of 96%

jupyter-notebook knn-classifier matplotlib numpy pandas python scikit-learn scipy search

Last synced: 28 Dec 2024

https://github.com/alexsomai/machine-learning-getting-started

Dummy examples and experiments to get started with Machine Learning

artificial-intelligence deep-learning machine-learning python scikit-learn

Last synced: 04 Jan 2025

https://github.com/akhil888binoy/intelligent-supplychain-management-system

Blockchain-powered supply chain management system with ML-driven sales prediction. Streamlines supplier-employee transactions and inventory management. Built with MERN stack, Solidity, and Flask.

blockchain decentralized-payments ethereum express flask foundry hackathon-project inventory-management machine-learning mern-stack mongodb nodejs python react sales-prediction scikit-learn smart-contracts solidity supply-chain-management wagmi

Last synced: 09 Feb 2025

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 29 Jan 2025

https://github.com/bestmahdi2/uni__dataminningstackoverflowproject

A university project related to data mining lesson on StackOverflow website data with Python language

cart csv data-mining logistic-regression matplotlib mlp naive-bayes nltk numpy pandas python scikit-learn scipy seaborn stackoverflow svc textblob tqdm xgboost

Last synced: 14 Oct 2024

https://github.com/docsallover/movie-recommendation

Content-Based Movie Recommendation System In Python Using Numpy, Scikit-Learn & Pandas

data-science jupyter machine-learning numpy pandas python scikit-learn

Last synced: 15 Jan 2025

https://github.com/vatshayan/hospital-discharge-analysis

Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc

data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn

Last synced: 15 Jan 2025

https://github.com/troublem1/mle

MultiLabel-Transformer(MLE) is an extended version of a LabelEncoder, such that, it encodes multiple categorical columns to numeric in any workflow or pipeline

packages python3 scikit-learn sklearn

Last synced: 29 Jan 2025

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal of predicting the sale price of bulldozers. How well can we predict the future sale price of a bulldozer, given its characteristics previous examples of how much similar bulldozers have been sold for? (Archive kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/grachale/predict_titanik

Predicting the survival of Titanic passengers (binary classification) with usage of decision tree and KNN from scikit-learn.

classification decision-tree-classifier knn-classifier matplotlib pandas python scikit-learn titanic-survival-prediction

Last synced: 13 Jan 2025

https://github.com/lucasfrag/dengue-prediction-knc

Projeto desenvolvido para realizar previsรฃo de casos de dengue usando o algoritmo de classificaรงรฃo KNeighborsClassifier.

data-science knearest-neighbor-classifier machine-learning pandas python scikit-learn

Last synced: 18 Jan 2025

https://github.com/guoshijiang/scikit-learn

ๅธฆไฝ ไธ€่ตทๅญฆไน scikit-learn

nlp-machine-learning scikit-learn

Last synced: 24 Jan 2025

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighboursClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/soumyagautam/sign-sense

Deep Learning and Neural Network based Sign Sense or 'Sign Language' to Speech converter is an desktop app which can detect hand signs in a frame and can convert them to Speech, according to their respective meaning. Opposite to this, it can also recognise your voice and can convert it to sign language.

ai cv2 dataprocessing deep-learning keras machine-learning mediapipe moviepy-library neural-network openai-whisper scikit-learn tensorflow tkinter-python

Last synced: 30 Dec 2024

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 29 Jan 2025

https://github.com/grachale/predict_pass_exam

Creating AdaBoost classifier with decision trees for predicting whether a student will pass or fail an exam (classification) based on the number of study hours and their scores in the previous exam.

adaboost cross-validation decision-tree jupyter-notebook matplotlib python scikit-learn seaborn

Last synced: 13 Jan 2025

https://github.com/jordandeklerk/pygridge

A scikit-learn compatible Python package for data-driven group regularized ridge regression

python regression regularized-regression scikit-learn

Last synced: 11 Feb 2025

https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model

This project implements a spam detection system for SMS messages using machine learning techniques.

mathplotlib nltk numpy panda python scikit-learn seaborn

Last synced: 11 Feb 2025

https://github.com/rakshit-vasava/predictive-analytics-for-insurance-purchase

Predicting customer insurance purchases using stacking models and SMOTE for the Homesite Quote Conversion Problem on Kaggle.

k-nearest-neighbours kaggle-competition multilayer-perceptron python random-forest scikit-learn smote support-vector-machines

Last synced: 11 Feb 2025

https://github.com/somjit101/human-activity-recognition

This project is to build a model that predicts the human activities such as Walking, Walking Upstairs, Walking Downstairs, Sitting, Standing or Laying using readings from the sensors on a smartphone carried by the user.

decision-tree-classifier eda feature-engineering gradient-boosting-classifier grid-search human-activity-recognition keras logistic-regression lstm random-forest-classifier rbf-kernel scikit-learn seaborn-plots signal-processing support-vector-classifier support-vector-machine t-sne tensorflow uci-har-dataset uci-machine-learning

Last synced: 16 Jan 2025

https://github.com/dinhanhx/determination

Scripts to set global random for some machine learning framework

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 28 Jan 2025

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 28 Jan 2025

https://github.com/kostasereksonas/ids_test

Code for intrusion detection system based on "Intrusion Detection System Using Machine Learning Algorithms" tutorial on Geeksforgeeks and Intrusion Detection on NSL KDD Github repository.

ids intrusion-detection intrusion-detection-system nsl-kdd-dataset numpy pandas python scikit-learn tensorflow

Last synced: 16 Jan 2025

https://github.com/grachale/predict_life_expect

Predicting life expectancy (regression) with usage of custom random forest, linear regression and decision tree regressor from scikit-learn.

decision-tree-regression jupyter-notebook linear-regression pandas python random-forest regression scikit-learn

Last synced: 13 Jan 2025

https://github.com/techysphinx/transportation_backventures

Road Transport Network Telematics Develop a telematics solution to enable efficient trucking operations for the long haul to connect the country through route optimization, live tracking and monitoring, optimal capacity utilization analysis and to enable appropriate response. Please go through the given figma design link.

fastapi jupyter-notebook ml scikit-learn

Last synced: 10 Feb 2025

https://github.com/shridhar1504/boston-house-price-prediction-datascience-project

The Boston House Price Prediction project utilizes data science methodologies and machine learning algorithms to provide accurate predictions for housing prices in the Boston area.

boston data-science house-price-prediction machine-learning regression-algorithms regression-models scikit-learn supervised-learning

Last synced: 23 Dec 2024

https://github.com/ffstghc/caco2ml

Main code chunks used for models in the publication "Exploring the Potential of Adaptive, Local Machine Learning (ML) in Comparison ton the Prediction Performance of Global Models: A Case Study from Bayer's Caco-2 Permeability Database"

caco-2 local-models machine-learning pharmacokinetics scikit-learn

Last synced: 11 Feb 2025

https://github.com/priyanshulathi/url-based-web-phishing-detection-model

A machine learning project for detecting phishing URLs by extracting key features from web addresses and applying various classification algorithms to identify potential threats effectively.

decision-tree knn logistic-regression machine-learning python random-forest scikit-learn

Last synced: 22 Jan 2025

https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.

matplotlib numpy pandas python3 scikit-learn seaborn titanic-dataset titanic-kaggle titanic-survival-prediction

Last synced: 20 Jan 2025

https://github.com/flysirin/adstextclassification

Classification of advertisements by topic

docker excel flask pandas python pytorch scikit-learn

Last synced: 22 Jan 2025

https://github.com/daniel1kp/diamond-price-predictor

๐Ÿ’ŽDiamond Price Predictor is an user-friendly web application that simplifies the analysis and prediction of diamond prices.

altair pandas python scikit-learn streamlit

Last synced: 15 Jan 2025

https://github.com/ivanyu/kaggle-digit-recognizer

Kaggle's "Digit Recognizer" competition

kaggle keras machine-learning scikit-learn

Last synced: 01 Feb 2025

https://github.com/akashshnkr/multi-disease-prediction

Developed and integrated three machine learning models for predicting diabetes, Parkinson's, and heart disease into a Streamlit-based web application. The interface allows users to input data and receive accurate health predictions, enhancing early detection and healthcare outcomes.

logistic-regression machine-learning-algorithms numpy pandas python scikit-learn streamlit-webapp svm

Last synced: 22 Jan 2025

https://github.com/jesly-joji/house-price-prediction

House Price Prediction using Linear Regression with Scikit-learn and Flask

flask regression scikit-learn

Last synced: 22 Jan 2025

https://github.com/gauravsingh9356/machine_learning

All my practical learning work involved in MACHINE LEARNING (Data Processing to Deep Learning)

deep-learning jupyter-notebook machine-learning machine-learning-algorithms nlp-machine-learning python scikit-learn

Last synced: 02 Jan 2025

https://github.com/joshi-jyoti/heart-disease-prediction

This repository contains a Python-based project for predicting the likelihood of heart disease using a Logistic Regression machine learning model. It leverages a dataset of patient medical information to train and evaluate the model, providing insights into potential diagnoses.๐Ÿฉบ

heart-disease-prediction heart-disease-predictor kaggle-dataset machine-learning numpy pandas python scikit-learn

Last synced: 04 Feb 2025

https://github.com/ayuhamaro/py_weight_estimation

้ซ”้‡่ฎŠๅŒ–้ ไผฐ

matplotlib numpy pandas python3 scikit-learn scipy

Last synced: 15 Jan 2025

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 27 Jan 2025

https://github.com/anoopgeorge418/my-learnings

Learning Topics Related Data-Science and Web-Development From Complete Scratch.

bokeh bootstrap css folium html matplotlib numpy pandas plotly python scikit-learn seaborn

Last synced: 07 Jan 2025