Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/ymorsi7/hatespeechnlp

Detecting and analyzing hate speech on videos relating to sexism on a right-wing platform (NLTK, scikit-learn, pandas).

decision-tree-classifier nlp nlp-machine-learning nltk-python pandas scikit-learn tf-idf

Last synced: 23 Nov 2024

https://github.com/siam29/exploring-explainable-ai-demystifying-dt-rf-knn-xgbc

Implemented XAI techniques to enhance transparency in fraud detection models. I applied SHAP and LIME to DT, RF, XGBC, and KNN models to offer clear explanations for flagged transactions.

machine-learning matplotlib pandas scikit-learn xai

Last synced: 06 Dec 2024

https://github.com/troublem1/mle

MultiLabel-Transformer (MLE) is an extended version of LabelEncoder that encodes multiple categorical columns as numeric values in any workflow or pipeline.

packages python3 scikit-learn sklearn

Last synced: 01 Dec 2024

https://github.com/kingabzpro/mlops-with-jenkins

From data ingestion to deploying the model using Jenkins.

classification fastapi jenkins mlops scikit-learn

Last synced: 13 Oct 2024

https://github.com/corentinth/ml-gender_classification

[Machine Learning] The Hello World of Machine Learning using sklearn

body-metrics gender-classification machine-learning scikit-learn

Last synced: 20 Dec 2024

https://github.com/spamfromaditya/drugs-consumption-prediction-model-eda-bagging-classifier

Drug consumption prediction models are like crystal balls for public health. By analyzing vast amounts of data, these models can identify individuals or communities at higher risk of drug use. They consider factors like demographics, social media activity, prescription history, and even economic indicators.

bagging-classifier machine-learning matplotlib numpy python scikit-learn

Last synced: 31 Dec 2024

https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects

A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.

data-analysis data-science dataframes exploratory-data-analysis pandas python scikit-learn visualization

Last synced: 07 Nov 2024

https://github.com/francescopaolol/sentimentanalysis

Sentiment analysis on the IMDB Dataset of 50K Movie Reviews.

jupyter-notebook kaggle machine-learning ml pandas scikit-learn sentiment-analysis

Last synced: 22 Dec 2024

https://github.com/carmoreno/analisisaccidentalidadbogota

Data analysis of traffic accidents in Bogotá, Colombia.

data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn

Last synced: 05 Jan 2025

https://github.com/shaadclt/data-preprocessing-pipeline

This project contains a data preprocessing pipeline implemented in Python using the pandas and numpy libraries. The pipeline handles missing values, outliers, and normalizes numeric features in a dataset.

numpy pandas scikit-learn

Last synced: 07 Dec 2024

https://github.com/francescopaolol/decisiontree

Classifying iris plants into three species using the classic Iris dataset.

decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighborsClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 01 Dec 2024

https://github.com/francescopaolol/favoritatimeseriesforecasting

See: https://www.kaggle.com/competitions/store-sales-time-series-forecasting

jupyter-notebook kaggle-competition machine-learning pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/jdavydovportfolio/careerpredictor

A project leveraging AI and Machine Learning (Logistic Regression) to predict graduate job placements. Includes data preprocessing, exploratory analysis, and predictive modeling.

artificial-intelligence exploratory-data-analysis jupyter-notebook linear-regression logistic-regression machine-learning machine-learning-algorithms machine-learning-models matplotlib ml numpy pandas pandas-dataframe predictive-modeling programming python scikit-learn

Last synced: 07 Dec 2024

https://github.com/m-rishab/credbet

A loan prediction web app that tells you whether or not you are eligible for a loan.

decision-tree-classifier matplotlib numpy pandas python scikit-learn

Last synced: 21 Nov 2024

https://github.com/3rd-son/knn-breast-cancer-prediction-model-

A breast cancer prediction model using KNN with an accuracy of 96%

jupyter-notebook knn-classifier matplotlib numpy pandas python scikit-learn scipy search

Last synced: 28 Dec 2024

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal is to predict the sale price of bulldozers: how well can we predict the future sale price of a bulldozer, given its characteristics and previous examples of how much similar bulldozers have sold for? (Archived Kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 01 Dec 2024

https://github.com/bestmahdi2/uni__dataminningstackoverflowproject

A university project for a data mining course, working on Stack Overflow website data with Python.

cart csv data-mining logistic-regression matplotlib mlp naive-bayes nltk numpy pandas python scikit-learn scipy seaborn stackoverflow svc textblob tqdm xgboost

Last synced: 14 Oct 2024

https://github.com/lucasfrag/dengue-prediction-knc

Project developed to predict dengue cases using the KNeighborsClassifier classification algorithm.

data-science knearest-neighbor-classifier machine-learning pandas python scikit-learn

Last synced: 17 Nov 2024

https://github.com/samarpan-rai/serveitlearn

Creates an extremely thin layer around the FastAPI library that lets you set up an endpoint very quickly.

fastapi inference ml pypi scikit-learn

Last synced: 17 Dec 2024

https://github.com/ax-va/numpy-pandas-matplotlib-scikit-learn-vanderplas-2023

These examples provide an introduction to Data Science and classic Machine Learning using NumPy, Pandas, Matplotlib, and scikit-learn. They are taken, with some changes, from the book "Python Data Science Handbook: Essential Tools for Working with Data", Second Edition, written by Jake VanderPlas and published by O'Reilly Media in 2023.

ax-va classic-machine-learning data-science machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 18 Nov 2024

https://github.com/francescopaolol/titaniccompetition

My first Kaggle competition: predicting survival on the Titanic and getting familiar with ML basics.

jupyter-notebook kaggle-competition machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/somjit101/human-activity-recognition

This project builds a model that predicts human activities such as Walking, Walking Upstairs, Walking Downstairs, Sitting, Standing, or Laying, using readings from the sensors on a smartphone carried by the user.

decision-tree-classifier eda feature-engineering gradient-boosting-classifier grid-search human-activity-recognition keras logistic-regression lstm random-forest-classifier rbf-kernel scikit-learn seaborn-plots signal-processing support-vector-classifier support-vector-machine t-sne tensorflow uci-har-dataset uci-machine-learning

Last synced: 16 Nov 2024

https://github.com/bistcuite/plainml

Painless machine learning library for Python, based on scikit-learn

machine-learning ml plainml python scikit-learn

Last synced: 08 Dec 2024

https://github.com/mgobeaalcoba/data_champions_meli

Algorithms and work carried out within the framework of data champions by Mercado Libre

algorithms canvas classification clustering data-science machine-learning python3 scikit-learn

Last synced: 19 Nov 2024

https://github.com/mgobeaalcoba/linear_algebra_for_machine_learning

Explore fundamental linear algebra concepts essential for machine learning in this repository, with code examples and explanations. Get a solid foundation for ML!

machine-learning matplotlib numpy pandas python3 scikit-learn scipy seaborn

Last synced: 19 Nov 2024

https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.

matplotlib numpy pandas python3 scikit-learn seaborn titanic-dataset titanic-kaggle titanic-survival-prediction

Last synced: 19 Nov 2024

https://github.com/aditya172926/text_summarization

Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.

glove machine-learning nlp python scikit-learn spacy

Last synced: 24 Nov 2024

https://github.com/skekre98/picture-compressor

A tool for compressing images using unsupervised machine learning

kmeans-clustering scikit-learn

Last synced: 21 Nov 2024

https://github.com/upul/chocolate-quality-analysis

This repository contains a Jupyter notebook that describes how to use basic machine learning tools such as scikit-learn, pandas, and NumPy for building models.

machine-learning numpy pandas predictive-analytics scikit-learn

Last synced: 18 Nov 2024

https://github.com/palak-463/tablataalrecognitionsystem

Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument.

cnn deep-learning flask fnn librosa numpy os pickle python scikit-learn

Last synced: 11 Nov 2024

https://github.com/guoshijiang/scikit-learn

Learning scikit-learn together with you

nlp-machine-learning scikit-learn

Last synced: 24 Nov 2024

https://github.com/ewertondrigues02/previsao-de-vendas

Sales forecasting for a fictitious company, with analysis carried out using tools such as Jupyter Notebook, Google Colab, and Python, along with machine learning libraries and techniques such as linear regression, decision trees, and scikit-learn.

analise-de-dados analise-exploratoria arvore-de-decisao ciencia-de-dados colab excel google-colab jupyter jupyter-notebook machine-learning previsao previsao-de-vendas python3 regressao-linear scikit-learn

Last synced: 18 Nov 2024

https://github.com/canayter/unsupervised-machine-learning

Utilizing Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.

k-means-clustering python scikit-learn unsupervised-machine-learning

Last synced: 11 Nov 2024

https://github.com/khaymanii/multiple-disease-prediction-system

This system predicts whether a patient has heart disease, Parkinson's disease, or diabetes.

matplotlib numpy pandas python scikit-learn

Last synced: 20 Nov 2024

https://github.com/rs2416/Detecting_Social_Anxiety

This repository contains the full dataset and code needed to recreate the classification models and reproduce the results within this paper: https://formative.jmir.org/2021/10/e32656/

jupyter-notebook machine-learning python scikit-learn social-anxiety

Last synced: 14 Nov 2024

https://github.com/vaibhavs10/learn-ml

Modified notebooks (single) from kaggle.com/learn with added nuances

decision-trees machine-learning pandas random-forest scikit-learn

Last synced: 19 Dec 2024

https://github.com/lfenzo/ml-solar-sao-paulo

Implementation of a scientific project on the use of machine learning for solar radiation prediction

forecasting machine-learning python scikit-learn

Last synced: 17 Nov 2024

https://github.com/dinhanhx/determination

Scripts to set global random state for several machine learning frameworks

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 30 Nov 2024

https://github.com/tomgorb/ds-utils

pre-processing of a DataFrame into a sparse matrix for model input

machine-learning preprocessing scikit-learn

Last synced: 11 Nov 2024

https://github.com/kr1shnasomani/greendetect

Detection of plant diseases using CNN (ResNet50 architecture)

computer-vision deep-learning keras neural-network numpy opencv pandas scikit-learn seaborn tensorflow

Last synced: 19 Dec 2024

https://github.com/farrajota/kaggle_titanic

My solutions to the "Titanic: Machine Learning from Disaster" kaggle competition

docker docker-compose kaggle kaggle-competition kaggle-titanic notebook pyspark python scikit-learn

Last synced: 17 Nov 2024

https://github.com/sralter/happy_customers

Predicting whether a customer is happy based on the results from a survey.

eda ensemble-classifier hyperopt lazypredict ml scikit-learn

Last synced: 17 Nov 2024

https://github.com/sralter/classifire

Wildfire Prediction Model: Samuel Alter's BrainStation 2023 Data Science Capstone Project

qgis scikit-learn tensorflow

Last synced: 17 Nov 2024

https://github.com/ksmooi/mscs_big_data_match_predictor

The Match Predictor project uses various machine learning models to predict soccer match outcomes, leveraging historical data and advanced algorithms to forecast results. The system includes a backend in Python with Flask and a frontend in TypeScript with React.

flask machine-learning python scikit-learn

Last synced: 14 Dec 2024

https://github.com/kingabzpro/ml-workflow-orchestration-with-prefect

An introductory project to streamline the machine learning pipeline using Prefect and Discord Notifications, from data ingestion to model saving

discord mlops prefect scikit-learn

Last synced: 17 Nov 2024

https://github.com/alisonmitchell/boston-housing

Investigation of the Boston housing dataset to evaluate, train and test a regression model to predict house prices.

data-science machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 15 Nov 2024

https://github.com/ahmedshahriar/telco-customer-churn-prediction-streamlit-app

This Streamlit app predicts the churn rate using gradient boosting models (XGBoost, CatBoost, LightGBM) on the IBM Customer Churn Dataset

binary-classification binary-classifiers data-science jupyter-notebook machine-learning pandas python scikit-learn sklearn stacking-ensemble streamlit streamlit-webapp

Last synced: 16 Nov 2024

https://github.com/drcbeatz/machine-learning-tool

Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.

data-science data-visualization django machine-learning python scikit-learn

Last synced: 12 Nov 2024

https://github.com/strcoder4007/machine-learning-deep-learning-practice

Implementation of Linear/Logistic Regression, K-NN, SVM, Clustering, K-Means, ConvNet, ResNet, MobileNet, RNN, LSTM, etc. using pandas, scikit-learn, NumPy, and TensorFlow 2

convolutional-neural-networks matplotlib scikit-learn tensorflow2

Last synced: 12 Nov 2024

https://github.com/kingabzpro/github-actions-for-machine-learning-beginners

A project on automating ML workflows using scikit-learn pipelines, CML, and GitHub Actions.

cml github-actions machine-learning mlops scikit-learn

Last synced: 17 Nov 2024

https://github.com/jordandeklerk/pygridge

A scikit-learn compatible Python package for data-driven group regularized ridge regression

python regression regularized-regression scikit-learn

Last synced: 19 Dec 2024

https://github.com/lechemi/machine-learning-vademecum

A notebook containing basic concepts and practical examples of machine learning in Python.

machine-learning python scikit-learn

Last synced: 19 Dec 2024

https://github.com/snehilsanyal/ee524

Course webpage for IIT Guwahati EE524 Machine Learning Lab (Jul-Nov 2020) Session

course-webpage machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 10 Dec 2024

https://github.com/vatshayan/hospital-discharge-analysis

Analysis of hospitalization discharge rates in Lake County, Illinois, across attributes such as anxiety, alcohol, mood, diabetes, and asthma.

data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn

Last synced: 15 Nov 2024

https://github.com/chrislemke/scikit-tabtrans

TabTransformer ready for scikit-learn 🧑‍🔬

deep-learning machine-learning python scikit-learn transformer

Last synced: 09 Nov 2024

https://github.com/udityamerit/all_mlpackages

The `all_MLpackage` is a Python tool for breast cancer detection using scikit-learn's machine learning algorithms. It automates model training, generates accuracy comparisons, and helps identify the best-performing algorithm for diagnosis.

classification knn-algorithm logistic-regression machine-learning numpy pandas scikit-learn svm-model

Last synced: 31 Oct 2024

https://github.com/ayuhamaro/py_weight_estimation

Body weight change estimation

matplotlib numpy pandas python3 scikit-learn scipy

Last synced: 14 Nov 2024

https://github.com/mehuaniket/blog-classifier

Blog classifier using a scikit-learn random forest.

bag-of-words blog-classifier python scikit-learn

Last synced: 04 Jan 2025

https://github.com/george-gca/ai_papers_search_tool

Automatic paper clustering and search tool using fastText from Facebook Research

fasttext fasttext-embeddings fasttext-python nlp python scikit-learn

Last synced: 14 Nov 2024

https://github.com/jersongb22/computervision

Links to my repositories with a wide variety of Computer Vision models using CNNs, Transfer Learning, and Vision Transformer with TensorFlow, PyTorch, Hugging Face and Ultralytics.

cnn computer-vision efficientnetv2 hugging-face image-captioning image-classification image-segmentation lenet-5 object-detection opencv plotly python pytorch scikit-learn tensorflow tensorflow-hub ultralytics video-classification vision-transformer yolo11

Last synced: 05 Dec 2024

https://github.com/kookmin-sw/capstone-2023-29

์ž๋ฆฌ์žˆ์–ด? - ๊ฒฝ๊ธฐ๋„ ๊ด‘์—ญ๋ฒ„์Šค ์ขŒ์„์˜ˆ์ธก ์‹œ์Šคํ…œ

fastapi lstm postgresql python3 pytorch react scikit-learn sqlalchemy

Last synced: 13 Nov 2024

https://github.com/pr38/tensorbspline

A scikit-learn compatible implementation of Polynomial Spline/B-Spline as well as a Multidimensional/Tensor spline variation.

basis-expansion feature-engineering feature-extraction machine-learning scikit-learn

Last synced: 04 Jan 2025

https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction

Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.

breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn

Last synced: 22 Nov 2024