Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/md-emon-hasan/6-classification-iris-ml-apps

A ML project on the classification of the Iris dataset, demonstrating data preprocessing, model training, and evaluation using Python and scikit-learn.

classification data-science iris-classification iris-dataset iris-flower-classification predictive-modeling scikit-learn

Last synced: 13 Jan 2025

https://github.com/j-i-l/tfb-prediction

Transcription factor binding prediction

bioinformatics machine-learning pandas python scikit-learn

Last synced: 20 Dec 2024

https://github.com/palak-463/tablataalrecognitionsystem

Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument. 🎛️

cnn deep-learning flask fnn librosa numpy os pickle python scikit-learn

Last synced: 09 Jan 2025

https://github.com/an-exodus/dubai-real-estate-price-prediction-ml

This repository contains a comparative analysis of machine learning algorithms to predict real estate prices in Dubai. Using data from Bayut, we evaluate Decision Tree, Linear Regression, Random Forest, and Gradient Boosting models based on their predictive accuracy.

decision-tree gradient-boosting linear-regression machine-learning random-forest scikit-learn

Last synced: 21 Dec 2024

https://github.com/juselara1/bregclus

Python implementation of Bregman Hard Clustering and Bregman Soft Clustering as a scikit-learn module.

bregman-divergence clustering numpy scikit-learn unsupervised-learning

Last synced: 31 Dec 2024

https://github.com/aryansk/fake-news-detection

A sophisticated machine learning solution to detect fake news using multiple classification algorithms. Identify the credibility of news articles with advanced text analysis techniques!

fake-news-detection machine-learning machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 01 Feb 2025

https://github.com/aarryasutar/logistic_regression_on_age_prediction

This code evaluates the performance of a logistic regression model on age prediction using various features to predict a binary target variable, calculating metrics to determine the performance. It evaluates the comparison, identifies favorable features, and visualizes the ROC-AUC curve to determine the best model performance.

accuracy-score confusion-matrix f1-score feature-selection logistic-regression model-training numpy pandas precision recall rmse roc-auc-curve scikit-learn visualization

Last synced: 21 Dec 2024

https://github.com/ayoubelmortaji/analysis-and-detection-of-threats-in-cloud-environments-with-machine-learning

Developed a machine learning-driven threat detection model for cloud environments, utilizing Random Forest (RF) and Decision Tree (DT) algorithms. Focused on analyzing network activities and identifying suspicious behaviors through Security Information and Event Management (SIEM) systems. Enhanced cloud security by addressing key challenges in dete

cloud-computing machine-learning matplotlib numpy pandas python scikit-learn seaborn siem

Last synced: 01 Feb 2025

https://github.com/tsu2000/audit_risk

Machine learning web app in Streamlit about classifying fraudulent companies using various classification models.

machine-learning plotly python random-forest scikit-learn streamlit-webapp

Last synced: 23 Jan 2025

https://github.com/ultrasage-danz/scikit-learn-ml

Machine Learning with scikit-learn by Data School

ai data data-school machine-learning macos ml scikit-learn ultrasage-dan

Last synced: 30 Jan 2025

https://github.com/md-emon-hasan/ai-from-university

🎓 Collection of academic resources, projects, and exercises related to artificial intelligence concepts learned in university coursework.

ai artificial-intelligence linear-regression logestic-regression mahcine-learning ml scikit-learn

Last synced: 13 Jan 2025

https://github.com/shliakhovai/house-price-prediction

This repository contains a complete machine learning pipeline for predicting housing prices. It includes data preprocessing, feature engineering, and model training and evaluation components, designed to provide a robust solution for regression tasks.

data-science machine-learning matplotlib numpy pandas prediction python regression scikit-learn seaborn

Last synced: 21 Dec 2024

https://github.com/harshita2234/breast-cancer-prediction

A machine learning project to predict breast cancer using logistic regression. This project includes data preprocessing, feature scaling, model training, and evaluation, based on a guided project from Coursera.

breast-cancer-prediction jupyter-notebook logistic-regression machine-learning matplotlib numpy pandas scikit-learn

Last synced: 30 Jan 2025

https://github.com/monzerdev/fake-news-detection

Project implementing machine learning models to detect fake news articles. Utilizes Deep Neural Networks, Support Vector Machines (SVM), and Ensemble methods (Random Forest). Developed using Python with scikit-learn, PyTorch, and nltk.

dnn fakenewsdetection machinelearning nlp nltk python pytorch random-forest scikit-learn svm

Last synced: 21 Dec 2024

https://github.com/h-fuzzy-logic/python-finding-nsf-award-themes

Using NLP to find themes and concepts in NSF Awards

nltk pandas python scikit-learn

Last synced: 15 Dec 2024

https://github.com/f-aguzzi/ChemFuseKit

Chemometrics library for data fusion, model training and prediction of data from multiple sensor sources.

chemometrics datafusion knn lda pca plsda scikit-learn svm

Last synced: 13 Jan 2025

https://github.com/canayter/unsupervised-machine-learning

Utilizing Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.

k-means-clustering python scikit-learn unsupervised-machine-learning

Last synced: 09 Jan 2025

https://github.com/francescopaolol/sentimentanalysis

About sentiment analysis on IMDB Dataset of 50K Movie Reviews

jupyter-notebook kaggle machine-learning ml pandas scikit-learn sentiment-analysis

Last synced: 22 Dec 2024

https://github.com/vectominist/mednlp

Mandarin Medical Dialogue Analysis with Pytorch.

dialog huggingface mandarin medical pytorch scikit-learn transformers

Last synced: 29 Jan 2025

https://github.com/oneapi-src/credit-card-fraud-detection

AI Starter Kit for Credit Card Fraud Detection model using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 05 Nov 2024

https://github.com/aryansk/customer-segmentation-analysis

Advanced customer segmentation project using K-Means clustering to analyze customer behavior based on annual income, spending score, and age.

elbow-method exploratory-data-analysis machine-learning machine-learning-algorithms python scikit-learn sentiment-analysis sentiment-classification

Last synced: 01 Feb 2025

https://github.com/francescopaolol/logisticregression

About predicting survival on the Titanic and get familiar with ML basics

jupyter-notebook kaggle logistic-regression machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/uhstray-io/pyrizon

Data Collection, Analysis, Mapping, Pipelining & Transformation, & API using Python

api data-engineering etl numpy pandas plotly python pytorch raw-data scikit-learn seaborne sql sqlite tensorflow

Last synced: 03 Feb 2025

https://github.com/francescopaolol/decisiontree

About classify iris plants into three species in this classic dataset

decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/aravindnathan02/machine-learning-projects

Machine Learning and Deep Learning projects which mainly focuses on predictive modeling.

deep-learning machine-learning neural-networks predictive-modeling python scikit-learn tensorflow

Last synced: 03 Feb 2025

https://github.com/francescopaolol/favoritatimeseriesforecasting

See: https://www.kaggle.com/competitions/store-sales-time-series-forecasting

jupyter-notebook kaggle-competition machine-learning pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/jdavydovportfolio/careerpredictor

A project leveraging AI and Machine Learning (Logistic Regression) to predict graduate job placements. Includes data preprocessing, exploratory analysis, and predictive modeling.

artificial-intelligence exploratory-data-analysis jupyter-notebook linear-regression logistic-regression machine-learning machine-learning-algorithms machine-learning-models matplotlib ml numpy pandas pandas-dataframe predictive-modeling programming python scikit-learn

Last synced: 02 Feb 2025

https://github.com/jesly-joji/spam-ham-classifier

Used Naive Bayes Algorithm, NLP Text Preprocessing Techniques

naive-bayes-classifier nlp scikit-learn streamlit text-preprocessing

Last synced: 20 Dec 2024

https://github.com/vaibhavs10/learn-ml

Modified notebooks (single) from kaggle.com/learn with added nuances

decision-trees machine-learning pandas random-forest scikit-learn

Last synced: 19 Dec 2024

https://github.com/vishal-038/attendance_by_face_recogination

This project is a face recognition-based attendance system that uses Python, OpenCV, Scikit-learn, Streamlit, and various other libraries like Pandas, Numpy, Datetime, and OS for different functionalities. It enables adding faces to the database, taking attendance based on face recognition, and showing live attendance through a web interface built

opencv python scikit-learn

Last synced: 17 Dec 2024

https://github.com/francescopaolol/titaniccompetition

It's my first kaggle competition about predict survival on the Titanic and get familiar with ML basics

jupyter-notebook kaggle-competition machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/chitralputhran/drive-curve-machine-learning-app

:blue_car: Drive Curve is a web application made with the help of Flask, a microframework for Python based on Werkzeug, Jinja 2, and good intentions. On the backend, a Machine Learning model is used for predicting the price of the car. The machine learning model was trained on the Automobile Dataset from the UCI Machine Learning Repository.

flask machine-learning python scikit-learn webapp

Last synced: 21 Jan 2025

https://github.com/chitralputhran/tutorial-sklearn-columntransformer

ColumnTransformer was introduced in scikit-learn from version 0.20 onwards. The notebook file contains a quick and easy tutorial on ColumnTransformer to get you started.

scikit-learn

Last synced: 21 Jan 2025

https://github.com/nirmalyabag20/crop-yield-prediction-using-machine-learning

This project uses machine learning to predict crop yields based on factors like region, crop type, rainfall, temperature, and pesticide use. By analyzing a dataset of over 28,000 records, the models provide accurate yield forecasts, helping optimize farming decisions and resource management, ultimately contributing to sustainable agriculture.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 19 Dec 2024

https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction

Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.

breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn

Last synced: 23 Jan 2025

https://github.com/queirozpedro/cluesuspeitos

Estudando o funcionamento do Jogo Clue Suspeitos e implementando treinamento com MLP

matplotlib mlp-classifier python scikit-learn

Last synced: 21 Jan 2025

https://github.com/fohlen/stats-experiment

A tiny stats experiment with GENESIS data

matplotlib python3 scikit-learn

Last synced: 23 Jan 2025

https://github.com/aymanmansur/insider-threat-detection-using-cert-dataset-logon-

Detecting anomalies in user logon behavior using the CERT Insider Threat Detection Dataset. This project extracts key features like session duration and logon frequency during non-working hours and applies Isolation Forest to identify suspicious activity.

matplotlib pandas python scikit-learn

Last synced: 03 Feb 2025

https://github.com/mrapp-ke/examplewisef1maximizer

A scikit-learn meta-estimator for multi-label classification that aims to maximize the example-wise F1 measure

machine-learning multilabel-classification scikit-learn

Last synced: 24 Dec 2024

https://github.com/troublem1/mle

MultiLabel-Transformer(MLE) is an extended version of a LabelEncoder, such that, it encodes multiple categorical columns to numeric in any workflow or pipeline

packages python3 scikit-learn sklearn

Last synced: 29 Jan 2025

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 29 Jan 2025

https://github.com/saro0307/pre-doctor-ai-model

Pre-Doctor is an AI-driven health advisor using sci-kit-learn, offering quick medical advice based on user-input symptoms, making healthcare accessible and user-friendly. Utilizing Flask and pyttsx3, it seamlessly integrates machine learning for informed well-being.

artificial-intelligence css flask generative-ai generative-model html machine-learning python reinforcement-learning scikit-learn

Last synced: 13 Jan 2025

https://github.com/ax-va/numpy-pandas-matplotlib-scikit-learn-vanderplas-2023

These examples provide an introduction to Data Science and classic Machine Learning using NumPy, Pandas, Matplotlib, and scikit-learn. They are taken, with some changes, from the book "Python Data Science Handbook: Essential Tools for Working with Data", Second Edition, written by Jake VanderPlas and published by O'Reilly Media in 2023.

ax-va classic-machine-learning data-science machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 18 Nov 2024

https://github.com/spamfromaditya/drugs-consumption-prediction-model-eda-bagging-classifier

Drug consumption prediction models are like crystal balls for public health. By analyzing vast amounts of data, these models can identify individuals or communities at higher risk of drug use. They consider factors like demographics, social media activity, prescription history, and even economic indicators.

bagging-classifier machine-learning matplotlib numpy python scikit-learn

Last synced: 31 Dec 2024

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 29 Jan 2025

https://github.com/venky-1710/stress-level-predection

Stress Level Prediction is a web app using machine learning to estimate user stress levels. It takes inputs like anxiety, sleep quality, and academic performance, then predicts stress using a Decision Tree Classifier. Built with Python, Flask, and scikit-learn, it's useful for students, researchers, and those interested in stress management.

css flask html machine-learning numpy pandas python python-sklearn scikit-learn

Last synced: 20 Jan 2025

https://github.com/guoshijiang/scikit-learn

带你一起学习scikit-learn

nlp-machine-learning scikit-learn

Last synced: 24 Jan 2025

https://github.com/raythurman2386/gis-playground

GIS Playground is a comprehensive web-based GIS application that combines multiple data sources and provides advanced spatial data visualization and analysis capabilities. The application features real-time wildfire data integration, intelligent spatial data processing, and interactive mapping functionality.

flask gdal geopandas leaflet nltk scikit-learn

Last synced: 27 Jan 2025

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal of predicting the sale price of bulldozers. How well can we predict the future sale price of a bulldozer, given its characteristics previous examples of how much similar bulldozers have been sold for? (Archive kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/tomwassing/brane-project

Brane example project using the Scikit-learn and Matplotlib packages

brane branescript matplotlib scikit-learn

Last synced: 02 Feb 2025

https://github.com/rajikaimal/emma

:santa: Intelligent mention bot for GitHub organizations

bot emma machine-learning python scikit-learn

Last synced: 14 Dec 2024

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighboursClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/siam29/hybrid-feature-engineering-and-ensemble-learning

In this ML project, I proposed a methodology that provided an outperformed performance compared to another existing paper. For the comparison here focused mainly on F1, accuracy, AUC, and ROC score. This methodology provides a 99.96% accuracy score and 90.05% F1 score. 

feature-selection keras-tensorflow machine-learning matplotlib python scikit-learn

Last synced: 01 Feb 2025

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 02 Feb 2025

https://github.com/mgobeaalcoba/data_champions_meli

Algorithms and work carried out within the framework of data champions by Mercado Libre

algorithms canvas classification clustering data-science machine-learning python3 scikit-learn

Last synced: 20 Jan 2025

https://github.com/vidhi1290/text-classification-model-with-attention-mechanism-nlp

This Python project utilizes PyTorch to perform text classification with an attention mechanism. Pre-trained GloVe embeddings are processed for word representation, and a custom attention model is trained on consumer complaint data to categorize complaints into product categories.🎯

attention-mechanism deeplearning machine-learning nlp nltk numpy pandas python pytorch scikit-learn text-classification tqdm

Last synced: 02 Feb 2025

https://github.com/kingabzpro/mlops-with-jenkins

From data ingestion to deploying the model using Jenkins.

classification fastapi jenkins mlops scikit-learn

Last synced: 13 Oct 2024

https://github.com/corentinth/ml-gender_classification

[Machine Learning] The Hello Wolrd of Machine Learning using sklearn

body-metrics gender-classification machine-learning scikit-learn

Last synced: 20 Dec 2024

https://github.com/bistcuite/plainml

Painless Machine Learning Library for python based on scikit-learn

machine-learning ml plainml python scikit-learn

Last synced: 02 Feb 2025

https://github.com/mgobeaalcoba/linear_algebra_for_machine_learning

Explore fundamental linear algebra concepts essential for machine learning in this repository, with code examples and explanations. Get a solid foundation for ML!

machine-learning matplotlib numpy pandas python3 scikit-learn scipy seaborn

Last synced: 20 Jan 2025

https://github.com/bacross/datamunger

python package for handling nan's and outliers

data data-frame datamunger knn nan outliers python scikit-learn

Last synced: 02 Feb 2025

https://github.com/vidhi1290/hr_employee_prediction

"Welcome to the HR Employee Promotion Prediction project! This repository contains the code and resources for a machine learning project that focuses on predicting employee promotions. By analyzing various employee attributes, this project aims to provide valuable insights for HR decision-making and talent recognition within organizations.

data-exploration data-science data-visualization docker hr-employee-prediction hyperparameter-tuning machine-learning matplot model-building numpy pandas scikit-learn seaborn streamlit streamlit-webapp

Last synced: 02 Feb 2025

https://github.com/catlikeflyer/rsp-recognition

A computer vision project to recognize thumbs up

machine-learning mediapipe-hands python scikit-learn

Last synced: 21 Jan 2025

https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects

A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.

data-analysis data-science dataframes exploratory-data-analysis pandas python scikit-learn visualization

Last synced: 07 Nov 2024

https://github.com/carmoreno/analisisaccidentalidadbogota

Data Analysis about traffic accidents at Bogotá, Colombia.

data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn

Last synced: 05 Jan 2025