Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/prithivsakthiur/data-board

Data Boards - Visualization of various plots (Analysis)

data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces

Last synced: 21 Dec 2024

https://github.com/zen204/airbnb_availability

A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.

binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning

Last synced: 03 Nov 2024

https://github.com/bhuvaneshwarguttula/student-performance-indicator

Understand and predict how student performance (test scores) is affected by other variables (Gender, Ethnicity, Parental level of education, Lunch, Test preparation course).

exploratory-data-analysis machine-learning pandas python scikit-learn student-performance-analysis

Last synced: 10 Oct 2024

https://github.com/ultrasage-danz/scikit-learn-ml

Machine Learning with scikit-learn by Data School

ai data data-school machine-learning macos ml scikit-learn ultrasage-dan

Last synced: 02 Dec 2024

https://github.com/ghufranbarcha/codsoft-machine-learning-internship

This repository contains all the Machine Learning and NLP tasks completed during my internship at Codsoft.

jupyter-notebook machinelearning nlp nltk python scikit-learn

Last synced: 02 Dec 2024

https://github.com/adzialocha/notebook

Jupyter notebooks for random experiments with audio processing, data analysis and machine learning

jupyter-notebook keras learning librosa music21 scikit-learn

Last synced: 22 Dec 2024

https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis

Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.

data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn

Last synced: 21 Dec 2024

https://github.com/vidhi1290/text-classification-model-with-attention-mechanism-nlp

This Python project utilizes PyTorch to perform text classification with an attention mechanism. Pre-trained GloVe embeddings are processed for word representation, and a custom attention model is trained on consumer complaint data to categorize complaints into product categories.🎯

attention-mechanism deeplearning machine-learning nlp nltk numpy pandas python pytorch scikit-learn text-classification tqdm

Last synced: 08 Dec 2024

https://github.com/gappeah/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 10 Oct 2024

https://github.com/offchan42/thai-thesis-classification

Classifies each document in the corpus using the Python machine learning module scikit-learn

nlp python python2 scikit-learn segment thai thai-language thai-thesis-classification

Last synced: 29 Oct 2024

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 08 Dec 2024

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in the IBM Data Science Professional Certificate specialization. It summarises, in the form of a project, all the material learned during the specialization.

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 10 Oct 2024

https://github.com/bacross/datamunger

Python package for handling NaNs and outliers

data data-frame datamunger knn nan outliers python scikit-learn

Last synced: 08 Dec 2024

https://github.com/sethios-notebook/__ia_learnig__

Python training course focused on Machine Learning, in French. Learn Python in 30 files covering Numpy, Pandas, Matplotlib, Scipy, Sklearn, Seaborn, H5py, and many other techniques. Python is the language of choice for machine learning, deep learning, and data science.

iac machine-learning matplotlib numpy python scikit-learn

Last synced: 21 Dec 2024

https://github.com/md-emon-hasan/ml-project-car-price-prediction

🚗 End-to-end ML project for predicting car prices based on various features. Includes data preprocessing, model training, and a Flask web app for predictions.

car-price-prediction car-price-predictor data-science feature-engineering ml predictive-modeling scikit-learn

Last synced: 10 Oct 2024

https://github.com/rickiepark/ml-ko

Korean translation repository for machine learning and deep learning

deep-learning keras machine-learning python scikit-learn tensorflow

Last synced: 21 Dec 2024

https://github.com/rickiepark/ml-with-python-cookbook-2nd

<실무로 통하는 ML 문제 해결 with 파이썬> ("Practical ML Problem Solving with Python")

deep-learning machie-learning pytorch scikit-learn

Last synced: 21 Dec 2024

https://github.com/palak-463/tablataalrecognitionsystem

Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument. 🎛️

cnn deep-learning flask fnn librosa numpy os pickle python scikit-learn

Last synced: 09 Jan 2025

https://github.com/uea-geral/rna-perceptron-exercise

🤖 Artificial neural networks (RNA) course: training a Perceptron neuron.

jupyter-notebook neural-network numpy perceptron python scikit-learn

Last synced: 26 Nov 2024

https://github.com/juselara1/bregclus

Python implementation of Bregman Hard Clustering and Bregman Soft Clustering as a scikit-learn module.

bregman-divergence clustering numpy scikit-learn unsupervised-learning

Last synced: 31 Dec 2024

https://github.com/tsu2000/audit_risk

Machine learning web app in Streamlit about classifying fraudulent companies using various classification models.

machine-learning plotly python random-forest scikit-learn streamlit-webapp

Last synced: 22 Nov 2024

https://github.com/jersongb22/datascience_ibm_stockpredictionlstm_project

In the IBM Advanced Data Science specialization, an interactive real-time web application was developed using LSTM networks in TensorFlow to predict stock market trends for global companies.

apache-spark data-science deep-learning lstm-neural-networks machine machine-learning plotly python scikit-learn streamlit tensorflow

Last synced: 25 Nov 2024

https://github.com/dustinmichels/bayesian-values-guesser

Uses some user input, data from the World Values Survey <www.worldvaluessurvey.org>, and Bayes' rule to guess a number of beliefs the user might have. STATUS: In progress.

bayes-rule bayesian-values-guesser naive-bayes-classifier pandas python scikit-learn values-survey

Last synced: 14 Dec 2024

https://github.com/ayushshahh/fespn

A neural network made to predict final exam scores of students

mlp mlp-regressor multilayer-perceptron neural-network prediction-model scikit-learn

Last synced: 07 Dec 2024

https://github.com/swimshahriar/heart-attack-prediction

Heart attack prediction from 13 features.

jupyter-notebook pandas python3 scikit-learn

Last synced: 20 Dec 2024

https://github.com/khaymanii/movie-recommendation-model

This is a model built using Python and the cosine similarity algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/aahnik/gdsc-ml-ds-bootcamp-2023

This repo contains files given by my seniors as well as assignments and final project done by me during the bootcamp.

data-science machine-learning ml numpy pandas python3 scikit-learn

Last synced: 11 Oct 2024

https://github.com/khaymanii/loan_status_predictor

This is a loan status predictor model built using the Support Vector Machine algorithm and the Gradio UI library

matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/rusiruchapana/blood-group-prediction

Creating a machine learning project to predict blood groups from fingerprint patterns

asp-net-web-api keras matplotlib numpy opencv-python pandas pillow scikit-learn tensorflow

Last synced: 21 Jan 2025

https://github.com/f-aguzzi/chemfusekit

Chemometrics library for data fusion, model training and prediction of data from multiple sensor sources.

chemometrics datafusion knn lda pca plsda scikit-learn svm

Last synced: 01 Jan 2025

https://github.com/ishanoshada/matplot3dex

A Matplotlib 3D Extension package for enhanced data visualization

data data-science matplotlib python-packages scikit-learn

Last synced: 01 Dec 2024

https://github.com/canayter/unsupervised-machine-learning

Utilizing Python and unsupervised learning to predict if cryptocurrencies are affected by 24-hour or 7-day price changes.

k-means-clustering python scikit-learn unsupervised-machine-learning

Last synced: 09 Jan 2025

https://github.com/magnuss0/movie-rec-system

The project extracts movie data using TheMovieDB API, processes it using TF-IDF and cosine similarity for generating recommendations, and stores the data in a DuckDB database. The system is encapsulated within a FastAPI web application and can be deployed using Docker. It provides movie recommendations in JSON format.

cosine-similarity docker duckdb movies-recommendation moviesdb-api ploomber poetry-python scikit-learn streamlit tf-idf

Last synced: 25 Nov 2024

https://github.com/troublem1/mle

MultiLabel-Transformer (MLE) is an extended version of LabelEncoder that encodes multiple categorical columns as numeric values in any workflow or pipeline

packages python3 scikit-learn sklearn

Last synced: 01 Dec 2024

https://github.com/spamfromaditya/drugs-consumption-prediction-model-eda-bagging-classifier

Drug consumption prediction models are like crystal balls for public health. By analyzing vast amounts of data, these models can identify individuals or communities at higher risk of drug use. They consider factors like demographics, social media activity, prescription history, and even economic indicators.

bagging-classifier machine-learning matplotlib numpy python scikit-learn

Last synced: 31 Dec 2024

https://github.com/francescopaolol/favoritatimeseriesforecasting

See: https://www.kaggle.com/competitions/store-sales-time-series-forecasting

jupyter-notebook kaggle-competition machine-learning pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/francescopaolol/titaniccompetition

My first Kaggle competition: predicting survival on the Titanic and getting familiar with ML basics

jupyter-notebook kaggle-competition machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/francescopaolol/logisticregression

Predicting survival on the Titanic and getting familiar with ML basics

jupyter-notebook kaggle logistic-regression machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/shaadclt/data-preprocessing-pipeline

This project contains a data preprocessing pipeline implemented in Python using the pandas and numpy libraries. The pipeline handles missing values, outliers, and normalizes numeric features in a dataset.

numpy pandas scikit-learn

Last synced: 07 Dec 2024

https://github.com/alessiochen/setiment-analysis-ai-project

Application of sentiment analysis for the Artificial Intelligence class at UNIFI

ai andrew dataset movie-reviews scikit-learn sentiment-analysis

Last synced: 05 Jan 2025

https://github.com/francescopaolol/decisiontree

Classifying iris plants into three species using this classic dataset

decision-tree-classifier jupyter-notebook kaggle machine-learning ml pandas scikit-learn

Last synced: 22 Dec 2024

https://github.com/francescopaolol/sentimentanalysis

Sentiment analysis on the IMDB Dataset of 50K Movie Reviews

jupyter-notebook kaggle machine-learning ml pandas scikit-learn sentiment-analysis

Last synced: 22 Dec 2024

https://github.com/h-fuzzy-logic/python-finding-nsf-award-themes

Using NLP to find themes and concepts in NSF Awards

nltk pandas python scikit-learn

Last synced: 15 Dec 2024

https://github.com/ax-va/numpy-pandas-matplotlib-scikit-learn-vanderplas-2023

These examples provide an introduction to Data Science and classic Machine Learning using NumPy, Pandas, Matplotlib, and scikit-learn. They are taken, with some changes, from the book "Python Data Science Handbook: Essential Tools for Working with Data", Second Edition, written by Jake VanderPlas and published by O'Reilly Media in 2023.

ax-va classic-machine-learning data-science machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 18 Nov 2024

https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning

This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.

decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc

Last synced: 19 Dec 2024

https://github.com/chanioxaris/german-credit-data

Experimental classification algorithms on German credit data, implemented using the scikit-learn library

classification classifier cross-validation dataset information-entropy information-gain naive-bayes prediction random-forest scikit-learn support-vector-machines

Last synced: 02 Nov 2024

https://github.com/jdavydovportfolio/careerpredictor

A project leveraging AI and Machine Learning (Logistic Regression) to predict graduate job placements. Includes data preprocessing, exploratory analysis, and predictive modeling.

artificial-intelligence exploratory-data-analysis jupyter-notebook linear-regression logistic-regression machine-learning machine-learning-algorithms machine-learning-models matplotlib ml numpy pandas pandas-dataframe predictive-modeling programming python scikit-learn

Last synced: 07 Dec 2024

https://github.com/guoshijiang/scikit-learn

Let's learn scikit-learn together

nlp-machine-learning scikit-learn

Last synced: 24 Nov 2024

https://github.com/aditya172926/text_summarization

Project to generate summaries and perform Named Entity Recognition from multiple types of text bodies.

glove machine-learning nlp python scikit-learn spacy

Last synced: 24 Nov 2024

https://github.com/pankajarm/tabular_ml_toolkit

A helper library to jumpstart your machine learning project based on tabular or structured data.

data-science feature-engineering hyperparameter-tuning machine-learning parallelism python scikit-learn structured-data tabular xgboost

Last synced: 21 Dec 2024

https://github.com/priboy313/pandasflow

A set of custom Python modules for a friendly pandas workflow

catboost data-analysis data-science pandas phik python scikit-learn shap

Last synced: 21 Dec 2024

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal is to predict the sale price of bulldozers. How well can we predict the future sale price of a bulldozer, given its characteristics and previous examples of how much similar bulldozers have been sold for? (Archived Kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 01 Dec 2024

https://github.com/ccharlesss/financeml

Machine learning web application using Python's FastAPI and scikit-learn to predict S&P 500 stock price trends and cluster stocks based on average annual returns and volatility. Utilised the MVC design pattern to structure the application effectively. Implemented a decision tree classifier with 84% accuracy.

cicd docker fastapi finance javascript jenkins machine-learning restful-api scikit-learn webapplication

Last synced: 13 Jan 2025

https://github.com/tharindanimnajith/deep-learning-spam-detection

Deep Learning classifiers to detect spam SMS messages - LSTM Model, DenseNet CNN Models - NLP, Python, Jupyter Notebook, Tensorflow, Keras, Numpy, Pandas, Matplotlib, Scikit-Learn

deep-learning densenet keras lstm nlp python3 scikit-learn tensorflow

Last synced: 25 Nov 2024

https://github.com/an-exodus/dubai-real-estate-price-prediction-ml

This repository contains a comparative analysis of machine learning algorithms to predict real estate prices in Dubai. Using data from Bayut, we evaluate Decision Tree, Linear Regression, Random Forest, and Gradient Boosting models based on their predictive accuracy.

decision-tree gradient-boosting linear-regression machine-learning random-forest scikit-learn

Last synced: 21 Dec 2024

https://github.com/md-emon-hasan/6-classification-iris-ml-apps

An ML project on the classification of the Iris dataset, demonstrating data preprocessing, model training, and evaluation using Python and scikit-learn.

classification data-science iris-classification iris-dataset iris-flower-classification predictive-modeling scikit-learn

Last synced: 13 Jan 2025

https://github.com/j-i-l/tfb-prediction

Transcription factor binding prediction

bioinformatics machine-learning pandas python scikit-learn

Last synced: 20 Dec 2024

https://github.com/dinhanhx/determination

Scripts to set the global random seed for some machine learning frameworks

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 30 Nov 2024

https://github.com/asosnovsky/analyzing-blood-vessel-aneurysm

A few simple scripts to identify an aneurysm in a blood vessel (research projects)

machine-learning meanshift medical-image-processing scikit-learn

Last synced: 23 Nov 2024

https://github.com/aarryasutar/logistic_regression_on_age_prediction

This code evaluates the performance of a logistic regression model on age prediction using various features to predict a binary target variable, calculating metrics to determine the performance. It evaluates the comparison, identifies favorable features, and visualizes the ROC-AUC curve to determine the best model performance.

accuracy-score confusion-matrix f1-score feature-selection logistic-regression model-training numpy pandas precision recall rmse roc-auc-curve scikit-learn visualization

Last synced: 21 Dec 2024

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighborsClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 01 Dec 2024

https://github.com/vectominist/mednlp

Mandarin Medical Dialogue Analysis with PyTorch.

dialog huggingface mandarin medical pytorch scikit-learn transformers

Last synced: 02 Dec 2024

https://github.com/bestmahdi2/uni__pythonsupportvectormachinesbinaryclassification

A university project in which binary classification with support vector machines is implemented in Python

binary-classification classification matplotlib numpy python scikit-image scikit-learn seaborn support-vector-machine svm

Last synced: 17 Jan 2025

https://github.com/edikedik/eboruta

Flexible and transparent Python Boruta implementation

ensemble-models feature-selection machine-learning python scikit-learn

Last synced: 13 Oct 2024

https://github.com/priyanshulathi/url-based-web-phishing-detection-model

A machine learning project for detecting phishing URLs by extracting key features from web addresses and applying various classification algorithms to identify potential threats effectively.

decision-tree knn logistic-regression machine-learning python random-forest scikit-learn

Last synced: 22 Jan 2025

https://github.com/shliakhovai/house-price-prediction

This repository contains a complete machine learning pipeline for predicting housing prices. It includes data preprocessing, feature engineering, and model training and evaluation components, designed to provide a robust solution for regression tasks.

data-science machine-learning matplotlib numpy pandas prediction python regression scikit-learn seaborn

Last synced: 21 Dec 2024