Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
scikit-learn
![](https://explore-feed.github.com/topics/scikit-learn/scikit-learn.png)
scikit-learn is a widely used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2025-02-16 00:26:44 UTC
https://github.com/icejan/predicton-systems
Various systems that train on data and generate a prediction
lightfm machine-learning numpy python scikit-learn
Last synced: 27 Jan 2025
https://github.com/sorenfyhn/dtu-machine-learning-project
My project work for the DTU 02450 Machine Learning course.
machine-learning python pytorch quarto scikit-learn
Last synced: 23 Jan 2025
https://github.com/sorenfyhn/resights-avm-house-prices
My attempts for the automatic valuation model (AVM) hackathon by Resights.
machine-learning python pytorch scikit-learn
Last synced: 23 Jan 2025
https://github.com/satvikpraveen/fashionmnist-analysis
A comprehensive analysis of the Fashion MNIST dataset using PyTorch. Covers data preparation, EDA, baseline modeling, and fine-tuning CNNs like ResNet. Includes modular folders for data, notebooks, and results. Features CSV exports, visualizations, metrics comparison, and a requirements.txt for easy setup. Ideal for ML workflow exploration.
computer-vision confusion-matrix convolutional-neural-networks deep-learning-algorithms exploratory-data-analysis fashion-mnist-dataset fine-tuning hyperparameter-tuning image-classification jupyter-notebook machine-learning-algorithms matplotlib-pyplot model-evaluation numpy pandas pytorch resnet-18 scikit-learn seaborn vgg
Last synced: 23 Jan 2025
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 06 Jan 2025
https://github.com/rahulsm20/insurance-data
A data analytics project dealing with risk assessment and its effects in health insurance.
data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 06 Jan 2025
https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland
Elementary research for a company opening a store in a city, using Gurobi and PuLP optimization.
christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem
Last synced: 23 Jan 2025
https://github.com/selcia25/sleep-disorder-detection
💤This project aims to develop an automated method for detecting sleep disorders from heart rate signals.
cnn-classification kmeans-clustering machine-learning matplotlib scikit-learn scipy sleep-disorders tensorflow
Last synced: 26 Jan 2025
https://github.com/stewartpark/sklearn2gem
⚡ sklearn2gem ports your scikit-learn model into a fast ruby C binding!
ruby rubygem scikit-learn sklearn
Last synced: 03 Jan 2025
https://github.com/adam-maz/virtual_screening
Within this repository I present scripts that can be helpful during virtual screening in drug design & development.
clusterization jupyter-notebook k-means-clustering maestro-schrodinger medicinal-chemistry molecular-fingerprints pandas python rdkit scikit-learn scoring-functions virtual-screening
Last synced: 17 Feb 2025
https://github.com/alpha597/music_classification_ml
A project which compares different machine learning algorithms' accuracy in music genre classification of a large dataset.
machine-learning pandas python scikit-learn tensorflow
Last synced: 17 Feb 2025
https://github.com/nicolasvauche/vinylexplore_ml
VinyleXplore is an intelligent vinyl record recommendation engine based on the user's mood and listening context. It uses FastAPI to expose a REST API and scikit-learn to train a machine learning model that improves the relevance of its suggestions.
machine-learning python scikit-learn vinyle
Last synced: 17 Feb 2025
https://github.com/danaelshrbiny10/backpack-price-prediction
A deep learning model for predicting backpack prices using TensorFlow and Keras. Includes data preprocessing, model training, and visualizations for performance analysis.
keras matplotlib pandas python3 scikit-learn seaborn tensorflow
Last synced: 17 Feb 2025
https://github.com/danielwohlr/delivery_time_series
Time series forecasting of food delivery service data
forecasting-time-series python scikit-learn
Last synced: 17 Feb 2025
https://github.com/danicaalana/wine-dataset-decision-tree
This project was developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using the Wine Recognition Dataset from scikit-learn, which contains the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.
data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset
Last synced: 17 Feb 2025
https://github.com/abrarshahok/wildlife-species-identification-using-cnn
In this project I've built and trained a Convolutional Neural Network (CNN) to classify wild animals.
cnn cnn-classification cnn-keras cnn-model convolutional-neural-networks neural-network numpy opencv pandas python scikit-learn tensorflow wildlife-classification
Last synced: 17 Feb 2025
https://github.com/nazmul-1117/100-days-of-machine-learning
I'm Nazmul, and I'm so excited to start a new journey: 100 Days of Machine Learning. It's February 8, 2025. I'm so excited; let's see what happens, insha'Allah.
data-science machine-learning numpy pandas-dataframe python3 scikit-learn statistics
Last synced: 17 Feb 2025
https://github.com/mirzaazwad/tymbert
TYMBert is our submission for NCIM 2025, a spam classifier that makes use of knowledge distillation to compress the model while preserving accuracy
bert huggingface-transformers knowledge-distillation machine-learning matplotlib numpy pandas python3 scikit-learn tiny-bert torch
Last synced: 17 Feb 2025
https://github.com/jenil311/application-of-covid-19-spread-analysis
The objective of this project is to study the COVID-19 outbreak using basic statistical techniques and make short term predictions using ML regression methods.
covid19-tracker machine-learning regression-analysis regression-models ridge-regression scikit-learn
Last synced: 22 Jan 2025
https://github.com/sdpdas/sm_sentiment_analysis
Sentiment classification using Natural Language Processing (NLP) with pandas, NumPy, scikit-learn, and NLTK, applying logistic regression as the supervised model. The pickle library is used to save the model so it can be run anywhere.
logistic-regression machine-learning nlp scikit-learn sentiment-analysis stemming vectorizer
Last synced: 22 Jan 2025
https://github.com/alan-oliveir/cnn_digitos_manuscritos
Final project for the Neural Networks course (2023/1): a convolutional neural network for identifying handwritten digits.
convolutional-neural-networks jupyter-notebook scikit-learn
Last synced: 13 Jan 2025
https://github.com/daniel-furman/recfeatureselect
Feature selection functions (1) using the multi-collinearity matrix and recursively proceeding to a spearman threshold and (2) using Forward Stepwise Selection running on an ensemble sklearner (with options for HPO).
correlation-threshold machine-learning modeling multicollinearity recursion recursive-algorithm scikit-learn spearman-rho
Last synced: 22 Jan 2025
https://github.com/jo-minseok/global-warming-100year
🌡️ Machine learning predictions of global temperature, sea level, Arctic ice, and carbon through 2100 [completed]
arima-model machine-learning matplotlib numpy pandas scikit-learn seaborn
Last synced: 10 Feb 2025
https://github.com/mayhixza/insurance-dataset-analysis
Medical cost insurance EDA project
data-science data-visualization eda linear-regression matplotlib scikit-learn seaborn
Last synced: 02 Feb 2025
https://github.com/chdl17/lead-score-case-study
Lead scoring is the process of assigning a numerical value or score to each lead, based on factors such as demographics and behavior, to determine their potential value as customers.
machine-learning-algorithms matplotlib-pyplot python scikit-learn
Last synced: 06 Jan 2025
https://github.com/saadarazzaq/custom-neural-networks
Perceptron, Multiclass and Deep Neural Network Implemented All in One Repo ✨
deep-neural-networks keras numpy perceptron python scikit-learn tensorflow
Last synced: 23 Jan 2025
https://github.com/bishopce16/credit_risk_analysis
An analysis to build and evaluate a dataset from LendingClub to discover credit risk using Supervised Machine Learning Algorithms.
imbalanced-learning jupyter-notebook machine-learning machine-learning-algorithms pandas python scikit-learn visual-studio-code
Last synced: 06 Jan 2025
https://github.com/bishopce16/cryptocurrencies
An analysis of a cryptocurrencies dataset using unsupervised machine learning, the PCA algorithm, and K-means clustering.
hvplot jupyter-notebook pandas plotly python scikit-learn unsupervised-machine-learning visual-studio-code
Last synced: 06 Jan 2025
https://github.com/lkethridge/intro_to_machine_learning_project
Intro to Machine Learning Project from TripleTen
accuracy-metrics classification decision-tree-regression evaluation-metrics jupyter-notebook linear-regression logistic-regression machine-learning mean-square-error model-comparison-and-selection model-quality random-forest-regression regression scikit-learn splitting-data supervised-learning trained-models tuning-hyperparameters
Last synced: 21 Jan 2025
https://github.com/visu512/spotify-music-recommender
Built Spotify Music recommendation system using Machine learning
csv-files dataset jupyter-notebook kmeans-clustering machine-learning matplotlib numpy panda python scikit-learn spotify-api stream vscode
Last synced: 21 Jan 2025
https://github.com/codealphaprojects/spotify-music-recommender
CodeAlpha Machine Learning Task-1
jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn spotify spotify-api streamlit vscode
Last synced: 21 Jan 2025
https://github.com/nowon1/insurance-claim-prediction_version
This project aims to predict the insurance claim amounts based on various customer attributes using machine learning techniques. The project involves data preprocessing, exploratory data analysis, feature engineering, and model training and evaluation.
data-preprocessing data-science data-visualization exploratory-data-analysis feature-engineering insurance jupyter-notebook machine-learning numpy pandas predictive-modeling python random-forest regression-analysis scikit-learn
Last synced: 21 Jan 2025
https://github.com/esha-sm/forecastx
This is an interactive web application for forecasting sales data using the ARIMA model. Users can upload their own CSV files or use a default dataset to generate forecasts and visualizations.
arima-model flask-api jupyter-notebook matplotlib pandas plotly python scikit-learn seaborn
Last synced: 21 Jan 2025
https://github.com/bagusperdanay7/absa-with-bilstm-undergraduate-thesis
My undergraduate thesis program, Aspect-Based Sentiment Analysis Towards Market Place Application Review Using Bidirectional Long Short-Term Memory, built with Python, Keras, and TensorFlow.
ai aspect-based-sentiment-analysis bilstm deep-learning gensim imbalanced-learning ipython-notebook keras machine-learning matplotlib natural-language-processing nltk numpy pandas python scikit-learn seaborn tensorflow
Last synced: 21 Jan 2025
https://github.com/sonaligill/olympics-analysis
The outcome of this project is an interactive Streamlit web application that visualizes an analysis of Olympic data, letting users explore different aspects of Olympic history, compare country performances, and gain insights into athlete demographics.
numpy plotly python scikit-learn scipy streamlit
Last synced: 14 Feb 2025
https://github.com/kanika300393/loan_prediction
This project implements a Loan Prediction system using Support Vector Machine (SVM). It includes data preprocessing, visualization of features like income and education, and model evaluation. The goal is to predict loan approval based on the dataset. Clone the repo to explore the code and improve the model.
data-science machine-learning numpy pandas python scikit-learn svm-classifier
Last synced: 17 Feb 2025
https://github.com/gurpreet0022/nlp_exploration
This repository explores various Natural Language Processing (NLP) techniques using the NLTK library in Python. It demonstrates these techniques on a sample dataset and performs sentiment analysis on movie reviews.
beginner-friendly nlp nlp-machine-learning nltk scikit-learn
Last synced: 17 Feb 2025
https://github.com/thariniselvakumar/kidney-disease-prediction
This project is about kidney disease prediction using machine learning algorithms.
machine-learning matplotlib numpy pandas scikit-learn seaborn
Last synced: 17 Feb 2025
https://github.com/tiarmdhnt/detect-botnets-in-network-traffic
Application of Deep Learning to Detect Botnets in Network Traffic Using CTU-13 Dataset
botnet-detection deep-learning machine-learning matplotlib network-security neural-networks pandas python pytorch scikit-learn seaborn tensorflow
Last synced: 17 Feb 2025
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 17 Feb 2025
https://github.com/omidghadami95/als_vs_gradientdescent
Matrix Factorization (ALS vs Gradient Descent)
als data-mining data-science factorization gradient-descent machine-learning matrix-factorization mf optimization prediction scikit-learn visualization
Last synced: 17 Feb 2025
https://github.com/coder5omkar/logistic-regression-customer-churn-prediction
This project uses Logistic Regression to predict customer churn in the telecom industry. To run, clone the repository, install dependencies, and run the Jupyter notebook for full analysis and predictions.
logistic-regression ml pandas scikit-learn seaborn statistics
Last synced: 17 Feb 2025
https://github.com/thekartikeyamishra/customer-retention-predictor
The Customer Retention Predictor is a Python-based tool designed to help businesses predict customer churn using historical data. This project is particularly beneficial for small businesses and MSMEs in India, allowing them to identify customers at risk of leaving and take proactive measures to retain them.
joblib machine-learning numpy pandas python scikit-learn tinker
Last synced: 17 Feb 2025
https://github.com/gregoritsch3/ml_eda_classification_diabetes
An EDA and Machine Learning Classification exercise on the Diabetes dataset demonstrating the use of SQLAlchemy data import from an SQL database (PostgreSQL), Pre-processing Pipelines, ANOVA, 9 scikit-learn ML models, Hyperparameter Tuning for the best performing one, and feature importance.
anova machine-learning matplotlib numpy pandas pipelines scikit-learn seaborn sql sqlalchemy statistics
Last synced: 17 Feb 2025
https://github.com/sayed-ashfaq/delhivery-dataanalysis
In this project, I conducted basic analysis, feature engineering, normalization, and outlier handling, along with statistical and non-parametric testing to extract insights.
feature-engineering normalization outlier-detection pandas python scikit-learn statistcal-tests statistical-analysis
Last synced: 17 Feb 2025
https://github.com/the-developer-306/fake-review-detector
This project is a machine learning-based review classification system that predicts whether a product review is GENUINE or FAKE. It preprocesses review text, analyzes sentiment, and uses numerical features like ratings and helpfulness to make predictions. The model is deployed via a Flask web application for user interaction.
classification flask logistic-regression machine-learning numpy pandas python renderdeploy scikit-learn sentiment-analysis
Last synced: 17 Feb 2025
https://github.com/aliy98/navigation-sensor-data-classification
Classification of a Navigation Robot Sensor Dataset Using SVM, Random Forest and Neural Network
artificial-neural-networks keras multiclass-classification random-forest scikit-learn scitos-g5 support-vector-machines
Last synced: 02 Feb 2025
https://github.com/3rd-son/breast-cancer-prediction-app
classification-algorithm machine-learning python scikit-learn
Last synced: 28 Dec 2024
https://github.com/ahmed122000/ml_model_deployment
The HR Analytics: Job Change Predictor is a Flask-based web application that uses machine learning to predict whether an employee will stay with a company or leave. It allows users to train models, evaluate their performance, and make predictions based on employee data, providing valuable insights for HR decision-making.
classification flask machine-learning python3 rest-api scikit-learn
Last synced: 02 Feb 2025
https://github.com/peterchain/titanic
Script for the Titanic dataset for evaluating which passengers survived
kaggle machine-learning pandas-dataframe python3 scikit-learn
Last synced: 02 Feb 2025
https://github.com/chewzzz1014/health-ease-project
coursework gradio machine-learning python scikit-learn tensorflow uvicorn
Last synced: 26 Jan 2025
https://github.com/pradipnp/decisiontree-iris
Machine learning project to classify iris flowers using a decision tree
classification decision-tree iris-dataset machine-learning python scikit-learn
Last synced: 10 Feb 2025
https://github.com/richardbmk/datascience_machinelearning
Projects related to data science and machine learning.
data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn
Last synced: 23 Jan 2025
https://github.com/syedshameersarwar/mlopslifecycle
grafana kserve kubeflow mlflow mlops monitoring scikit-learn
Last synced: 29 Dec 2024
https://github.com/sachinh123/cognitive-customer-insights-with-watson-ai
This project analyzes customer data to provide insights for personalized services, behavior prediction, and improved support.
flask ibm-cloud ibm-watson-assistant ibm-watson-nlu nltk python scikit-learn
Last synced: 10 Feb 2025
https://github.com/myahninsi/credit_card_fraud_detection
This repository is for the Neural Networks and Deep Learning Course - Assignment 1, focusing on credit card fraud detection. The project utilizes a machine learning model to predict whether a transaction is fraudulent using a synthetic credit card dataset.
matplotlib numpy pandas pickle python scikit-learn seaborn streamlit
Last synced: 10 Feb 2025
https://github.com/alessandromonolo/fraud-detection-binary-classification-model
This project builds a machine learning model to classify fraudulent clients using a banking dataset. Data preprocessing, statistical analysis, and feature selection were performed before training KNN and Random Forest Classifier. Model performance was evaluated using accuracy, precision, recall, and F1-score.
classification-model fraud-detection knn-classification machine-learning pandas python random-forest scikit-learn statistical-analysis
Last synced: 10 Feb 2025
https://github.com/arseniistratiuk/decision-tree-classifier-lab
A student lab project focused on tree algorithms and a Decision Tree classifier.
bellman-ford decision-tree-classifier floyd-warshall graph-algorithms kruskal-algorithm machine-learning networkx prim-algorithm scikit-learn
Last synced: 10 Feb 2025
https://github.com/gt7o3/loan-prediction
Predict loan approval status using machine learning techniques. This project demonstrates data preprocessing, feature engineering, model training, and evaluation, along with an interactive Streamlit app for real-time predictions. Ideal for financial decision-making.
accuracy-analysis juypter lending-club loan-application loan-data loan-default-prediction loan-prediction logistic-regression machine-learning pca predictive-analytics python scikit-learn visualization
Last synced: 10 Feb 2025
https://github.com/asuquoaa/predicting_viewer_engagement_with_educational_videos
This project uses machine learning to predict video engagement based on features such as transcript complexity, speaker speed, and silence periods. By understanding the factors influencing engagement, we can improve content recommendations and educational experiences.
data-visualization exploratory-data-analysis machine-learning scikit-learn
Last synced: 10 Feb 2025
https://github.com/miyajianimation/spam-filter
Spam-Filter is a powerful tool used to automatically detect and remove unwanted or unsolicited electronic messages that often flood email inboxes. It helps users to efficiently manage their emails by filtering out irrelevant or potentially harmful content, allowing them to focus on important messages.
anti-spam antispam blocklist cold-calls docker fritz-box fritzbox lua rspamd scikit-learn spam-classification spamd support-vector-machines zabbix
Last synced: 10 Feb 2025
https://github.com/abhivur/connections-ai
Contributors: Meet Gamdha, Gaurav Nimmagadda
bert python scikit-learn word2vec
Last synced: 10 Feb 2025
https://github.com/aicorsair/python-case-study-365-data-science-customer-segmentation-in-marketing
This repository contains a detailed case study on the segmentation of 365 Data Science customers using real-world data from an onboarding survey.
customer-segmentation data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization feature-engineering feature-selection hierarchical-clustering k-means-clustering machine-learning marketing marketing-analytics matplotlib pandas python scikit-learn social-media social-media-analysis
Last synced: 10 Feb 2025
https://github.com/akimuddinshaikh/machine-learning-project
A comparative study of regression models (Decision Tree, Random Forest, Ridge, Lasso, SVM) for predicting real estate prices in King County, NYC, and California using PCA & Pipeline techniques.
machine-learning pca-analysis python regression-models scikit-learn statsmodels
Last synced: 10 Feb 2025
https://github.com/akimuddinshaikh/domain-application-of-predictive-analysis
Data-Driven House Price Prediction: predicting house prices using machine learning techniques.
feature-engineering pca python random-forest scikit-learn
Last synced: 10 Feb 2025
https://github.com/szymonrucinski/pippi-lang
Elegant 📑 text preprocessing pipeline 🚰 available as pip package 🐍 based on scikit-learn pipeline. Combines Transformer and Column Transformer into a single object.
data-cleaning data-science nlp pipeline scikit-learn
Last synced: 02 Feb 2025
https://github.com/pymc-learn/pymc-learn-sphinx-theme
Sphinx theme for Pymc-learn documentation
pymc3 pymc4 scikit-learn sphinx sphinx-theme
Last synced: 02 Feb 2025
https://github.com/tbarlow12/learn-it-your-way
Using Python Flask, I wanted to create a simple web API that allows users to upload a dataset, choose one or more models, store them server side, and then hit an endpoint to get a prediction.
flask machine-learning python scikit-learn tensorflow
Last synced: 02 Feb 2025
https://github.com/nickklos10/concrete-compressive-strenght-prediction
This project predicts concrete compressive strength using a neural network regression model built with Keras.
jupyter-notebook keras matplotlib numpy pandas python scikit-learn
Last synced: 24 Oct 2024
https://github.com/mohit1106/fraud-detection
An anomaly detection system on 284,807 transactions, achieving an AUC of ~0.972 with CNNs and Autoencoders.
autoencoders cnn-model isolation-forest keras python scikit-learn tensorflow
Last synced: 02 Feb 2025
https://github.com/oroszgy/cookiecutter-ml-flask
Cookiecutter template for training and serving machine learning models with scikit-learn, spacy, Flask and Docker
docker flask flask-application machine-learning nlp rest-api scikit-learn spacy
Last synced: 03 Feb 2025
https://github.com/manojkp08/student-performance-analysis
The Student Performance Analyzer is your go-to solution for understanding and improving student performance. By blending the power of machine learning with interactive visualizations, this tool provides educators and learners with personalized insights into learning styles, performance gaps, and actionable improvements.
machine-learning numpy pandas python requests scikit-learn streamlit
Last synced: 27 Jan 2025
https://github.com/thchilly/mlds102_py_exercises
Complete exercise sets from MLDS Practical Data Science and Applications course
data-science matplotlib numpy pandas python scikit-learn scipy tensorflow
Last synced: 27 Jan 2025
https://github.com/broodhoney/titanic-ml-from-disaster
This repository contains my analysis and solutions for the Titanic: Machine Learning from Disaster competition on Kaggle. The notebook explores the dataset, performs extensive Exploratory Data Analysis (EDA), applies feature engineering techniques, and builds predictive models to determine survival outcomes based on passenger data
machine-learning numpy pandas python scikit-learn scikitlearn-machine-learning
Last synced: 03 Feb 2025
https://github.com/lmriccardo/moments-learning
Repository for the First-Second Moments Learning project. In this repo you will find an implementation of a learning model to learn the relationship between time-series model parameters and the first two moments of its outputs
machine-learning mean mlp-regressor models random-forest scikit-learn time-series torch variance
Last synced: 10 Feb 2025
https://github.com/clinton-mwachia/machine-learning-with-python
machine learning with python
machine-learning python regression scikit-learn
Last synced: 29 Dec 2024
https://github.com/sunilvarma-l/liverdiseaseprediction
"Streamlit app to predict liver disease risk using a machine learning model based on patient input data."
machine-learning matplotlib numpy pandas pickle python scikit-learn seaborn streamlit
Last synced: 07 Feb 2025
https://github.com/santiagoasp98/spam-detection
SMS spam detection using Logistic Regression and Multinomial Naive Bayes.
classification logistic-regression machine-learning multinomial-naive-bayes python scikit-learn spam-detection
Last synced: 07 Feb 2025
https://github.com/yashsuthar00/machine-learning
AI internship files
artifical-intelligense clustering data-science decision-trees jupyter-notebook k-means-clustering machine-learning natural-language-processing python regression-analysis scikit-learn
Last synced: 10 Feb 2025
https://github.com/vishal-verma-96/capstone_project_by_skill_academy
Capstone Project by Skill Academy: exploratory analysis, visualization, and prediction of used car prices. Deploys the highest-scoring model with a Streamlit web app.
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn
Last synced: 18 Oct 2024
https://github.com/monish-nallagondalla/cement_strength_prediction
The Cement Strength Prediction project uses machine learning to predict the compressive strength of cement based on its components, such as Cement, Fly Ash, Water, Superplasticizer, Coarse Aggregate, Fine Aggregate, and Age. The goal is to forecast compressive strength (MPa) for optimized cement production and quality control.
cement-strength-prediction construction-industry data-analysis data-preprocessing data-science data-visualization feature-engineering machine-learning predictive-modeling python regression-analysis scikit-learn
Last synced: 03 Feb 2025
https://github.com/trainingbypackt/hands-on-machine-learning-with-python
Structure, build, and deploy machine learning projects
machine-learning numpy pandas python scikit-learn scikitlearn-machine-learning
Last synced: 13 Jan 2025
https://github.com/lorenzorottigni/ml-lending-club
Machine Learning python bootcamp: random forest classifier on LendingClub dataset
ipynb machine-learning numpy pandas python random-forest-classifier scikit-learn seaborn
Last synced: 30 Jan 2025
https://github.com/avik-pal/kaggle-titanic
Predicting whether a given set of people survive on the Titanic
machine-learning numpy pandas scikit-learn scikitlearn-machine-learning
Last synced: 21 Jan 2025
https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization
A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.
dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python
Last synced: 03 Feb 2025
https://github.com/rishisolanke/twitter-sentiment-analysis-using-machine-learning-
A research project that classifies tweets as positive, negative, or neutral using ML algorithms (Logistic Regression, Naïve Bayes, SVM) with NLP preprocessing.
data-science data-visualization logistic-regression machine-learning ml-models naive-bayes natural-language-processing nlp scikit-learn sentiment-analysis svm text-classification twitter-data
Last synced: 03 Feb 2025
https://github.com/hasanulmukit/spam-email-classifier
This is a Spam Email Classifier built using Python and Streamlit. It uses a pre-trained model to predict whether an email is Spam or Not Spam. The app also provides the probability scores for both categories, enhancing transparency and reliability of the prediction.
email-classifier machine-learning nlp python scikit-learn spam-detection streamlit text-classification
Last synced: 03 Feb 2025
https://github.com/otuemre/viginids
VigiNIDS: A machine learning-based system for detecting malicious network traffic using the UNSW-NB15 dataset. It distinguishes between normal and attack activities, providing a data-driven approach to network security.
classification cybersecurity intrusion-detection-system machine-learning network-intrusion-detection python scikit-learn unsw-nb15 xgboost
Last synced: 03 Feb 2025
https://github.com/roboneet/enron_analysis
Udacity project (to be completed)
Last synced: 30 Jan 2025
https://github.com/virajbhutada/article-recommendation-system
This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.
anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy
Last synced: 01 Feb 2025
https://github.com/alphacrypto246/old-car-price-prediction
The Old Car Price Prediction project predicts used car prices using features like age, mileage, and fuel type. It includes data preprocessing, model training, and visualization of trends, with easy customization for additional features or models.
machine-learning numpy pandas scikit-learn scikitlearn-machine-learning
Last synced: 10 Feb 2025
https://github.com/alphacrypto246/student-learning-style-prediction
An interactive web application built with Streamlit that predicts a student's preferred learning style (visual, auditory, or kinesthetic) using machine learning, aiding educators in personalizing teaching strategies.
machine-learning scikit-learn scikitlearn-machine-learning streamlit
Last synced: 10 Feb 2025
https://github.com/alphacrypto246/stock-price-movement-prediction
A project leveraging Polynomial Regression to predict stock price movements based on historical data. Includes data preprocessing, feature engineering, visualization, and model evaluation to provide insights for informed trading decisions.
machine-learning numpy pandas polynomial-regression scikit-learn yfinance
Last synced: 10 Feb 2025
https://github.com/alphacrypto246/insurance-charges-prediction
The Predicting Insurance Charges project uses Decision Tree Regression to predict insurance charges based on features like age, sex, BMI, and smoking habits. It involves data preprocessing, feature scaling, and model evaluation with metrics like MAE and R².
machine-learning numpy pandas scikit-learn scikitlearn-machine-learning
Last synced: 10 Feb 2025
https://github.com/lordhacker756/estate-ai
Estate AI is a machine learning application that predicts the approximate rent a user would need to pay for their requirement across major metro cities of India. It is built using NextJS 13, TailwindCSS, and TypeScript for the frontend, scikit-learn for model training, and Flask for the backend.
fastapi flask machine-learning nextjs13 scikit-learn
Last synced: 03 Feb 2025
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 18 Jan 2025
https://github.com/sudarshanc00/brain-tumor-classification
This project uses a deep learning model in PyTorch to classify brain MRI images into four tumor types, aiding early diagnosis and treatment planning. Two ResNet-based models were developed and optimized, achieving high accuracy to support healthcare professionals in identifying tumor categories.
matplotlib numpy pytorch resnet scikit-learn streamlit
Last synced: 21 Jan 2025