Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


scikit-learn

scikit-learn is a widely used Python module for classical machine learning. It is built on top of SciPy.

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal is to predict the sale price of bulldozers: how well can we predict the future sale price of a bulldozer, given its characteristics and previous examples of how much similar bulldozers have sold for? (Archived Kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/gauravsingh9356/machine_learning

All my practical learning work in machine learning (from data processing to deep learning)

deep-learning jupyter-notebook machine-learning machine-learning-algorithms nlp-machine-learning python scikit-learn

Last synced: 02 Jan 2025

https://github.com/joshi-jyoti/heart-disease-prediction

This repository contains a Python-based project for predicting the likelihood of heart disease using a Logistic Regression machine learning model. It leverages a dataset of patient medical information to train and evaluate the model, providing insights into potential diagnoses.🩺

heart-disease-prediction heart-disease-predictor kaggle-dataset machine-learning numpy pandas python scikit-learn

Last synced: 04 Feb 2025

https://github.com/somjit101/nlp-casestudy-quora-question-similarity

An application of NLP and classical ML algorithms to an interesting real-world use case of predicting similarity between two questions on Quora. This allows the platform to combine similar questions into one and combine their answers to avoid duplication and unnecessary confusion.

cross-validation feature-engineering feature-extraction gradient-boosting kaggle logistic-regression machine-learning model-calibration natural-language-processing nlp quora-question-pairs scikit-learn svm text-mining xgboost

Last synced: 16 Jan 2025

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project uses time series analysis to forecast the exchange rate between the euro and the US dollar. It applies statistical techniques such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 27 Jan 2025

https://github.com/anoopgeorge418/my-learnings

Learning topics related to data science and web development from scratch.

bokeh bootstrap css folium html matplotlib numpy pandas plotly python scikit-learn seaborn

Last synced: 07 Jan 2025

https://github.com/shubhranpara/heart-disease-predictor

I created this project as my Python term assignment. In it, I trained an ML model to predict heart disease using the scikit-learn library in Python.

google-colab jupyter-notebook machine-learning medical prediction-model python scikit-learn

Last synced: 02 Jan 2025

https://github.com/hayatoy/gcpml-notebook

Dockerfile with Jupyter Machine Learning environment plus Google Cloud SDK

dockerfile google-cloud-platform jupyter scikit-learn tensorflow

Last synced: 02 Jan 2025

https://github.com/g-eoj/cv-tl-keras

Use the cross validation functions from scikit-learn to evaluate image classification transfer learning with Keras models.

cross-validation keras numpy scikit-learn transfer-learning

Last synced: 21 Jan 2025

https://github.com/chrislemke/scikit-tabtrans

TabTransformer ready for scikit-learn 🧑‍🔬

deep-learning machine-learning python scikit-learn transformer

Last synced: 09 Nov 2024

https://github.com/siam29/credit-card-fraud-detection-in-real-time

This project delivers a fast and efficient fraud detection methodology, providing predictions in under a second, emphasizing the importance of both high performance and quick response times.

ensemble-machine-learning feature-selection genetic-algorithm machine-learning matplotlib pandas pca scikit-learn

Last synced: 01 Feb 2025

https://github.com/haloapping/ml-with-me

The term "ML" tends to be a bit ambiguous, since it can stand for several things, such as Mobile Legend, Makan Lontong (eating lontong), and so on. This repo, though, is about Machine Learning :)

ml pusing python3 scikit-learn stress tau-ah-gelap

Last synced: 06 Jan 2025

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 28 Jan 2025

https://github.com/dinhanhx/determination

Scripts to set global random seeds for several machine learning frameworks

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 28 Jan 2025

https://github.com/idaraabasiudoh/knn-customer-classification

Assigns a telecommunications customer base to groups in order to determine the service type required for each customer.

data-analysis jupyter-notebook machine-learning pyhton3 scikit-learn

Last synced: 14 Jan 2025

https://github.com/haloapping/ml-workflow

Machine learning workflow template.

mahine-learning numpy pandas python3 scikit-learn

Last synced: 06 Jan 2025

https://github.com/kostasereksonas/ids_test

Code for an intrusion detection system based on the "Intrusion Detection System Using Machine Learning Algorithms" tutorial on GeeksforGeeks and the Intrusion Detection on NSL KDD GitHub repository.

ids intrusion-detection intrusion-detection-system nsl-kdd-dataset numpy pandas python scikit-learn tensorflow

Last synced: 16 Jan 2025

https://github.com/docsallover/spam-detection

Building a Spam Filter with Python: Using Machine Learning to Combat Spam

datascience flask jinja2 machine-learning numpy numpy-library pandas pandas-python python python3 scikit-learn

Last synced: 29 Jan 2025

https://github.com/sorenfyhn/resights-avm-house-prices

My attempts at the automatic valuation model (AVM) hackathon by Resights.

machine-learning python pytorch scikit-learn

Last synced: 23 Jan 2025

https://github.com/iamwatchdogs/cardiovascular-risk-prediction

This mini-project uses machine learning algorithms to predict possible risks of heart disease by analyzing given data.

jupyter-notebook machine-learning-algorithms matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Jan 2025

https://github.com/sorenfyhn/dtu-machine-learning-project

My project work for the DTU 02450 Machine Learning course.

machine-learning python pytorch quarto scikit-learn

Last synced: 23 Jan 2025

https://github.com/mastermindromii/car-price-prediction-model

My regression project on predicting car prices using linear regression.

linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 08 Jan 2025

https://github.com/icejan/predicton-systems

Various systems that train on data and generate a prediction

lightfm machine-learning numpy python scikit-learn

Last synced: 27 Jan 2025

https://github.com/dummumounika/ecommerce-sales-categorization

This repository contains Python code for text classification and analysis of e-commerce sales data. The script processes textual descriptions of products and categorizes them into predefined categories using a Naive Bayes classifier. It also includes various analysis and visualization methods to explore the dataset.

machine-learning matplotlib-pyplot ntlk numpy pandas python scikit-learn

Last synced: 02 Jan 2025

https://github.com/srosalino/prediction_of_seoul_bikes_demand

The objective of this project is to predict the number of bicycles that need to be made available each hour in order to make the service as efficient as possible.

cross-validation data-exploration-and-preprocessing hyperparameter-tuning machine-learning regularization-methods scikit-learn

Last synced: 08 Jan 2025

https://github.com/elifirinci/mushrooms-plants-classification

This project features AI models for identifying mushrooms and plants as poisonous or edible using image-based predictions. Both models are tested through an interactive Gradio interface, ensuring user-friendly and accurate identification for foragers and researchers.

classification cnn cnn-classification gradio image-classification machine-learning mushroom-classification plant-classification scikit-learn

Last synced: 08 Jan 2025

https://github.com/nikhilgugwad/sentiment-analysis

Sentiment analysis for the Kannada language to classify Kannada sentences into different emotions.

numpy pandas scikit-learn

Last synced: 02 Feb 2025

https://github.com/akshaya13/recommendation-system

Content Based Recommendation system using tags!

nltk scikit-learn similarity-search tmdb-database

Last synced: 08 Jan 2025

https://github.com/fahrettinsolak/ai-map-based-geographic-clustering-project

This project focuses on clustering crime incidents in San Francisco using the K-Means algorithm. The dataset is obtained from Kaggle and contains information about crime types, geographical coordinates, and other relevant features. The goal is to identify crime hotspots through geographic clustering and visualize the clusters on an interactive map.

artificial-intelligence deep-learning elbow-method jupyter-notebook machine-learning numpy openstreetmap pandas phyton plotly scikit-learn standardscaler

Last synced: 02 Feb 2025

https://github.com/tanaybhadula/ml-preprocessing-cli

A CLI tool written in Python that preprocesses datasets for supervised learning, saving users time. Input data can be preprocessed with simple commands, and the preprocessed dataset can be downloaded afterwards.

cli data-cleaning data-preprocessing machine-learning pandas python scikit-learn

Last synced: 10 Jan 2025

https://github.com/adhadse/hands-on-machine-learning-book-notes-and-practice

This repo holds the Jupyter notebooks and datasets containing notes and comments on things I learned from this book. Feel free to use and learn from them.

data-science deep-learning jupyter-notebooks keras machine-learning python scikit-learn tensorflow

Last synced: 10 Feb 2025

https://github.com/anras5/criteo-search-data

EDA and statistical tests on CriteoSearchData dataset

data-science pandas scikit-learn statistics

Last synced: 21 Jan 2025

https://github.com/josancamon19/boston_housing

Predicting Boston Housing Prices for Udacity Machine Learning Nanodegree

boston-housing-price-prediction machine-learning machine-learning-nanodegree scikit-learn udacity

Last synced: 28 Dec 2024

https://github.com/yuanx749/mlp-pytorch

PyTorch MLP with scikit-learn API.

pytorch scikit-learn

Last synced: 08 Jan 2025

https://github.com/rohra-mehak/sciencesync

System for processing personalized Google Scholar alerts and managing the resulting data, with ML-based clustering analysis

agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3

Last synced: 28 Dec 2024

https://github.com/divinenaman/color-extraction-api

Extract colours from images using K-means, along with FastAPI pipeline.

fastapi k-means-clustering scikit-learn

Last synced: 10 Feb 2025

https://github.com/dionixius7/titanic-disaster-ml-model

This project predicts the survival of passengers on the Titanic using the Kaggle Titanic Disaster dataset. The dataset contains passenger information such as age, gender, and class. Several machine learning algorithms are applied to predict each passenger's chance of survival as accurately as possible.

data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction

Last synced: 18 Jan 2025

https://github.com/thmslmr/scikitlearn-examples

💡Scikit Learn examples - Python

python scikit-learn tutorials

Last synced: 29 Dec 2024

https://github.com/rcghpge/pymo

A framework in Mojo for AI/ML/DL applications and other domains.

deep-learning machine-learning magic-cli modular mojo mojo-language pymo python scikit-learn

Last synced: 21 Jan 2025

https://github.com/fgebhart/handson-ml

hands-on machine learning notebooks collection

jupyter-notebook machine-learning scikit-learn

Last synced: 13 Jan 2025

https://github.com/adriantomin/bulldozer-price-prediction

Predicting the Sale Price of Bulldozers Using Machine Learning 🚜💰 This project uses machine learning to predict bulldozer sale prices based on historical data from the Kaggle Bluebook for Bulldozers competition. The goal is to minimize the RMSLE between actual and predicted prices.

data-science jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 09 Feb 2025

https://github.com/myahninsi/customer-segmentation-recommendation-ml

This project addressed challenges in understanding customer behavior and personalizing shopping experiences for an e-commerce platform. Developed ML solutions including K-Means clustering for segmentation, Random Forest regression for CLV prediction, and collaborative filtering for product recommendations.

collaborative-filtering k-means-clustering pandas python random-forest scikit-learn

Last synced: 30 Dec 2024

https://github.com/csengupta1101/housing-price-prediction

This project predicts house prices from the given features. Everything from data extraction to deployment is detailed here.

jupyter-notebook matplotlib numpy pandas python python3 scikit-learn vscode

Last synced: 29 Dec 2024

https://github.com/sundarmd/breast-cancer-detection

Breast-Cancer-Detection is a machine learning project that utilizes logistic regression to predict whether a tumor is benign or malignant based on the Breast Cancer Wisconsin (Diagnostic) dataset. The project demonstrates data preprocessing, model training, and evaluation using the `scikit-learn` library.

logistic-regression machine-learning python scikit-learn

Last synced: 30 Dec 2024

https://github.com/wlopezm-unal/titanic_ship-streamlit

A machine learning model that uses Titanic data to predict whether a passenger survived or died. The project applies machine learning (Random Forest, GaussianNB, and Logistic Regression) and uses Streamlit together with FastAPI to display the prediction results.

docker machine-learning optuna scikit-learn streamlit supervised-machine-learning

Last synced: 08 Jan 2025

https://github.com/rahulb813/mvscr

An alternative movie recommendation system to CineMatchSM, aiming for better predictions. Trained on a Kaggle dataset of over 450,000 entries.

gridsearchcv knn python scikit-learn scipy-sparse svd tensorflow xgboost

Last synced: 02 Jan 2025

https://github.com/deliprofesor/ridge-regression-for-sales-prediction-model-evaluation-and-hyperparameter-tuning

This project builds and optimizes a model on a dataset using Ridge regression and polynomial features. Model accuracy is enhanced through regularization and polynomial transformations. Grid search and cross-validation are used to find the best parameters, and the model's performance is evaluated.

cross-validation data-science data-visualization grid-search machine-learning model-optimization mse overfitting-prevention polynomial-regression python r2-score regression-analysis regularization ridge-regression rmse scikit-learn

Last synced: 30 Dec 2024

https://github.com/eljandoubi/dynamic_risk_assessment_system

ML Model Scoring and Monitoring

os pandas scikit-learn sqlite sys

Last synced: 24 Jan 2025

https://github.com/myahninsi/housing-price-prediction-ml

Final project for Big Data Visualization for Business Communications 01 (DSMM Group 1). Analyzes housing data, identifies key price factors, and builds predictive models using machine learning. Includes Power BI dashboards for interactive visualizations and Flask for deployment.

elastic-net lasso-regression linear-regression matplotlib numpy pandas ridge-regression scikit-learn seaborn

Last synced: 30 Dec 2024

https://github.com/adadalshabab/human-stress-analysis-greadsearch-classifier

The project leverages data from physiological signals, self-reported surveys, behavioral observations, or other relevant sources to infer and analyze stress levels.

classification knn-classification machine-learning machine-learning-algorithms matplotlib pandas scikit-learn

Last synced: 21 Jan 2025

https://github.com/chrispsang/customerchurnanalysis

Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.

customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau

Last synced: 09 Feb 2025

https://github.com/alexliap/sk_serve

Deployment of a scikit-learn model and its column transformations made easy.

machine-learning mlops model-deployment scikit-learn

Last synced: 09 Feb 2025

https://github.com/dmschauer/aws-sagemaker-deployment-test

A simple test of how deploying a machine learning model on AWS SageMaker, and thus turning it into an API, works. Since scikit-learn models require fewer dependencies than, for example, TensorFlow models, I used them for this test, following a tutorial.

aws boto3 python sagemaker scikit-learn

Last synced: 08 Jan 2025

https://github.com/rakibhhridoy/appliedmachinelearninghousing-regression

This project uses the Housing dataset, which contains information about different houses in Boston. The data was originally part of the UCI Machine Learning Repository and has since been removed from it. The data can also be accessed through the scikit-learn library. The objective is to predict house prices using the given features.

deep-learning housing-market housing-prices machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 25 Dec 2024

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 14 Jan 2025

https://github.com/paragon279/cancercellclassificationbyscikitlearn

Classifies cancer cells using scikit-learn

machine-learning python scikit-learn

Last synced: 20 Jan 2025

https://github.com/mahdi-meyghani/movie-recommendation-system

A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.

data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning

Last synced: 09 Feb 2025

https://github.com/darkdk123/customer-churn-prediction-innobytes

Predicting customer churn as an internship project at Innobytes Services.

data-science python scikit-learn streamlit xgboost-classifier

Last synced: 28 Dec 2024

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the Scikit library.

numpy scikit-learn

Last synced: 14 Jan 2025

https://github.com/rishavp15/aivshuman_text

This project lets the user determine whether text entered in a text box is human-generated or computer-generated.

django pandas python scikit-learn

Last synced: 09 Feb 2025

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 09 Feb 2025

https://github.com/aysenurcftc/breast_cancer_streamlit

Breast Cancer Wisconsin Dataset Classifier with Scikit-learn and Streamlit

breast-cancer classification gridsearch scikit-learn streamlit

Last synced: 14 Jan 2025

https://github.com/taeefnajib/predicting-heart-disease-using-logistic-regression

In this project, we trained a model that predicts whether a person will have heart disease or not. We also created a web app using Flask and deployed our application on Render.

classification deployment disease flask heart logistic-regression render scikit-learn sklearn

Last synced: 21 Jan 2025

https://github.com/drtey/tryingml

Hands-on with machine learning's fundamentals

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/sergeimakarovv/energy-data-analytics-ml

Analyzing global data on sustainable energy, predicting CO2 emissions per capita

machine-learning pandas plotly python scikit-learn streamlit

Last synced: 09 Feb 2025

https://github.com/eco786786/spotify-playlist-generator

This project uses K-Means to cluster songs by features such as tempo, genre, and mood. It then creates personalised Spotify playlists based on these clusters, providing dynamic, genre-specific track collections. By integrating the Spotify API, it enables users to explore new music within custom groupings.

flask matplotlib pandas python3 scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/arizdn234/spotify-api-with-colab

Crawling, analyzing, and clustering music data from the Spotify API

machile-learning scikit-learn spotify-api spotipy-library

Last synced: 30 Dec 2024

https://github.com/mkdirer/depression-data-analysis

This project analyzes a Kaggle depression dataset using data preprocessing, clustering, classification, and outlier detection techniques. Python libraries like pandas, numpy, matplotlib, seaborn, and scikit-learn are used to extract insights.

classification clustering matplotlib numpy pandas scikit-learn seaborn vizualization

Last synced: 09 Feb 2025

https://github.com/alejoduarte23/si_bayesianmixturemodel

Implementation of a two-stage fast Bayesian system identification for separated Modes. This repository expands the usage of this technique by adding a mixture model fit to obtain modal parameters from the posterior distribution.

matplotlib numpy scikit-learn scipy

Last synced: 09 Feb 2025

https://github.com/akash-nath29/mnist_test

Handwritten digit recognition API using Scikit-Learn and FastAPI

fastapi machine-learning scikit-learn

Last synced: 09 Feb 2025

https://github.com/shreeparab1890/duplicate-question-predictor

An IPython notebook that builds a model to detect whether a given pair of questions are duplicates.

bag-of-words nlp nlp-machine-learning nltk numpy pandas python random-forest scikit-learn sklearn streamlit

Last synced: 01 Jan 2025