Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/mgobeaalcoba/linear_algebra_for_machine_learning

Explore fundamental linear algebra concepts essential for machine learning in this repository, with code examples and explanations. Get a solid foundation for ML!

machine-learning matplotlib numpy pandas python3 scikit-learn scipy seaborn

Last synced: 20 Jan 2025

https://github.com/dinhanhx/determination

Scripts to set global random for some machine learning framework

determination deterministic keras pytorch randomness scikit-learn tensorflow2

Last synced: 28 Jan 2025

https://github.com/ansh-info/industrial-scale-penicillin-simulation

Optimizing industrial-scale penicillin production using machine learning and data analysis.

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 28 Jan 2025

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighboursClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 29 Jan 2025

https://github.com/hayatoy/gcpml-notebook

Dockerfile with Jupyter Machine Learning environment plus Google Cloud SDK

dockerfile google-cloud-platform jupyter scikit-learn tensorflow

Last synced: 02 Jan 2025

https://github.com/kr1shnasomani/greendetect

Detection of plant diseases using CNN (ResNet50 architecture)

computer-vision deep-learning keras neural-network numpy opencv pandas scikit-learn seaborn tensorflow

Last synced: 19 Dec 2024

https://github.com/kostasereksonas/ids_test

Code for intrusion detection system based on "Intrusion Detection System Using Machine Learning Algorithms" tutorial on Geeksforgeeks and Intrusion Detection on NSL KDD Github repository.

ids intrusion-detection intrusion-detection-system nsl-kdd-dataset numpy pandas python scikit-learn tensorflow

Last synced: 16 Jan 2025

https://github.com/shliakhovai/numpy-neural-network

A simple neural network implemented from scratch using NumPy.

machine-learning neural-network numpy preceptron scikit-learn

Last synced: 12 Jan 2025

https://github.com/flysirin/adstextclassification

Classification of advertisements by topic

docker excel flask pandas python pytorch scikit-learn

Last synced: 22 Jan 2025

https://github.com/g-eoj/cv-tl-keras

Use the cross validation functions from scikit-learn to evaluate image classification transfer learning with Keras models.

cross-validation keras numpy scikit-learn transfer-learning

Last synced: 21 Jan 2025

https://github.com/priyanshulathi/url-based-web-phishing-detection-model

A machine learning project for detecting phishing URLs by extracting key features from web addresses and applying various classification algorithms to identify potential threats effectively.

decision-tree knn logistic-regression machine-learning python random-forest scikit-learn

Last synced: 22 Jan 2025

https://github.com/akashshnkr/multi-disease-prediction

Developed and integrated three machine learning models for predicting diabetes, Parkinson's, and heart disease into a Streamlit-based web application. The interface allows users to input data and receive accurate health predictions, enhancing early detection and healthcare outcomes.

logistic-regression machine-learning-algorithms numpy pandas python scikit-learn streamlit-webapp svm

Last synced: 22 Jan 2025

https://github.com/aarryasutar/hate_speech_detection

This project aims to detect hate speech on Twitter using advanced NLP and machine learning techniques, exploring feature extraction methods like TF-IDF and sentiment analysis, and evaluating models such as Logistic Regression and SVM.

confusion-matrix doc2vec gensim logistic-regression matplotlib naive-bayes nltk numpy pandas python random-forest scikit-learn seaborn stemming stopwords-removal svm tf-idf-vectorizer tokenization vader word-cloud

Last synced: 12 Jan 2025

https://github.com/jesly-joji/house-price-prediction

House Price Prediction using Linear Regression with Scikit-learn and Flask

flask regression scikit-learn

Last synced: 22 Jan 2025

https://github.com/idaraabasiudoh/knn-customer-classification

Labels telecommunication customer base to respective groups to determine service type required for each customer.

data-analysis jupyter-notebook machine-learning pyhton3 scikit-learn

Last synced: 14 Jan 2025

https://github.com/chrislemke/scikit-tabtrans

TabTransformer ready for Scikit learn 🧑‍🔬

deep-learning machine-learning python scikit-learn transformer

Last synced: 09 Nov 2024

https://github.com/yvesemmanuel/machine_learning

Implements data problems solved with machine learning algorithms.

data-science keras keras-tensorflow linear-algebra machine-learning neural-network python scikit-learn

Last synced: 16 Jan 2025

https://github.com/george-gca/ai_papers_search_tool

Automatic paper clustering and search tool by fastext from Facebook Research

fasttext fasttext-embeddings fasttext-python nlp python scikit-learn

Last synced: 14 Jan 2025

https://github.com/jordandeklerk/pygridge

A scikit-learn compatible Python package for data-driven group regularized ridge regression

python regression regularized-regression scikit-learn

Last synced: 19 Dec 2024

https://github.com/haloapping/ml-with-me

Kalo dengar istilah ML, biasanya rada ambigu. Soalnya punya beberapa kepanjangan, seperti Mobile Legend, Makan Lontong, dan lain-lain. Tapi pada repo ini membahas Machine Learning :)

ml pusing python3 scikit-learn stress tau-ah-gelap

Last synced: 06 Jan 2025

https://github.com/7irelo/regressor-model

Video Game Sales Prediction using Decision Tree Regression

joblib matplotlib numpy pandas scikit-learn

Last synced: 13 Jan 2025

https://github.com/haloapping/ml-workflow

Template alur kerja machine learning.

mahine-learning numpy pandas python3 scikit-learn

Last synced: 06 Jan 2025

https://github.com/ayberkyavuz/ml_model_server_docker_deployment

This repository is for containing source codes of machine learning model server deployment.

deployment docker flask machine-learning model python random-forest scikit-learn

Last synced: 06 Dec 2024

https://github.com/lechemi/machine-learning-vademecum

Un notebook contenente nozioni di base ed esempi pratici in python sul machine learning.

machine-learning python scikit-learn

Last synced: 19 Dec 2024

https://github.com/colinwu0403/heartbpmusic

Music discovery platform that recommends you a song based on your heart's BPM and your mood using Machine Learning.

django neurokit2 scikit-learn spotify-web-api vuejs

Last synced: 22 Jan 2025

https://github.com/bhimrazy/iris-species-prediction-using-decision-tree-algorithm-grip

Iris Species Intelligence: Classifying Iris Species with Confidence using Decision Trees | The Sparks Foundation: GRIP

decision-tree-classifier fastapi gripjan23 machine-learning python scikit-learn sparkfoundation

Last synced: 17 Jan 2025

https://github.com/kookmin-sw/capstone-2023-29

자리있어? - 경기도 광역버스 좌석예측 시스템

fastapi lstm postgresql python3 pytorch react scikit-learn sqlalchemy

Last synced: 13 Nov 2024

https://github.com/mpolinowski/isometric-mapping

Non-linear dimensionality reduction through Isometric Mapping

isomap matplotlib-pyplot python scikit-learn

Last synced: 28 Jan 2025

https://github.com/nirmalyabag20/crop-yield-prediction-using-machine-learning

This project uses machine learning to predict crop yields based on factors like region, crop type, rainfall, temperature, and pesticide use. By analyzing a dataset of over 28,000 records, the models provide accurate yield forecasts, helping optimize farming decisions and resource management, ultimately contributing to sustainable agriculture.

jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 19 Dec 2024

https://github.com/kohlerhector/trex-tree-reward-exploration

Using Tree estimators of the MDP models to then count leaves grouping similar transitions and do count-based exploration.

decision-trees drl exploration rl scikit-learn stable-baselines3

Last synced: 30 Dec 2024

https://github.com/udityamerit/all_mlpackages

The `all_MLpackage` is a Python tool for breast cancer detection using scikit-learn's machine learning algorithms. It automates model training, generates accuracy comparisons, and helps identify the best-performing algorithm for diagnosis.

classification knn-algorithm logistic-regression machine-learning numpy pandas scikit-learn svm-model

Last synced: 31 Oct 2024

https://github.com/pockerman/tech3python

Collection of Python based algorithms on numerics, statistics, control etc

algorithms control estimation kalman-filter machine-learning numerical-methods particle-filter python3 scikit-learn statistics

Last synced: 06 Jan 2025

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 08 Dec 2024

https://github.com/rcghpge/pymo

A framework in Mojo for AI/ML/DL applications and other domains.

deep-learning machine-learning magic-cli modular mojo mojo-language pymo python scikit-learn

Last synced: 21 Jan 2025

https://github.com/myahninsi/customer-segmentation-recommendation-ml

This project addressed challenges in understanding customer behavior and personalizing shopping experiences for an e-commerce platform. Developed ML solutions including K-Means clustering for segmentation, Random Forest regression for CLV prediction, and collaborative filtering for product recommendations.

collaborative-filtering k-means-clustering pandas python random-forest scikit-learn

Last synced: 30 Dec 2024

https://github.com/csengupta1101/housing-price-prediction

Based on given features , this project deals with the prediction of house prices. From data extraction till deployment , everything is detailed in here.

jupyter-notebook matplotlib numpy pandas python python3 scikit-learn vscode

Last synced: 29 Dec 2024

https://github.com/josancamon19/boston_housing

Predicting Boston Housing Prices for Udacity Machine Learning Nanodegree

boston-housing-price-prediction machine-learning machine-learning-nanodegree scikit-learn udacity

Last synced: 28 Dec 2024

https://github.com/belzebu013/prever_nivel_colesterol

Projeto de IA com algoritmo de Regressão Linear múltipla para prever o nível de colesterol de um individuo.

ia jupiter-notebook pandas python regressao-linear-multipla scikit-learn

Last synced: 19 Dec 2024

https://github.com/sundarmd/breast-cancer-detection

Breast-Cancer-Detection is a machine learning project that utilizes logistic regression to predict whether a tumor is benign or malignant based on the Breast Cancer Wisconsin (Diagnostic) dataset. The project demonstrates data preprocessing, model training, and evaluation using the `scikit-learn` library.

logistic-regression machine-learning python scikit-learn

Last synced: 30 Dec 2024

https://github.com/wlopezm-unal/titanic_ship-streamlit

Machine Learning model, where using titanic ship data and see if is be able to predict if a passager was salved or died. This apply use Machien learning (Random Forest, gassianNB and Logistic Regressión) . Further, using streamlit together to FastApi be able to see the predict result

docker machine-learning optuna scikit-learn streamlit supervised-machine-learning

Last synced: 08 Jan 2025

https://github.com/rahulb813/mvscr

An alternative movie recommendation system for the CineMatchSM with better predictions. Trained with a Kaggle dataset of over 450000+ entries.

gridsearchcv knn python scikit-learn scipy-sparse svd tensorflow xgboost

Last synced: 02 Jan 2025

https://github.com/rohra-mehak/sciencesync

System for Personalized Google Scholar Alerts Processing and Data Management, and provision of ML based clustering analysis

agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3

Last synced: 28 Dec 2024

https://github.com/drkbluescience/wids2024_challenge2_metastaticdiagnosisregression

This notebook presents an exploratory data analysis (EDA) and regression modeling approach for the WiDS Datathon 2024 Challenge #2.

catboost data data-visualization ensemble-learning exploratory-data-analysis gradient-boosting imputation-methods lgbm machine-learning scikit-learn women-in-data-science

Last synced: 19 Dec 2024

https://github.com/deliprofesor/ridge-regression-for-sales-prediction-model-evaluation-and-hyperparameter-tuning

This project builds and optimizes a model on a dataset using Ridge regression and polynomial features. Model accuracy is enhanced through regularization and polynomial transformations. Grid search and cross-validation are used to find the best parameters, and the model's performance is evaluated.

cross-validation data-science data-visualization grid-search machine-learning model-optimization mse overfitting-prevention polynomial-regression python r2-score regression-analysis regularization ridge-regression rmse scikit-learn

Last synced: 30 Dec 2024

https://github.com/abdulshaikh55/ml-involuntary-denied-boarding

A machine learning model that predicts whether you will be denied onboarding your plane.

first-timers ipynb machine-learning scikit-learn

Last synced: 19 Dec 2024

https://github.com/official-biswadeb941/clopimedi---your-heart-s-trusted-care

ClopiMedi is an AI-driven healthcare application that simplifies doctor appointment bookings, offering personalized recommendations based on medical conditions to enhance patient-provider connections.

adam ai flask flask-api flask-api-backend full-stack-web-development joblib machine-learning scikit-learn tensorflow

Last synced: 31 Oct 2024

https://github.com/myahninsi/housing-price-prediction-ml

Final project for Big Data Visualization for Business Communications 01 (DSMM Group 1). Analyzes housing data, identifies key price factors, and builds predictive models using machine learning. Includes Power BI dashboards for interactive visualizations and Flask for deployment.

elastic-net lasso-regression linear-regression matplotlib numpy pandas ridge-regression scikit-learn seaborn

Last synced: 30 Dec 2024

https://github.com/adadalshabab/human-stress-analysis-greadsearch-classifier

The project leverages data from physiological signals, self-reported surveys, behavioral observations, or other relevant sources to infer and analyze stress levels.

classification knn-classification machine-learning machine-learning-algorithms matplotlib pandas scikit-learn

Last synced: 21 Jan 2025

https://github.com/dmschauer/aws-sagemaker-deployment-test

I did a simple test to see how deploying a machine learning model on AWS Sagemaker and thus turning it into an API works. Since scikit-learn models require less dependencies than e.g. TensorFlow models I went with them for this test. To do so I used a tutorial.

aws boto3 python sagemaker scikit-learn

Last synced: 08 Jan 2025

https://github.com/fgebhart/handson-ml

hands-on machine learning notebooks collection

jupyter-notebook machine-learning scikit-learn

Last synced: 13 Jan 2025

https://github.com/emmarhoffmann/starsort

A multi-model machine learning project that achieves up to 100% accuracy in classifying star types using five different AI algorithms

decision-tree naive-bayes neural-network python random-forest scikit-learn svm

Last synced: 19 Dec 2024

https://github.com/rakibhhridoy/appliedmachinelearninghousing-regression

Let's take the Housing dataset which contains information about different houses in Boston. This data was originally a part of UCI Machine Learning Repository and has been removed now. We can also access this data from the scikit-learn library. The objective is to predict the value of prices of the house using the given features.

deep-learning housing-market housing-prices machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 25 Dec 2024

https://github.com/ericmaddox/crime-analyst-ai

AI-powered Python project that analyzes crime data, predicts trends using advanced models, and visualizes insights on interactive maps with validation for accuracy and reliability.

ai artifical-intelligence crime-analysis crime-data crime-data-analysis crime-prediction crime-statistics ollama ollama-api python python3 scikit-learn

Last synced: 08 Jan 2025

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 14 Jan 2025

https://github.com/eljandoubi/dynamic_risk_assessment_system

ML Model Scoring and Monitoring

os pandas scikit-learn sqlite sys

Last synced: 24 Jan 2025

https://github.com/theengineerprogrammer/kmeans-medoids

Changing cluster centers from centroids to medoids for kmeans

clustering k-means k-medoids kmeans kmedoids scikit-learn

Last synced: 19 Dec 2024

https://github.com/pierrealexandre78/deathpredict

Predict Hospital mortality rate using Machine Learning for patients admitted in ICU (Intensive Care Unit)

healthcare hospital machine-learning predictions python random-forest-classifier scikit-learn xgboost-classifier

Last synced: 13 Jan 2025

https://github.com/darkdk123/customer-churn-prediction-innobytes

Predicting Customer churns as an Internship project at Innobytes services.

data-science python scikit-learn streamlit xgboost-classifier

Last synced: 28 Dec 2024

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the Scikit library.

numpy scikit-learn

Last synced: 14 Jan 2025

https://github.com/aysenurcftc/breast_cancer_streamlit

Breast Cancer Wisconsin Dataset Classifier with Scikit-learn and Streamlit

breast-cancer classification gridsearch scikit-learn streamlit

Last synced: 14 Jan 2025

https://github.com/taeefnajib/predicting-heart-disease-using-logistic-regression

In this project, we trained a model that predicts whether a person will have heart disease or not. We also created a web app using Flask and deployed our application on Render.

classification deployment disease flask heart logistic-regression render scikit-learn sklearn

Last synced: 21 Jan 2025

https://github.com/drtey/tryingml

Hands on Machine Learning´s fundamentals

jupyter-notebook machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 21 Jan 2025

https://github.com/eco786786/spotify-playlist-generator

This project uses machine learning to cluster songs by features like tempo, genre and mood with K-Means. It then creates personalised Spotify playlists based on these clusters, providing dynamic, genre specific track collections. Integrating the Spotify API, it enables users to explore new music within custom groupings.

flask matplotlib pandas python3 scikit-learn seaborn

Last synced: 21 Jan 2025

https://github.com/arizdn234/spotify-api-with-colab

Crawling, Analyzing, Clustering music data from Spotify API

machile-learning scikit-learn spotify-api spotipy-library

Last synced: 30 Dec 2024

https://github.com/paragon279/cancercellclassificationbyscikitlearn

Classifies cancer cell using Scikit-learn

machine-learning python scikit-learn

Last synced: 20 Jan 2025

https://github.com/shreeparab1890/duplicate-question-predictor

The ipython notebook is working to build a model which will detect duplicate questions if two questions pair are given.

bag-of-words nlp nlp-machine-learning nltk numpy pandas python random-forest scikit-learn sklearn streamlit

Last synced: 01 Jan 2025

https://github.com/miguellopezvirues/azure_keyword_cpc

Development and deployment of simple regression model in Azure Machine Learning.

azureml deplyment machine-learning mlflow pandas scikit-learn

Last synced: 24 Dec 2024

https://github.com/mnj-tothetop/english-handwritten-characters-recognizer

A handwritten english character recognizer [0-9, A-Z, a-z] made by using a Dataset of 3409 images. Tensorflow, Keras, Scikit-learn, and OpenCV was used to implement the Convolution Neural Network (CNN). Matplotlib and Seaborn were used to visualize the data.

artificial-intelligence convolutional-neural-networks keras matplotlib opencv-python scikit-learn seaborn tensorflow

Last synced: 21 Jan 2025

https://github.com/bkamapantula/discover

Code search utility to assist developer workflows via code discovery. Currently uses TF-IDF estimator.

developer-tools python scikit-learn tf-idf

Last synced: 06 Dec 2024

https://github.com/mitchmedeiros/mlcompare

Quickly compare machine learning models across libraries and datasets

huggingface-datasets kaggle openml pytorch scikit-learn xgboost

Last synced: 10 Oct 2024

https://github.com/mpoojithavigneswari/sentiment-analysis

The primary goal of this project is to build a sentiment analysis model that can predict the sentiment of a given review (positive or negative).

deep-learning keras machine-learning nlp python rnn-lstm scikit-learn tensorflow

Last synced: 10 Oct 2024

https://github.com/i30101/mathworks2024

Coding tools for 2024 MathWorks Math Modeling Challenge

machine-learning mathematical-modelling python scikit-learn

Last synced: 12 Jan 2025