Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
scikit-learn
scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.
- GitHub: https://github.com/topics/scikit-learn
- Wikipedia: https://en.wikipedia.org/wiki/Scikit-learn
- Repo: https://github.com/scikit-learn/scikit-learn
- Created by: David Cournapeau
- Released: January 05, 2010
- Related Topics: scikit, python
- Aliases: sklearn
- Last updated: 2025-02-02 00:26:48 UTC
https://github.com/aravindnathan02/credit-card-fraud-detection
This is a Machine Learning project on classifying fraudulent credit card transactions.
classification-model fraud-detection logistic-regression machine-learning python random-forest scikit-learn
Last synced: 25 Jan 2025
https://github.com/hon777225/autoworth
Used Car Price Prediction (India)
data-analysis data-analysis-python data-analytics data-preprocessing data-science-projects eda fine-tuning gridsearchcv machine-learning matplotlib-pyplot pandas random-forest-regressor scikit-learn seaborn
Last synced: 25 Jan 2025
https://github.com/macdung123/fake-job-post-detection
This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.
classification data-analysis data-science deep-learning job-posting joblib machine-learning matplotlib-pyplot numpy pandas python scikit-learn tf-idf tkinter
Last synced: 25 Jan 2025
https://github.com/samsoumyajitd/food_ai
The AI Food Weather-Based Recommendation System provides personalized food and restaurant suggestions using AI. It uses GenAI and AI techniques like TF-IDF Vectorization, Cosine Similarity, and FuzzyWuzzy for tailored recommendations.
axios cosine-similarity flask flask-cors fuzzywuzzy generative-ai google-generativeai json nlp python-dotenv python3 reactjs scikit-learn sklearn tf-idf vanilla-css weather-api
Last synced: 25 Jan 2025
https://github.com/bjornmelin/ml-algorithm-playground
🧪 Core ML algorithm implementations with GPU acceleration. Featuring optimized implementations across various libraries with comprehensive analysis. 📈
algorithms cuda gpu-computing lightgbm machine-learning python scikit-learn xgboost
Last synced: 25 Jan 2025
https://github.com/12345far/metrics-calculation-precision-recall
Laboratory 7 - Information Retrieval
data-preprocessing educational-project information-retrieval lowercase-conversion punctuation-removal python scikit-learn short-words-filter text-processing tokenization vocabulary-optimization
Last synced: 25 Jan 2025
https://github.com/hassanislam463/nyc_airbnb_eda
This project is a comprehensive data analysis of Airbnb listings in New York City, exploring pricing trends, seasonality effects, host market dynamics, rental preferences, and revenue estimation. It provides valuable insights for hosts, investors, and policymakers to optimize Airbnb operations and understand the short-term rental landscape in NYC.
exploratory-data-analysis matplotlib python scikit-learn seaborn
Last synced: 25 Jan 2025
https://github.com/1adore1/deadlock-match-tracker-bot
Telegram bot for tracking real-time Deadlock matches of top 250 players. Fetches match data and predicts winners using a machine learning model.
aiogram api deadlock pandas python scikit-learn
Last synced: 25 Jan 2025
https://github.com/yelamankarassay/personal-health-wellness-dashboard
A Streamlit-based dashboard for visualizing and analyzing personal daily data—weight, mood, meals, sleep, and more. This project uses pandas, plotly, matplotlib, seaborn, scikit-learn, and wordcloud to present insights about your health and daily habits.
matplotlib pandas plotly scikit-learn seaborn wordcloud
Last synced: 25 Jan 2025
https://github.com/priyanshul28/ml_classification_eda_parkinsonsdisease
A guided Machine Learning Classification exercise on the Parkinson's Disease dataset demonstrating the use of Logistic Regression, Neural Network Classifiers, Decision Trees, Random Forests and XGBoost algorithms, as well as Data Preprocessing and Exploratory Data Analysis.
classification machine-learning pandas python scikit-learn statistics
Last synced: 25 Jan 2025
https://github.com/darshhv/fraud-detection-system
A machine learning project for detecting fraudulent transactions using Random Forest and XGBoost models, with data preprocessing and model evaluation.
data-preprocessing-and-cleaning fraud-detection-using-machine-learning model-evaluation pandas random-forest scikit-learn xgboost
Last synced: 25 Jan 2025
https://github.com/abhivur/connections-ai---tamu-datathon-2024
Contributors: Meet Gamdha, Gaurav Nimmagadda
Last synced: 25 Jan 2025
https://github.com/aryar-06/linear-regression
A Python project demonstrating basic linear regression with gradient descent and matrix operations, alongside scikit-learn comparison.
data-analysis data-preprocessing educational-project gradient-descent linear-regression machine-learning python regression-algorithms scikit-learn
Last synced: 25 Jan 2025
https://github.com/avtorgenii/ml-playground
A repository for exploring and experimenting with datasets, building machine learning models, and testing various techniques in data preprocessing, feature engineering, and model evaluation.
matplotlib ml pandas scikit-learn
Last synced: 25 Jan 2025
https://github.com/ki3mono/naive_bayes_classifier
This project implements Naive Bayes Classifiers for two data types: Multinomial Naive Bayes Classifier and Gaussian Naive Bayes Classifier
iris-dataset mushroom-dataset naive-bayes-classifier numpy python scikit-learn
Last synced: 25 Jan 2025
https://github.com/anshvaid4/ml_practice
This is the new repository, where I have added all the notebooks demonstrating the usage of various transformers and models for Supervised and Unsupervised algorithms
anaconda jupyter-notebook machine-learning machine-learning-algorithms python scikit-learn
Last synced: 25 Jan 2025
https://github.com/luliatuccu/weather_analysis
This project highlights a combination of data science techniques and Python programming to explore real-world weather data.
data-preprocessing eda feature-engineering machine-learning matplotlib numpy pandas regex scikit-learn seab seaborn weather weather-patterns
Last synced: 25 Jan 2025
https://github.com/taquynhnga2001/regression-calories-burnt-prediction
Develop regression models which can predict the total calories a person has burnt during a workout based on some biological measures.
machine-learning python regression-analysis scikit-learn
Last synced: 25 Jan 2025
https://github.com/abhivur/graduate-income-forecaster---aggie-data-science-club-2024
Contributors: Abdussalam Raheem, Chiara Su, and Joseph Botros
matplotlib numpy pandas python scikit-learn seaborn
Last synced: 25 Jan 2025
https://github.com/himanshkr03/loan_default_prediction_using_machine_learning
This repository contains a Python-based project that uses machine learning to predict loan defaults. It explores data preprocessing, feature engineering, and model training techniques to build a predictive model for assessing loan risk.
data-science finance loan-default-prediction machine-learning pandas prediction-model python risk-assessment scikit-learn
Last synced: 25 Jan 2025
https://github.com/ranimeshehata/softmax-regression-on-mnist
A PyTorch-based project for classifying the MNIST dataset using Softmax Regression, including training, validation, results and visualization.
matplotlib mnist python3 pytorch scikit-learn softmax-regression torchvision
Last synced: 25 Jan 2025
https://github.com/manishkumarpatel07/heartattack_risk_prediction
"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.
boruta knn-algorithm matplotlib numpy pandas python scikit-learn
Last synced: 25 Jan 2025
https://github.com/ranimeshehata/feed-forward-neural-network-on-mnist
A PyTorch-based project for classifying the MNIST dataset using Feed Forward Neural Networks, including training, validation, results and visualization.
feedforward-neural-network matplotlib mnist python3 pytorch scikit-learn torchvision
Last synced: 25 Jan 2025
https://github.com/lkethridge/integrated_project_2
Integrated Project 2 from TripleTen
anomaly-detection cross-validation data-analytics data-cleaning-and-preprocessing data-science feature-engineering gold-recovery machine-learning metal-purification model-evaluation pandas portfolio-project python scikit-learn smape supervised-learning
Last synced: 25 Jan 2025
https://github.com/usmana5809/quran-recitation-audio-classification
The Quran Recitation Audio Classification project aims to classify different recitations of the Quran using machine learning techniques. It involves preprocessing audio data, extracting features, training models, and evaluating their performance.
audio-classification classification-model islamic-studies librosa machine-learning python quran scikit-learn
Last synced: 25 Jan 2025
https://github.com/santoshn86/dlp-ev-system-for-pa-optimization
This system is a game-changer, enabling smarter energy management through predictive insights and personalized optimization strategies.
aiml django flask keras pytorch scikit-learn tensorflow typescript
Last synced: 25 Jan 2025
https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation
An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.
clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn
Last synced: 31 Jan 2025
https://github.com/afonsojramos/feup-iart
Projects developed for Artificial Intelligence class.
feup feup-iart iart neural-network python scikit-learn tensorflow
Last synced: 25 Jan 2025
https://github.com/fdauti/sklearn_proj
Data analysis with scikit-learn and other Python libraries
matplotlib pandas scikit-learn seaborn
Last synced: 31 Jan 2025
https://github.com/smaddanki/pattern-pursuit-challenge
A personal challenge to build a production-ready trading signal system for S&P 500 stocks using deep learning. This project progresses from basic ML models to a complete trading infrastructure, focusing on 5-day forward return prediction and signal generation.
deep-learning machine-learning pytorch quantative-trading quantitative-finance quantitative-research scikit-learn
Last synced: 01 Feb 2025
https://github.com/alejandrolara11/machinelearningcourse
Machine Learning Basics: From Setup to Clustering
data-analysis data-science machine-learning numpy pandas plotly preprocessing-data python scikit-learn seaborn streamlit
Last synced: 01 Feb 2025
https://github.com/dadvaiahpavan/ai-data-scientist-
AI-powered tool for dataset analysis, featuring data preprocessing, classification, regression, anomaly detection, and text analysis. Built with scikit-learn, pandas, and Plotly for visualization. Includes an interactive Streamlit web interface for real-time data analysis.
ai anomaly-detection classification data-analysis data-science machine-learning panda plotu regression scikit-learn sentiment-analysis streamlit
Last synced: 01 Feb 2025
https://github.com/nahom32/mlp-assignment
This repository implements a machine learning assignment that demonstrates the machine learning process.
eda logistic-regression machine-learning scikit-learn
Last synced: 01 Feb 2025
https://github.com/bhaveshbhakta/diabetes-prediction
Note: The hosted website link might take some time to load. Please be patient while the application initializes.
diabetes-prediction flask machine-learning python scikit-learn svm web-development
Last synced: 01 Feb 2025
https://github.com/vijaykumarr1452/startup_success_predictor
This project demonstrates the use of Multiple Linear Regression to predict the profits of startups based on investment in R&D, Administration, and Marketing, using the 50_Startups.csv dataset.
machine-learning multi-linear-regression numpy pandas python regression rsquare-values scikit-learn
Last synced: 01 Feb 2025
https://github.com/giatraskon/machine_learning_assignments
Machine learning assignments covering regression, classification, neural networks, adversarial examples, and real-time emotion detection using Python. Includes theoretical insights and practical implementations.
adversarial-examples bayesian-inference bias-variance-tradeoff cifar10 classification deep-learning emotion-recognition iris-dataset k-nearest-neighbours keras machine-learning mnist neural-networks opencv pima-indians-diabetes python regression ridge-regression scikit-learn tensorflow
Last synced: 01 Feb 2025
https://github.com/galaxy092/samsung-innovation-campus-big-data-capstone-project
Samsung Innovation Campus Big Data Capstone Project - Weather Prediction
hadoop jupyter-notebook pandas pyspark scikit-learn sparksql
Last synced: 01 Feb 2025
https://github.com/dukebw/ml-model-selection
Machine learning model selection using Dlib and scikit-learn.
dlib machine-learning ranking scikit-learn
Last synced: 01 Feb 2025
https://github.com/ksasi/smartcab
machine-learning numpy pandas python reinforcement-learning scikit-learn
Last synced: 01 Feb 2025
https://github.com/ksasi/dog-breed-classifier
Dog Breed Classifier
cnn cnn-classification computer-vision deep-learning deep-neural-networks keras keras-neural-networks machine-learning numpy pandas python scikit-learn
Last synced: 01 Feb 2025
https://github.com/nitor-infotech-oss/aiml-data-processing
Data Processing Algorithms
aiml numpy pandas scikit-learn
Last synced: 01 Feb 2025
https://github.com/sanjurajveer/stock_price_prediction
App to predict 10 days price of stocks
keras lstm pandas python scikit-learn streamlit-webapp tensorflow yfinance
Last synced: 01 Feb 2025
https://github.com/gregoritsch3/dl_cv_e2e_potatodiseaseclassification
A guided CodeBasics Deep Learning Project where a Convolutional Model is deployed onto a Website (FastAPI) and Mobile App (React Native, Google Cloud). Its purpose is the classification of potato plant images into "healthy", "Early Blight" and "Late Blight" categories.
cnn-classification gcp model-deployment scikit-learn tensorflow
Last synced: 01 Feb 2025