An open API service indexing awesome lists of open source software.

scikit-learn

scikit-learn is a widely-used Python module for classic machine learning. It is built on top of SciPy.

https://github.com/pooranjoyb/health-bridge

Predicting the disease of a patient from a patient's video (or text) using ML algorithms. The algorithms used in this project are Natural Language Procession, and Random Forest Tree. This Project is under INTEL OneAPI Hackathon 2023

hackathon intel machine-learning nlp oneapi pandas pickle python scikit-learn

Last synced: 04 Sep 2025

https://github.com/azrdev/sklearn-seco

Implementation of the *Separate and Conquer* / *Covering*-Algorithm for scikit-learn

covering machine-learning scikit-learn sklearn

Last synced: 21 Jan 2026

https://github.com/vartikaraj2512/dsml-internship-devtown-notebooks-

🌟 Data Science & Machine Learning Internship Projects 📊 Explore a curated collection of DS & ML notebooks covering topics like regression models, clustering, NLP, and deep learning. Dive into real-world projects such as price prediction, sentiment analysis, and customer segmentation. This repository reflects modern data-driven industry solutions

data-science filehandling googlecolab json kaggle keras machine-learning matplotlib numpy pandas python scikit-learn seaborn sql tensorflow

Last synced: 29 Jan 2026

https://github.com/vidhi1290/robust-yield-prediction-

"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing

Last synced: 15 Apr 2026

https://github.com/csinva/trees-to-networks

Bridging random forests and deep neural networks. Partial implementation of "Neural Random Forests" https://arxiv.org/abs/1604.07143

artificial-intelligence classification decision-tree decision-tree-classifier deep-learning machine-learning machinelearning neural-network neural-networks paper-implementations python pytorch random-forest scikit-learn statistics

Last synced: 12 Apr 2026

https://github.com/ejw-data/ml-myopia

A variety of machine learning techniques used to identify nearsighted patients

cross-validation gridsearchcv imbalanced-classification kmeans knn machine-learning pca pipeline python random-forest scikit-learn svc tensorflow tsne

Last synced: 11 Jul 2025

https://github.com/akash-peace/face-recognition

AJ Face Recognizer project objective is to make a face recognizing model from own dataset of two faces.

facenet keras matplotlib mtcnn npz numpy opencv pickle pillow python3 sav scikit-learn

Last synced: 10 Apr 2026

https://github.com/udityamerit/complete-machine-learning-for-beginners

This repository is structured as a complete ML roadmap combining theory (PDFs) with hands-on coding (Jupyter Notebooks) to help you build a solid foundation in data science and machine learning. Ideal for students, self-learners, and professionals looking to revise or upgrade.

artificial-intelligence classification clustering clustering-algorithm machine machine-learning machinelearning matplotlib matplotlib-figures numpy pandas regression regression-models regressionalgorithms regressionanalysis scikit-learn scikitlearn-machine-learning scipy seaborn tensorflow

Last synced: 15 May 2026

https://github.com/abhiramdodda/rainfall_prediction

Machine Learning model built on Telangana dataset cropped from Indian weather dataset merged with average temperature dataset

numpy pandas python3 scikit-learn scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/vopaaz/learning-utility

Assist small-scale machine learning.

data-science machine-learning pandas python3 scikit-learn

Last synced: 10 Apr 2025

https://github.com/rickiepark/ml-with-python-cookbook-2nd

<실무로 통하는 ML 문제 해결 with 파이썬>

deep-learning machie-learning pytorch scikit-learn

Last synced: 29 Oct 2025

https://github.com/docsallover/movie-recommendation

Content-Based Movie Recommendation System In Python Using Numpy, Scikit-Learn & Pandas

data-science jupyter machine-learning numpy pandas python scikit-learn

Last synced: 08 Apr 2026

https://github.com/pateash/kisanmitra-python

Python Machine learning Utility for Kisanmitra Web App

jupyter-notebook machine-learning python scikit-learn

Last synced: 31 Jul 2025

https://github.com/34j/sklearn-utilities

Utilities for scikit-learn. Append prediction to x, append prediction to x single, append x prediction to x, compose var estimator, data frame wrapper, drop by noise prediction, drop missing rows y, dummy regressor var, estimator wrapper base, excluded column transformer pandas, feature union pandas, id transformer, included column transformer pand

catboost feature-engine feature-engineering multioutput pandas pca python pytorch regression scikit-learn sklearn sklearn-compatible skorch torch tqdm

Last synced: 13 Apr 2025

https://github.com/sayakpaul/floydhub-k_means-blog

Contains the Jupyter Notebook made for a FloydHub article on K-Means

numpy pandas scikit-learn yellowbricks

Last synced: 20 Sep 2025

https://github.com/pythonicshariful/insurance-charge-predictor

This project predicts medical insurance charges based on personal details such as age, gender, BMI, number of children, smoking habits, and region. It uses a Machine Learning model trained on the insurance.csv dataset and provides a Flask web app interface for user input

flask insura machine-learning mlapp python regression scikit-learn

Last synced: 09 May 2026

https://github.com/27ahmad/medicine-recommendation-system

This project aims to create a medicine recommendation system based on symptoms provided by the user. The system is built using machine learning models trained on a dataset of symptoms and their corresponding diagnoses. The frontend is designed using Bootstrap for an intuitive user interface.

bootstrap machine-learning medicine-applications pandas recommendation-system scikit-learn

Last synced: 25 Oct 2025

https://github.com/mantreshkhurana/simple-ml-python-chatbot

This is a simple python chatbot which uses json file to store the data. It uses TF-IDF (Term Frequency-Inverse Document Frequency) vectorizer and a Multinomial Naive Bayes classifier to classify the user input and give the response.

chatbot flask machine-learning ml python scikit-learn sklearn tf-idf

Last synced: 05 Mar 2025

https://github.com/shervinnd/cifar-10-image-classification-with-cnn

Description: Dive into this exciting project that builds a CNN with TensorFlow/Keras to classify CIFAR-10 images into 10 vibrant classes! 📸 It features data preprocessing, model training, stunning ROC curve visualizations 📊, sample predictions, and custom image classification

auc-analysis batch-normalization cifar-10 cnn-model computer-vision convolutional-neural-networks custom-image-prediction deep-learning image-classification image-preprocessing keras machine-learning matplotlib-visualization numpy opencv python roc-curve scikit-learn softmax-classifier tensorflow

Last synced: 03 Sep 2025

https://github.com/m-rishab/research-paper-recommendation

This project aims to build a research paper recommendation system. Given a paper title as input, the system provides the top 5 recommended research papers.

deep-learning flask llm nlp numpy pandas python scikit-learn sentence-transformers tensorflow torch transformers

Last synced: 02 Apr 2026

https://github.com/mantreshkhurana/aetherai

This is a simple python chatbot which uses json file to store the data. It uses TF-IDF (Term Frequency-Inverse Document Frequency) vectorizer and a Multinomial Naive Bayes classifier to classify the user input and give the response.

chatbot flask machine-learning ml python scikit-learn sklearn tf-idf

Last synced: 01 Mar 2026

https://github.com/ugyenn-tsheringg/predicting-loan-defaults-with-deep-learning

A deep learning model to predict loan default probabilities for the African Credit Scoring Challenge.

adam-optimizer deep-learning early-stopping matplotlib numpy pandas relu scikit-learn seaborn tensorflow

Last synced: 02 Jul 2025

https://github.com/magnusax/automl

The project aims to develop a customized ML framework on top of existing libraries

data-science machine-learning machine-learning-algorithms machine-learning-library python scikit-learn

Last synced: 20 Jan 2026

https://github.com/mgobeaalcoba/inferential_statistics_with_python

Explore the world of inferential statistics using Python. Learn hypothesis testing, confidence intervals, and statistical analysis techniques for data-driven decision-making and insights.

google-colab inferential-statistics jupyter-notebook matplotlib numpy pandas python3 scikit-learn seaborn statistics

Last synced: 09 Apr 2026

https://github.com/aadityatamrakar/captcha_ai

ML program to decode text from captcha images.

matplotlib python scikit-learn sklearn

Last synced: 17 Apr 2026

https://github.com/shervinnd/titanic-survival-predictor

🚢 Dive into the Titanic dataset with this ML project! Using TensorFlow, predict passenger survival via neural networks. Features data preprocessing, model training, and visualization with pandas, scikit-learn, and matplotlib. Perfect for beginners in deep learning. 📊💻

classification data-preprocessing data-science data-visualization deep-learning jupyter-notebook machine-learning neural-network pandas predictive-modeling python scikit-learn tensorflow titanic-dataset

Last synced: 16 Sep 2025

https://github.com/rajaprerak/ml_dl_webapp

Machine learning and Deep learning project

flask heroku keras knearest-neighbors python scikit-learn tensorflow

Last synced: 19 Oct 2025

https://github.com/psychomita/intellicv

IntelliCV is an AI-driven platform for efficient and intelligent resume screening.

jupyter-notebook numpy pandas python scikit-learn seaborn streamlit svc-model tfidf-vectorizer

Last synced: 19 Apr 2025

https://github.com/engageintellect/bitcoin-price-predictor

This Python project predicts whether the price of Bitcoin will increase or decrease on the next day, using historical price data and machine learning. Additionally, the project visualizes Bitcoin's price movements using candlestick charts along with moving averages for different timeframes.

bitcoin machine-learning matplotlib mplfinance numpy pandas python scikit-learn visualization yfinance

Last synced: 23 Oct 2025

https://github.com/spockoo/pylegend

A fusion between Python and legend, a name that suggests that the code is both modern and mythical. 3 months of work, with tons of errors to establish the calculations necessary for the superposition, I want to publish my work and improve it and share it under Apache 2.0 License. Designed to work with NBminer!

crypto crypto-tools how-to-farm-crypto kerastuner matplotlib matrix-multiplication mining nbminer numpy performance-optimization pickle project quantum quantumcircuits quantumcomputing qubits scikit-learn tensorflow

Last synced: 27 Jan 2026

https://github.com/zacheddy/issuelabeler

Automatically label issues on GitHub repositories with machine learnin' (demo available)

classifier machine-learning python scikit-learn

Last synced: 22 Mar 2025

https://github.com/pr38/dask_tfidf

A Dask native implementation of 'Term Frequency Inverse Document Frequency' for dask-ml and scikit-learn

dask dask-ml distributed-computing machine-learning python scikit-learn

Last synced: 02 May 2026

https://github.com/bhattbhavesh91/selectkbest-from-scratch

Implementing SelectKBest from Scratch for a Regression problem

feature-selection regression scikit-learn selectkbest

Last synced: 17 Apr 2025

https://github.com/anaconda/intel-green-ai

Code and Experimental Package attached to the article "Greener Machine Learning Computing with Intel AI Acceleration"

anaconda anaconda-environment green-ai machine-learning machine-learning-benchmarks scikit-learn scikit-learn-benchmarks

Last synced: 05 Feb 2026

https://github.com/ffstghc/caco2ml

Main code chunks used for models in the publication "Exploring the Potential of Adaptive, Local Machine Learning (ML) in Comparison ton the Prediction Performance of Global Models: A Case Study from Bayer's Caco-2 Permeability Database"

caco-2 local-models machine-learning pharmacokinetics scikit-learn

Last synced: 18 Oct 2025

https://github.com/jktujq/lumenn

LumeNN is an application that solves problem of binary and multiclass classification of stars with variable luminosity with the usage of different machine learning models.

astronomy classifiers machine-learning maths neural-networks python scikit-learn torch

Last synced: 02 Jul 2025

https://github.com/yancotta/smartcrops-iot-ml-system

An IoT-ML project for smart agriculture: Dual ESP32 nodes (sensor via ESP-NOW, gateway to MQTT/Ubidots) collects temp, humidity, soil moisture data. ML Model analyzes crop yield and real-time plant health classification. Python, Notebooks & C++ for code. Built for FIAP's Phase 5 of the first year of the AI/ML tech degree.

agriculture arduino embedded-systems esp-now esp32 iot machine-learning mqtt python scikit-learn smart-farming

Last synced: 14 Sep 2025

https://github.com/hariprashad-ravikumar/ai-datascience-lab

AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.

ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn

Last synced: 02 Aug 2025

https://github.com/prithivsakthiur/data-board

Data Boards - Visualization of various plots ( Analysis )

data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces

Last synced: 25 Feb 2026

https://github.com/kingabzpro/github-actions-for-machine-learning-beginners

A project on automating ML workflow using scikit-learn pipelines, CML, and GitHub actions.

cml github-actions machine-learning mlops scikit-learn

Last synced: 13 Oct 2025

https://github.com/shreyansh055/time-series-forecasting_055

The Time Series Forecasting Project predicts future trends using historical data with Python, Pandas, and models like ARIMA, LSTM, and Prophet, focusing on scalable, accurate forecasting for business and finance.

lstm matplotlib numpy pandas python scikit-learn seaborn

Last synced: 27 Jan 2026

https://github.com/saheedniyi02/krecommend

A python package for creating content-based text recommender systems on pandas dataframes and SQLAlchemy tables

cosine-similarity flask-sqlalchemy nlp numpy pandas python recommendation-algorithms recommendation-engine recommendation-system recommender-system scikit-learn sql sqlalchemy sqlite3 tfidf-vectorizer

Last synced: 10 Mar 2026

https://github.com/aayushker/DevFoolU

Uncover originality, empower authenticity

django-rest-framework keybert nextjs nltk plagiarism-detection scikit-learn

Last synced: 15 Jul 2025

https://github.com/khaymanii/movie-recommendation-model

This is a model built using Python and Cosine Similarity algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/yaqoah/used-cars-ai

🚗 predicts used car prices using a full ML pipeline

beautifulsoup eda machine-learning pandas regression scikit-learn selenium xgboost

Last synced: 19 Apr 2026

https://github.com/ksalama/gcp-ml-serving

Examples of how to serve ML models on GCP

app-engine dataflow kubernetes machine-learning scikit-learn tensorflow

Last synced: 12 Oct 2025

https://github.com/otakmager/projectml-clusteringweb

This repo is the result of a project assignment for a machine learning course at my university which was assisted by other group members. This project is to create a website that can cluster from the models that have been made. This model was created using the KMeans algorithm with 3 clusters that were trained with the seed dataset

bahasa-indonesia clustering-algorithm flask jupyter-notebook kmeans-clustering numpy pandas pickle python scikit-learn seed-dataset

Last synced: 08 Apr 2026

https://github.com/avrtt/paysage

Pandas add-on library: find data quality issues and clean/improve dataframes in one line using scikit-learn transformer

data-analysis data-cleaning data-compression data-profiling data-quality data-quality-checks data-reporting pandas pandas-dataframe schema-validation scikit-learn scikit-learn-transformer

Last synced: 14 May 2026

https://github.com/tschechlovdev/ml2dac

Implementation of "ML2DAC: Meta-Learning to Democratize AutoML for Clustering Analyses", published at SIGMOD 2023. The paper has awarded the "reproducibility" badge by SIGMOD's reproducibility reviewers.

automl clustering meta-learning paper python reproducible-research scikit-learn

Last synced: 13 Oct 2025

https://github.com/boemer00/netflix

We’re helping Netflix decide what content their users enjoy. By modelling a relationship between features and user scores we can predict how well-received new content will be, before spending on licences-- reducing the risk of buying dud content.

data-engineering machine-learning netflix pipelines python regression scikit-learn

Last synced: 20 May 2026

https://github.com/jagadishdas21/brain-tumor

This repository contains the implementation of a deep learning model to detect brain tumors from MRI images using Convolutional Neural Networks (CNN). The goal of this project is to classify MRI images as either having a brain tumor (Positive) or not having one (Negative).

computer-vision convolutional-neural-networks matplotlib scikit-learn tensorflow

Last synced: 09 Apr 2025

https://github.com/docsallover/music-recommendation

Machine Learning Music Recommendation System: Hybrid Approach (Content & SVD) with Flask

content-based-filtering flask machine-learning matplotlib ml music-recommendation numpy pandas scikit-learn scikit-surprise scipy seaborn surprise svd

Last synced: 07 Apr 2026

https://github.com/hamidurrk/ground-station

Visualization and analysis tool to analyze signal strength data to identify areas with poor network coverage

machine-learning mean-shift network-analysis robotics scikit-learn

Last synced: 18 Apr 2026

https://github.com/mekhyw/facial-emotion-classification

Very fast classification model for facial expressions using Mediapipe facial landmarks, Scikit-learn and OpenCV, as part of the DYNAMO project

keras-tensorflow mediapipe opencv scikit-learn spektral

Last synced: 01 May 2025

https://github.com/aritrakar/data-science-projects

🧪A collection of data science projects and experiments.

machine-learning python scikit-learn

Last synced: 11 Apr 2026

https://github.com/tritonix711/ai-content-verifier

AI Content Verifier is a tool that finds out if text is written by AI or humans. It uses machine learning and natural language processing to give clear results and confidence scores. With an easy-to-use interface, it helps everyone from researchers to content creators check if the content is real or not.

git machine-learning nlp nltk numpy pandas python scikit-learn spacy tkinter

Last synced: 07 Jan 2026

https://github.com/rahamniabdelkaderseifelislem/prodigy_ml_01

Task 1 of the Prodigy InfoTech ML internship which involves building a basic Linear Regression model.

linear-regression machine-learning machine-learning-models predictive-modeling python scikit-learn

Last synced: 25 Jan 2026

https://github.com/oneapi-src/intelligent-indexing

AI Starter Kit for Intelligent Indexing of Incoming Correspondence using Intel® Extension for Scikit-learn*

machine-learning scikit-learn

Last synced: 04 Apr 2025

https://github.com/danielendler/datason

A comprehensive Python package for intelligent serialization that handles complex data types with ease, especially ML/AI workflows.

ai api-development data-persistence data-science deserialization json machine-learning ml numpy pandas python pytorch scikit-learn serialization tensorflow workflow-automation

Last synced: 06 Mar 2026

https://github.com/bdrupieski/anagramfinder

Java app to find anagrams in the Twitter firehose

anagrams java jdbi postgres python scikit-learn sql twitter-bot twitter4j

Last synced: 14 Apr 2026

https://github.com/sn1027/hand-gesture-recognition-using-machine-learning-and-mediapipe

This Hand gesture recognition project using mediapipe is developed to recognize various hand gestures. The user can custom train any number of various hand gestures to train a model.

computer-vision customization custommodels handgesture-recognition machine-learning mediapipe modeltraining scikit-learn scikitlearn-machine-learning

Last synced: 02 Feb 2026

https://github.com/meyiapir/nlu-api

Web API for accessing NLU models. With tools for training models.

api chat fastapi learning machine-learning nlu python ru russian-language scikit-learn

Last synced: 15 Feb 2026

https://github.com/an-exodus/dubai-real-estate-price-prediction-ml

This repository contains a comparative analysis of machine learning algorithms to predict real estate prices in Dubai. Using data from Bayut, we evaluate Decision Tree, Linear Regression, Random Forest, and Gradient Boosting models based on their predictive accuracy.

decision-tree gradient-boosting linear-regression machine-learning random-forest scikit-learn

Last synced: 07 Apr 2025

https://github.com/adityajn105/kagglesolvedproblem

Data Science problem on Kaggle Solved by me. Keep improving Accuracy.

kaggle kaggle-competition numpy pandas python scikit-learn tensorflow

Last synced: 12 Mar 2026

https://github.com/raihan4520/ml

A collection of machine learning projects showcasing various algorithms and techniques, including a final project for the Machine Learning course at AIUB.

data-preprocessing jupyter-notebook machine-learning model-evaluation numpy pandas python scikit-learn

Last synced: 11 Jan 2026

https://github.com/alextanhongpin/spam-api

Microservices for spam filtering system

python scikit-learn

Last synced: 21 Apr 2026

https://github.com/shourya1997/programming-language-detector

The Software will enable the user to detect the programming language without the help of file extension.

machine-learning python scikit-learn scikitlearn-machine-learning

Last synced: 29 Apr 2025

https://github.com/priyanshulathi/url-based-web-phishing-detection-model

A machine learning project for detecting phishing URLs by extracting key features from web addresses and applying various classification algorithms to identify potential threats effectively.

decision-tree knn logistic-regression machine-learning python random-forest scikit-learn

Last synced: 11 Feb 2026

https://github.com/pr38/socraticbumpsearch

A scikit-learn compatible implementation of Bumping as described by “Elements of Statistical Learning” second edition (290-292).

bumping machine-learning python scikit-learn scikit-learn-api

Last synced: 06 May 2026

https://github.com/ahmedshahriar/telco-customer-churn-prediction-streamlit-app

This streamlit app predicts the churn rate using Gradient Boosting models (XGBoost, Catboost, LightGBM) on IBM Customer Churn Dataset

binary-classification binary-classifiers data-science jupyter-notebook machine-learning pandas python scikit-learn sklearn stacking-ensemble streamlit streamlit-webapp

Last synced: 13 Apr 2025

https://github.com/plantaest/feverfew

Comprehensive link checker tool for Wikipedia

aws-lambda caddy java mantine onnx python quarkus react scikit-learn typescript

Last synced: 12 Apr 2025

https://github.com/oskar-j/thresher

Thresher - THRESHold EvaluatoR for Python

automl fine-tuning machine-learning python scikit-learn

Last synced: 02 May 2026

https://github.com/ymorsi7/hatespeechnlp

Detecting and analyzing hate speech on videos relating to sexism on a right-wing platform (NLTK, scikit-learn, pandas).

decision-tree-classifier nlp nlp-machine-learning nltk-python pandas scikit-learn tf-idf

Last synced: 13 Apr 2026