An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
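As a minimal sketch of what that description means in practice, the snippet below builds a small two-dimensional array and applies a few of NumPy's array functions to it. The array values and variable names are made up for illustration.

```python
import numpy as np

# A small 2-D array (2 rows, 3 columns); the values are illustrative only.
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

col_means = a.mean(axis=0)  # mean of each column
root = np.sqrt(a)           # element-wise square root
product = a @ a.T           # matrix product, shape (2, 2)
```

Each operation works on the whole array at once, without an explicit Python loop.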

https://github.com/amritendunath/appointmate_1.0

A LangGraph-powered multi-agent/supervisor system for appointment scheduling. It enables users to interact in natural language to check doctor availability, book, reschedule, or cancel appointments.

fastapi langchain langgraph nltk numpy pandas psycopg2 python sqlalchemy uvicorn websocket

Last synced: 07 Apr 2026

https://github.com/vishalgaud17/stroke

A simple Streamlit web app that predicts stroke risk based on user input features like age, BMI, glucose level, and lifestyle factors, using a pre-trained machine learning model.

machine-learning numpy pandas python scikit-learn streamlit

Last synced: 14 Apr 2026

https://github.com/shoaib1522/introduction-to-data-science

Unlock the world of data science with hands-on tutorials, interactive notebooks, and essential resources! Whether you're a beginner or looking to refine your skills, this repo offers a comprehensive guide to mastering Python, Git, NumPy, Pandas, and more—empowering you to analyze, visualize, and transform data like a pro! 🚀📊💻

confusion-matrix github-config kaggle matplotlib numpy pandas python3 script visualization

Last synced: 14 Apr 2026

https://github.com/gurramankit/censusproject_ml_randomforest

The objective of this project is to build a classification model using the Census Income dataset from the UCI Machine Learning Repository. The model predicts whether an individual's income exceeds $50,000 per year, based on their demographic and employment-related attributes.

matplotlib numpy pandas python random-forest scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/thekartikeyamishra/colorextractor

A Python-based project that extracts the dominant colors from an image using the KMeans clustering algorithm. This project is useful for applications like image processing, color palette generation, or analyzing the most prominent colors in images.

kmeans-clustering matplotlib matplotlib-pyplot numpy p pillow python sklearn

Last synced: 05 May 2026

https://github.com/rscr1/summer_practice_mai_2

Solutions to all the summer practice assignments

matplotlib numpy pandas python scipy sympy

Last synced: 14 Apr 2026

https://github.com/sabdikay/analysis-of-biodiversity

This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.

data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/paulokarabyna/ml-foundations-day1

🔧 Set up a Python environment and practice linear algebra with NumPy while plotting vectors and reflections in this Day 1 ML foundations starter repository.

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy terminal-mac tutorial visualization

Last synced: 18 May 2026

https://github.com/adi3042/diamond_price_prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupyter-notebook matplotlib numpy pandas price-prediction readme regressor sckiit-learn setuptools venv

Last synced: 14 Apr 2026

https://github.com/badranalyst/time-series-analysis-of-global-trends-in-diet-gym-and-finance

This project analyzes global trends in diet, gym, and finance over time using time series data. The analysis is performed using Python libraries like Pandas, Matplotlib, and Seaborn to visualize trends and identify patterns in these sectors across various countries.

data-analysis dataset matplotlib-pyplot numpy pandas python seaborn time-series

Last synced: 14 Apr 2026

https://github.com/ljadhav25/false-alarm-detection

The data was first preprocessed with analysis libraries such as NumPy and Pandas to make it ready for use by a machine learning algorithm. Standard scaling, categorical data, and missing values were handled with appropriate techniques. Then, we used a LogisticRegression model.

flask flask-api logestic-regression numpy pandas pkl pycharm-ide python

Last synced: 19 Jan 2026

https://github.com/venkat-a/python-eda-exploratory-data-analysis-

This project focuses on conducting exploratory data analysis (EDA) using Python. Utilizing powerful libraries such as Pandas for data manipulation and Plotly for interactive visualization, the project aims to uncover insights from datasets through comprehensive analysis and visualization techniques.

descriptive-statistics matplotlib numpy pandas seaborn statistical-analysis visualization

Last synced: 07 May 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/priyanshulathi/air-quality-index-prediction

Machine learning based air quality index prediction using environmental and pollutant data to classify and forecast pollution levels.

machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 19 Jan 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/ramchaik/cinebrain

CineBrain uses ML & NLP to analyze movies and recommend similar ones based on user preferences through cosine similarity.

cosine-similarity flask htmx kaggle machine-learning movie-recomendation-system nlp nltk numpy pandas sklearn tailwind tmdb tmdb-api unsupervised-learning vectorization

Last synced: 19 Jan 2026
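The CineBrain entry above mentions ranking movies by cosine similarity. A minimal sketch of that measure in plain NumPy follows; the function name and the feature vectors are hypothetical and are not taken from the project.

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero 1-D vectors."""
    u = np.asarray(u, dtype=float)
    v = np.asarray(v, dtype=float)
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# Hypothetical feature vectors for three movies (illustrative values only).
movie_a = [1, 0, 1, 1]
movie_b = [1, 0, 1, 0]
movie_c = [0, 1, 0, 0]

sim_ab = cosine_similarity(movie_a, movie_b)  # shares two features with movie_a
sim_ac = cosine_similarity(movie_a, movie_c)  # shares none
```

A higher value means the two vectors point in a more similar direction, so here movie_b would rank above movie_c as a recommendation for someone who liked movie_a.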

https://github.com/lorenzorottigni/ml-breast-cancer

Machine Learning Python bootcamp: Support Vector Machines using breast cancer dataset

ipynb machine-learning numpy pandas python scikit-learn seaborn support-vector-machines

Last synced: 14 Apr 2026

https://github.com/soumyapro/parkinson-disease-prediction

This project predicts Parkinson's disease using machine learning models.

logistic-regression numpy pandas scikit-learn svc xgboost

Last synced: 19 Jan 2026

https://github.com/sudarshanc00/smishing

This project aims to classify text messages to detect potential smishing (SMS phishing) attacks. Using machine learning, the project provides a classifier that can differentiate between legitimate messages and smishing attempts, helping to prevent scams.

nltk numpy pandas python scikit-learn scipy

Last synced: 14 Apr 2026

https://github.com/charancholaveti/imdb-sentimentanalysis

IMDB Movie Review Sentiment Analysis: reports whether the sentiment of a review is positive or negative.

deeplearning keras numpy rnn-tensorflow tensorflow

Last synced: 14 Apr 2026

https://github.com/chaakshay/heartdrive

A Streamlit-based tool that analyzes cardiovascular health data, predicts population risk using ML, and suggests targeted government actions like awareness campaigns, health checkups, and policy changes.

csv mathplotlib numpy pandas pandas-dataframe python scikit-learn seaborn streamlit

Last synced: 05 Apr 2026

https://github.com/dhruvrajsinhzala24/house-price-predictor-ml-model

A machine learning project to predict house prices using linear regression and random forest models. The project leverages housing data to train and evaluate models, comparing their performance using metrics like MSE and R². Visualizes actual vs. predicted prices to assess model accuracy.

linear-regression machine-learning matplotlib ml numpy pandas python random-forest-regression sklearn

Last synced: 14 Apr 2026

https://github.com/ricky9667/taiwanoceantrashpredict

A very simple DNN project using Keras.

dnn keras numpy python

Last synced: 14 Apr 2026

https://github.com/ashutosh65000/invisible-cloak

Have you watched the Harry Potter movie? The Invisible Cloak scene inspired me to write this small program.

image-processing numpy opencv-python

Last synced: 19 Apr 2026

https://github.com/gayatri018/ab-testing-marketing-campaign

This project involves conducting A/B testing to evaluate the performance of two marketing campaigns, Facebook Ads and Google AdWords, for an online retail business. The primary goal is to determine which campaign is more effective in driving user engagement and conversions, helping the company optimize its marketing strategies.

ab-testing linear-regression matplotlib numpy pandas python scipy seaborn sklearn-

Last synced: 10 Apr 2026

https://github.com/jonad/quora_w2v

In-domain word embedding using Quora dataset.

gensim-word2vec matplotlib numpy pandas python3

Last synced: 11 Apr 2026

https://github.com/dev-dur/student_exam_analysis

This is a beginner-friendly data analysis mini-project using NumPy, Pandas, and Matplotlib. The project generates random student exam scores, analyzes them, and visualizes the results.

matplotlib-pyplot numpy pandas python

Last synced: 16 Apr 2026

https://github.com/sayidmushtaq/reportcard

Tracking and analyzing student grades. It calculates averages, identifies top performers, and generates performance reports with ease.👨🏼‍🎓👨🏼‍🎓

begginer-project numpy reporting-tool

Last synced: 27 Jan 2026

https://gitlab.com/DLR-SY/boxbeam

Boxbeam is a legacy Fortran tool translated to Python. It calculates effective beam properties of composite cross sections composed of rod-like elements.

fortran numpy python

Last synced: 12 Oct 2025

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 21 Jan 2026

https://github.com/khushi130404/regulexa

Regulexa is a Python project that showcases and compares Ridge, Lasso, and Elastic-Net regularization techniques in machine learning. It includes visualizations and performance insights to help prevent overfitting and improve model generalization.

elastic-net-regression lasso-regression numpy ridge-regression

Last synced: 17 May 2026

https://github.com/abhishekyadav915/multiple_disease_prediction_using_ml

Developed a web-based application for predicting the likelihood of multiple diseases using machine learning models with accuracy up to 95%.

colab-notebook logistic-regression numpy pandas python3 sklearn spyder-python-ide streamlit-web streamlitcloud svm-classifier

Last synced: 05 May 2026

https://github.com/helabenkhalfallah/art-bloom

ArtBloom is a backend application designed for art enthusiasts and researchers.

analytics backend-api numpy pandas-python python recommendationsystem

Last synced: 22 Jan 2026

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupyter notebook to predict whether passengers survived the tragic Titanic disaster.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://github.com/avik-pal/kaggle-titanic

Predicting whether a given set of people survive on the Titanic

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 14 Apr 2026

https://github.com/hoyirul/flask-starter-fs

Flask-Starter-FS is a concise full-stack starter kit for Flask web development, offering quick setup and pre-configured settings for seamless integration of front-end and back-end components. Accelerate your full-stack web application development with Flask-Starter-FS.

flask machine-learning numpy pandas python sklearn

Last synced: 14 Apr 2026

https://github.com/hansalemaos/locate_pixelcolor_cython_multi_auto_compile

Detects colors in images 8x faster than NumPy. Uses Cython and returns a dict.

bot c color colors colour colours cpp cython fast find multiprocessing numpy numpy-array opencv rgb rgba search

Last synced: 24 Feb 2026

https://github.com/grand-27-master/fingers-tap

A real-time multi-finger tap detection system

mediapipe numpy opencv python3

Last synced: 14 Apr 2026

https://github.com/aqueeqazam/student-placements-using-machine-learning-and-flask

Through the use of Flask and machine learning, I show how to implement the student placement prediction model.

flask machine-learning numpy pandas

Last synced: 14 Apr 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data from the online store Ice, which sells video games all over the world, to identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/gregoritsch3/ml_eda_classification_diabetes

An EDA and Machine Learning Classification exercise on the Diabetes dataset demonstrating the use of SQLAlchemy data import from an SQL database (PostgreSQL), Pre-processing Pipelines, ANOVA, 9 scikit-learn ML models, Hyperparameter Tuning for the best performing one, and feature importance.

anova machine-learning matplotlib numpy pandas pipelines scikit-learn seaborn sql sqlalchemy statistics

Last synced: 14 Apr 2026

https://github.com/mizcausevic-dev/embedding-drift-graph

Track how entity embeddings drift across encoder model versions. SQLite store + Strawberry GraphQL API. Cosine drift events computed automatically on every record. Reference impl for RAG/eval pipelines re-encoding their corpus.

ai-governance drift-detection embeddings graphql llm numpy python rag sqlite strawberry-graphql vector-search

Last synced: 01 Jun 2026

https://github.com/sralter/potential_talents

Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.

lambdarank learning-to-rank lightgbm matplotlib nlp numpy pandas python pytorch ranknet

Last synced: 09 Apr 2026

https://github.com/al-ghaly/stock-market-simulation

Simulate and visualize stock market behavior

matplotlib numpy python python-visualization

Last synced: 16 May 2026

https://github.com/alansteinbarth/irysy

🪻 Irysy 🌼 exploratory data analysis: visualizations, ML, transparent code, and original graphics. A project combining AI, aesthetics, and practical skills!

ai classification clean-code data-science exploratory-data-analysis feature-engineering github-pages iris-dataset jupyter-notebook machine-learning matplotlib modern-ui numpy open-source pandas portfolio-project python reproducible-research seaborn visualization

Last synced: 17 Oct 2025

https://github.com/cosmoquester/numpy-neural-net

Neural network samples implemented with NumPy, without machine learning libraries.

implementation neural-network numpy

Last synced: 22 Jan 2026

https://github.com/deepcloudlabs/dcl702-2021-jul-12

DCL-702: Data Analytics using Python

data-analytics numpy pandas python3

Last synced: 05 May 2026

https://github.com/sunilvarma-l/liverdiseaseprediction

"Streamlit app to predict liver disease risk using a machine learning model based on patient input data."

machine-learning matplotlib numpy pandas pickle python scikit-learn seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/architj6/gesture-volume-control

Easily control your system volume with simple hand gestures! This project leverages OpenCV, MediaPipe, and Pycaw to track hand movements and adjust the system volume dynamically. Say goodbye to keyboard shortcuts and volume buttons—just move your hand to set the volume!

handgesture mediapipe numpy opencv os pycaw python python3 volume-control

Last synced: 03 May 2026

https://github.com/neelays/xor-xnor_neural_network

NumPy neural network to approximate XOR/XNOR

numpy

Last synced: 15 May 2026

https://github.com/farhad-here/predict_student_performance

Predict Student Performance is a data analysis and machine learning project aimed at predicting students' final performance (g3) based on demographic, family, and academic features. The project supports both regression (predicting exact grades) and classification (pass/fail categories).

classification data-analysis data-visualization linear-regression machine-learning numpy pandas postgresql powerbi scikit-learn streamlit

Last synced: 14 Apr 2026

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/navindafernando/lane-detection

Lane Detection with Image Feature Extraction Techniques 🚡

image-feature-extraction kernel-filters machine-learning neighborhood numpy opencv sobel-filter

Last synced: 13 Apr 2026

https://github.com/saivarun2611/potato_disease_classifier

This project is a Potato Disease Classifier that uses a Convolutional Neural Network (CNN) to classify potato leaves as healthy, suffering from early blight, or late blight. The model helps in early detection of diseases, which can be crucial in maintaining crop health and yield.

cnn keras matplotlib numpy pandas python3 tensorflow

Last synced: 14 Apr 2026

https://github.com/icepanorama/internship-visualizations-and-demonstrations

A collection of some of the programs that I've written over the course of my internship.

artificial-intelligence machine-learning matplotlib numpy pandas python3 pytorch scikit-learn

Last synced: 14 Apr 2026

https://github.com/juzershakir/predicting_boston_housing_prices

Built a model to predict the value of a given house in the Boston real estate market using various statistical analysis tools. Identified the best price at which a client can sell their house, using machine learning.

bias-variance boston-housing-price-prediction data-exploration decision-tree-regression gridsearchcv k-fold machine-learning matplotlib mlfnd model-evaluation model-validation numpy pandas python3 r2-score sklearn supervised-learning udacity-nanodegree

Last synced: 22 Oct 2025

https://github.com/shuyib/pimadlkeras

Using the Keras specify-compile-fit-predict workflow on this binary classification problem to investigate whether I'll get better predictions.

diabetes hidden-layers keras-neural-networks matplotlib-figures model-validation numpy pima-indians-dataset prediction predictive-modeling python-3-5 seaborn

Last synced: 15 Mar 2026

https://github.com/hifly81/photox

Photo and album organizer for GTK3

gtk3 numpy opencv-python photography pil pillow python

Last synced: 24 Oct 2025

https://github.com/harmanveer-2546/reducing-data-entries

A way to delete data entries from a CSV/Excel file. For an Excel file, use excel instead of csv in the code.

csv data data-entry delete-data excel numpy pandas python

Last synced: 05 May 2026

https://github.com/bilgenurbekar/turkishcyberbullying

Contains fine-tuned BERT models and results in the text classification category using Turkish social media data

bert-fine-tuning huggingface-transformers matplotlib numpy pandas python pytorch scikit-learn transformers

Last synced: 07 Mar 2026

https://github.com/yashaka/learncv

Toolset used in learning Computer Vision at R_D course

cv numpy opencv utitlities

Last synced: 14 Apr 2026

https://github.com/manuel-lang/numpymongo

A Python package to export NumPy data to MongoDB

mongodb numpy wrapper

Last synced: 23 Feb 2026

https://github.com/miasanromen/math-modeling

Math modeling with Python. Statistics, math, graphics

math matplotlib numpy probality-theory scipy-stats statistics

Last synced: 14 Apr 2026

https://github.com/pradeep31747/smartsuggest-personalized_product_recommendations

This project implements a personalized product recommendation system using machine learning techniques to enhance user experience and drive engagement.

jupyter-notebook keras numpy pandas pyhton scikit-learn sql tensorflow vscode

Last synced: 28 Jan 2026

https://github.com/jsd115/tensorflow_bankfrauddetection

This repository aims to provide examples of my ability to produce ML models using tensorflow and sklearn

numpy pandas sklearn tensorflow

Last synced: 06 Feb 2026

https://github.com/ggrbill/phd-plot-scripts

My personal plot scripts used to generate graphs for my PhD Thesis

hacktoberfest matplotlib numpy python

Last synced: 18 Apr 2026

https://github.com/pushtogithub23/projectilemotion-simulation

This Jupyter notebook explores projectile motion through mathematics, visualizations, and interactive simulations.

jupyter-notebook matplotlib-pyplot numpy physics-simulation projectile-motion-simulation

Last synced: 28 Jan 2026

https://github.com/asut00/Machine-Learning-Program_42AI

Comprehensive Machine Learning path by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.

linear-regression machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 27 Oct 2025

https://github.com/sahraiidle/email-spam-detector

Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.

data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm

Last synced: 24 Jan 2026

https://github.com/jaybfn/single-molecule-electrophysiology-data-analysis

Pynanopore serves as a sophisticated toolkit for analyzing single molecules, designed to interpret complex states within nanopore data automatically.

docker numpy pandas plotly pytest python3 streamlit streamlit-webapp

Last synced: 28 Jan 2026

https://github.com/andres29ac/website_heatmaprisk

Map built with Streamlit and Folium to show the distribution of crime in each district of Trujillo using heat maps.

folium-maps heatmap-visualization numpy pandas ploty python stadistics streamlit

Last synced: 15 Apr 2026

https://github.com/guy1998/synthetic-data-generator-for-event-cameras

Automated workflow for the generation of synthetic data that can be used as an input for event cameras based on previously taken videos

computer-vision event-cameras numpy opencv synthetic-data

Last synced: 15 Apr 2026

https://github.com/hansalemaos/cv2_stack_images

Stacks 2 images (horizontal/vertical)

concat cv2 images numpy python stack

Last synced: 28 Jan 2026
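The cv2_stack_images entry above describes stacking two images horizontally or vertically. The sketch below shows the same idea in plain NumPy and is not the package's own API. It assumes both images already have the same height, width, and channel count, and the pixel data is made up.

```python
import numpy as np

# Two hypothetical 2x3 RGB "images" with layout (height, width, channels).
left = np.zeros((2, 3, 3), dtype=np.uint8)        # all black
right = np.full((2, 3, 3), 255, dtype=np.uint8)   # all white

side_by_side = np.hstack([left, right])  # joined along the width axis
stacked = np.vstack([left, right])       # joined along the height axis
```

Images of different sizes would need resizing or padding first, which this sketch leaves out.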

https://github.com/shwetapardhi/assignment-03-q5--hypothesis-testing

Chi2 contingency independence test, Q5. Fantaloons sales managers commented that the % of males versus females walking into the store differs based on the day of the week. Analyze the data and determine whether there is evidence at the 5% significance level to support this hypothesis. Assume the null hypothesis H0: independence of categorical variables (% of

alternate-hypothesis chi2-contingency contingency-analysis hypothesis-testing null-hypothesis numpy pandas python scipy significance-testing stats

Last synced: 28 Jan 2026

https://github.com/drsartoriuss/portfolio

Study projects from the Yandex Practicum "Data Science Specialist+" course

catboost computer-vision data-science deeplearning mashinelearning matplotlib nlp-machine-learning numpy pandas scipy seaborn sklearn sql torch tsf

Last synced: 28 Jan 2026

https://github.com/shahaba83/airplane-ticket-cancellation

In this project, we try to predict the probability that a buyer will cancel their plane ticket.

datatime numpy pandas python scikit-learn seaborn

Last synced: 25 Feb 2026

https://github.com/deepankkartikey/ml-projects

A repository of projects done while learning and exploring ML

classification csv machine-learning numpy pandas regression sklearn

Last synced: 15 Apr 2026

https://github.com/jaypanchal9/fraud-detection-case-study

A comprehensive case study applying machine learning techniques to detect fraudulent transactions effectively.

machine-learning matplotlib numpy pandas python3 scikit-learn seaborn xgboost

Last synced: 15 Apr 2026

https://github.com/hercules-peters/learn-ml-with-me

Welcome to ML-Journey, my personal progress tracker for mastering machine learning! This repository showcases my projects, notebooks, and explorations as I advance my skills, starting with hands-on projects.

machine-learning numpy pandas python sklearn

Last synced: 15 Apr 2026

https://github.com/muhkartal/fr-photo

A face-recognition system that works on images. First, you need to register a valid image for recognition.

face-recognition numpy opencv pillow python streamlit

Last synced: 15 Apr 2026