An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/prathameshdhande22/data-visualization-tutorial

Repository Contains all the stuff required for Data Visualization. Numpy Tutorial, Pandas Tutorial, Matplotlib Tutorial, Seaborn Tutorial

data-visualization jupyter-notebook matplolib numpy pandas pandas-dataframe python seaborn tutorial

Last synced: 09 Apr 2026

https://github.com/alexandrbig1/py-design-analysis-algo

Advanced Python project on the design and analysis of algorithms. Includes practical modules on dynamic programming, graph theory, big data (Bloom filter, HyperLogLog), parallel computing (MapReduce), and optimization. Uses libraries like NumPy, Matplotlib, NetworkX, BTrees, PrettyTable, and Datasketch for efficient computation and visualization.

algorithms asyncio git heapq matplotlib numpy python python3 turtle

Last synced: 09 Apr 2026

https://github.com/htlinh0604/invoice-data-extraction

This project demonstrates a classic OCR pipeline. This Flask app takes an image, applies an OpenCV preprocessing pipeline, and uses Tesseract OCR to digitize Vietnamese invoices (Bách Hóa Xanh)..

flask numpy ocr-recognition opencv pandas python regex tesseract-ocr

Last synced: 10 Nov 2025

https://github.com/zazi2002/neuroscience-project-cnn

This project is part of the Neuroscience of Learning, Memory, Cognition course and focuses on the visual processing pathways in the brain, particularly the dorsal and ventral streams.

cnn-for-visual-recognition learning matplotlib numpy scipy visual-processing

Last synced: 09 May 2026

https://github.com/mhmudfzli/exploring-mental-health-data

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

catboost lightgbm matplotlib numpy pandas scikit-learn seaborn xgboost

Last synced: 09 Apr 2026

https://github.com/mrtarantoga/mdl-density-histogram

Cython-accelerated MDL histogram density estimation. Implements Kontkanen & Myllymaki's dynamic programming algorithm for optimal variable-width bins, parametric complexity with Ramanujan approximations, and automatic bin count selection. Based on "MDL Histogram Density Estimation" (JMLR 2007).

cython-library mdl numpy pypi-package python3

Last synced: 30 Nov 2025

https://github.com/jswong65/machine_learning_nanodegree

Projects of Udacity Machine Learning nanodegree

machine-learning numpy pandas python scikit-learn scipy

Last synced: 09 Apr 2026

https://github.com/garcane/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 08 Apr 2026

https://github.com/sahajrajmalla/co2-calculator-api

A Web Application backend API that enables people to create events, join events, and measure the carbon footprint produced by each participant to get insights about online and onsite carbon emissions.

apis carbon-emissions carbon-footprint fastapi makefile numpy pandas precommit sqlalchemy sqlite3 uvicorn

Last synced: 09 Apr 2026

https://github.com/nirmalyabag20/wine-quality-prediction-machine-learning

This project analyzes the chemical properties of wines to identify key factors influencing quality. By leveraging machine learning techniques, i aim to develop predictive models that accurately classify wine quality, providing valuable insights for producers and enthusiasts alike.

k-neighbors-classifier logistic-regression machine-learning matplotlib numpy pandas python random-forest seaborn svc

Last synced: 09 Apr 2026

https://github.com/upul/indi

Small Machine Learning library written in Python

algorithms machine-learning numpy python scipy

Last synced: 10 Apr 2026

https://github.com/dulajkavinda/live-sketching

✍️ Converting live video stream from webcam into a sketch using OpenVC and Keras

added image-filtering keras machinelearning matplotlib numpy opencv python

Last synced: 10 Apr 2026

https://github.com/thedevreda/python

My Full Backup Packagaes & Files Guide for Python 👷‍♂️

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/Tritonix711/FractureAI

This tool helps people upload X-rays to find broken bones. It uses a machine to mark where the breaks are and gives users marked pictures to download. A smart computer also helps people understand their broken bones and gives them advice.

css cv2 flask gorq html javascript matplotlib npm numpy pandas pydantic python react scikit-learn torch torchvision ultralytics

Last synced: 19 Aug 2025

https://github.com/mnitin-reddy/reducing-review-overhead-with-ml-based-application-screening

A machine learning classification project to filter out low-probability visa applications using historical data. It features an end-to-end implementation with CI/CD on AWS, achieving 93% accuracy with a KNN model optimized through Optuna, alongside integration of MLOps tools like Evidently and MLflow.

aws docker githubactions hypothesistesting machinelearning matplotlib mlflow mlops mongodb numpy optuna pandas python scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/alisonmitchell/boston-housing

Investigation of the Boston housing dataset to evaluate, train and test a regression model to predict house prices.

data-science machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 10 Apr 2026

https://github.com/alibowndyn/flickerforge

A customizable image and GIF generator mimicking old television static, offering control over colors, blur, dimensions, pixel size, and many more.

numpy pillow pyqt6 python

Last synced: 05 May 2026

https://github.com/mattdelaune/saas_cohort_analysis

This project analyzes customer segmentation and behavior using data science and cohort analysis. Key metrics like CRR, NRR, CLR, and CLV are examined through detailed charts, including the cohort layer cake and CLR vs. CLV cost efficiency analysis. Exploratory Data Analysis and systematic data manipulation reveal actionable insights.

anaconda git jupyter-notebook matplotlib numpy pandas python saas-data sales-data sales-data-analysis-project seaborn

Last synced: 03 Jan 2026

https://github.com/gandhasiri-rahul-mohan/random-forests-q2-company_data

About the data: Let’s consider a Company dataset with around 10 variables and 400 records.

data-science machine-learning numpy pandas python random-forest seaborn-plots

Last synced: 07 Apr 2026

https://github.com/cusyio/datenverarbeitung-mit-numpy

Kurs zum Schreiben und Optimieren eigener Programme zur Datenverarbeitung mit Numpy.

numpy

Last synced: 08 Jun 2026

https://github.com/surtecha/multi-physics-simulation-engine

A real world physics simulation application built with python

jax numpy physics pygame python scipy simulation

Last synced: 14 Apr 2025

https://github.com/thamirisq/data_analysis_culinary_journey

Project to analyze key information from Culinary Jorney, a marketplace Company, and develop an initial strategy to lead the entire team effectively.

folium-maps groupby-method jupyter-notebook markercluster marketplace-ds matplotlib-pyplot numpy pandas pandas-dataframe streamlit

Last synced: 10 Apr 2026

https://github.com/kumartusha/data_scientist-ai-ml-_training_anudip_foundation

Anudip Foundation Data Analytics Training 📊 A repository containing projects, assignments, and coursework from the Anudip Foundation Data Analytics training program. This includes work on: 🧹 Data Cleaning & Preprocessing 📉 Data Visualization & EDA 📊 Statistical Analysis

excel matplotlib numpy pandas powerbi python scipy sql

Last synced: 10 Apr 2026

https://github.com/huangcongqing/python-libraries

python库(numpy,pandas...)学习

matplotlib numpy pandas python3 xgboost

Last synced: 09 Mar 2026

https://github.com/williyam-m/movie-recommendation-system

Developed a web app with a cosine similarity machine learning model for personalized recommendations based on user history, likes, bookmarks, and activity. Implemented user auth and CRUD operations for movies.

django machine-learning numpy pandas prediction-model python scikit-learn

Last synced: 10 Apr 2026

https://github.com/burakkagann/pixels2genai-2.0

Free, open-source curriculum teaching generative art and AI across 15 modules, from a single pixel to diffusion models.

computer-vision creative-coding curriculum deep-learning design-based-research diffusion-models education gans generative-ai generative-art machine-learning numpy pytorch touchdesigner vae

Last synced: 19 Jun 2026

https://github.com/simran1002/strip-analyzer

The Strip Color Identification is used to analyze images of urine strips uploaded by users. Using computer vision techniques, particularly leveraging OpenCV, the interface identifies and extracts the colors present on the strip.

flask multer nodejs numpy opencv python reactjs

Last synced: 10 Apr 2026

https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.

matplotlib numpy pandas python3 scikit-learn seaborn titanic-dataset titanic-kaggle titanic-survival-prediction

Last synced: 10 Apr 2026

https://github.com/msthamizh/bizcardx-extracting-business-card-data-with-ocr

Developing a Streamlit application that enables users to upload business card images and extract key details using easyOCR. The app allows users to manage the extracted data (view, update, delete) and securely stores it in a MySQL database. It also highlights extracted text directly on the business card image using Pillow.

easyocr mysql numpy pandas pillow python regular-expression streamlit

Last synced: 10 Apr 2026

https://github.com/lakhidaksh-01/carinsightanalyzer

Interactive Car Ratings & Insights

cars matplotlib numpy pandas python seaborn streamlit

Last synced: 10 Apr 2026

https://github.com/lolosoares/dataanalises

This repository shows my projects made in my journey learning about data analises, ciency, engeniering and ML.

acces data-science dax excel mysql numpy pandas powerbi powerquery python r seaborn sql vba

Last synced: 10 Apr 2026

https://github.com/yashika-malhotra/micromobility-service-provider---hypothesis-testing

Examined factors influencing demand for micro-mobility shared electric cycles Performed exploratory analysis and hypothesis testing, revealing the distinct influence of weather-season association on hourly counts

colab-notebook data-visualization eda exploratory-data-analysis hypothesis-testing jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-library scipy-stats seaborn skit-learn

Last synced: 12 Apr 2026

https://github.com/salmandeveloperz/ml_house_prediction

project for house price prediction using Classification & Regression models. Includes Docker setup for easy deployment.

classification-model clustering deep-learning machine-learning matplotlib numpy pandas python3 regression-models scikit-learn

Last synced: 10 Apr 2026

https://github.com/camilajaviera91/prediction-of-housing-prices-using-linear-regression

This project provides tools to search for datasets on Kaggle, download and preprocess them, and perform predictions using a Linear Regression model. It includes interactive text-based user interfaces built with `curses`.

curses kaggle linear-regression matplotlib-pyplot mean-absolute-error mean-square-error numpy pandas pathlib python scikit-learn train-test-split

Last synced: 10 Apr 2026

https://github.com/dmarks84/coursework_coursework_project_automobile-sales-visualization

Project for IBM Data Science course on Visualization & Dashboards -- Analyzed historical sales data, performing EDA and setting up an interactive dashboard

communication dash dashboards data-modeling elt etl folium matplotlib numpy pandas pipelines plotly python scipy seaborn visualization

Last synced: 10 Apr 2026

https://github.com/mothnik/robust_fourier

Noise- and Outlier-Robust Fourier Transform with Hermite Functions with NumPy and Numba

chebyshev-polynomials fourier-transform hermite-functions hermite-polynomials least-square-regression logsumexp numba numpy python robust-statistics

Last synced: 04 Jan 2026

https://github.com/aryansk/fake-news-detection

A sophisticated machine learning solution to detect fake news using multiple classification algorithms. Identify the credibility of news articles with advanced text analysis techniques!

fake-news-detection machine-learning machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning

This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.

decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc

Last synced: 18 Jan 2026

https://github.com/34j/numpy-flint-arb

Arbitrary precision floating / ball arithmetic (interval arithmetic) dtype in NumPy / array API

acb arb arbitrary-precision array-api flint interval-arithmetic mpmath numpy

Last synced: 10 Mar 2026

https://github.com/datarohit/fifa-2020--data-analysis

This is dataset is from Kaggle.com which contains data of 18000+ fifa players with more than 100 features about them for analysis. Simple analysis performed on this Dataset.

exploratory-data-analysis exploratory-data-visualizations matplotlib-pyplot numpy pandas seaborn

Last synced: 10 Apr 2026

https://github.com/prashver/customer-segmentation

This project utilizes unsupervised machine learning to segment bank customers for targeted marketing campaigns. It covers tasks like data exploration, determining optimal clusters, and applying k-means for segmentation. Ideal for marketing departments in banking and retail industries.

customer-segmentation matplotlib numpy pandas seaborn sklearn

Last synced: 07 Apr 2026

https://github.com/achalesh27022003/python-assignments-iwt

This repository is objected to have all the assignments and project related to IWT (Semester - 3).

cosinewave frame grid image imagetk labels matplotlib-pyplot numpy numpy-arrays numpy-library photoimage pil python3 sinewave spyder tkinter tkinter-gui window

Last synced: 10 Apr 2026

https://github.com/nemeslaszlo/sale-price-of-bulldozers

The goal of predicting the sale price of bulldozers. How well can we predict the future sale price of a bulldozer, given its characteristics previous examples of how much similar bulldozers have been sold for? (Archive kaggle competition)

matplotlib numpy pandas random-forest-regressor regression scikit-learn seaborn

Last synced: 10 Apr 2026

https://github.com/g-eoj/cv-tl-keras

Use the cross validation functions from scikit-learn to evaluate image classification transfer learning with Keras models.

cross-validation keras numpy scikit-learn transfer-learning

Last synced: 10 Apr 2026

https://github.com/kishlayjeet/zomato-data-exploration

In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.

data-analysis eda matplotlib numpy pandas zomato-data-exploration

Last synced: 10 Apr 2026

https://github.com/datarohit/titanic-survival-data--classification

This is a Titanic Survival Data from Kaggle Competition with Multiple Classification Algorithm testing and Detailed Exploratory Data Analysis.

classification-algorithm kaggle-competition machine-learning-algorithms matplotlib-pyplot numpy pandas seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken

Bei der Vermessung eines physischen Raumes ist das Ergebnis eine Punktwolke. Diese Punktwolke beschreibt dann ausgewählte Punkte im Raum, zum Beispiel auf den Wänden und der Decke. Wenn diese Punkte in zwei seperaten Messungen gemessen werden, vielleicht sogar von unterschiedlichen Geräten, soll hinterher herausgefunden werden wie genau diese Punktwolken übereinstimmen. Dafür gibt es zwei grundsätzlich verschiedene Methoden. Diese sollen hier verglichen werden.

3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics

Last synced: 11 Apr 2026

https://github.com/amishmathur1/resumeats

✨ With Resume ATS Analyzer, users can improve their resume quality, boost ATS compatibility, and increase their chances of landing interviews — all with smart AI assistance.

express-python matplotlib-python nltk nltk-python numpy openai-api pandas plotly plotly-dash plotly-express pypdf2 python streamlit wordcloud

Last synced: 11 Apr 2026

https://github.com/tasninanika/callifornia-housing-price-prediction-svr

Support Vector Regression (SVR) is a type of Support Vector Machine used for predicting continuous values.

matplotlib numpy pandas python3 scikit-learn seaborn svm-regression

Last synced: 11 Apr 2026

https://github.com/dagdelenbunyamin/barcode-scanner-flb

Ein interaktives Barcode-Scanner-Tool zur Verwaltung von Schülerabmeldungen. Mit Webcam-Unterstützung, automatischer Zeiterfassung und PDF-Export für Lehrkräfte – entwickelt mit Streamlit, OpenCV und SQLite.

fpdf numpy opencv pandas python pyzbar streamlit

Last synced: 11 Apr 2026

https://github.com/suraj-sedai/lstm-deep-learning

A step-by-step guide to mastering sequence prediction using TensorFlow and LSTM. This repository covers everything from basic linear predictions to advanced multi-step forecasting, many-to-many LSTM models, and real-world applications like stock price forecasting and sales prediction.

lstm-neural-networks lstm-sentiment-analysis numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/sergio11/breast_cancer_diagnosis_mlp

🩺🔬 MLP-based Breast Cancer Diagnosis: Predicts tumor malignancy from image features, aiding in early detection. 📊🤖

deep-learning deep-neural-networks machine-learning mlp-classifier mlp-networks numpy sklearn sklearn-classify

Last synced: 21 Feb 2026

https://github.com/kaikymoura/backcourt-api

Repository for the Basketball Advanced Stats API

fastapi nba-api nba-stats numpy pandas python swagger uvicorn

Last synced: 03 May 2026

https://github.com/faisal-khann/diwali-sales-analysis

The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future

csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/narenkhatwani/arkouda-projects

This repository contains the source codes of the projects done using Arkouda (a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.)

arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python

Last synced: 17 Apr 2026

https://github.com/ishutak/disease_prediction

An AI-powered disease prediction system that uses machine learning to predict diseases based on symptoms. The system employs an ensemble of models including Random Forest and Neural Networks to provide accurate predictions with confidence levels.

css3 htlm5 javascript jquery numpy pandas pytorch scikit-learn select2

Last synced: 11 Apr 2026

https://github.com/elazzouzihassan/si-fraud-detection-prototype

Système de Détection des Fraudes avec Python (Prototype).

googlecolab matplotlib numpy pandas python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/prithivsakthiur/qwen-image-edit-2509-loras-fast

Qwen-Image-Edit-2509-LoRAs-Fast is a high-performance, user-friendly web application built with Gradio that leverages the advanced Qwen/Qwen-Image-Edit-2509 model from Hugging Face for seamless image editing tasks.

aoti diffusers diffusion-models flash-attention-3 huggingface-spaces huggingface-transformers kernel numpy peft python pytorch qwen-image-edit qwen-image-edit-2509 qwen2-5-vl qwen3-vl torch torchvision

Last synced: 03 May 2026

https://github.com/gaurangdave/house_price_predictions

Machine Learning Application to predict House Prices

hands-on learning-by-doing machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/hamdaniqhmqd/project-predict-saham-bbri

Repository Project-Predict-Saham-BBRI is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.

data-visualization numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/listiangr/play_tennis_classification

This project analyzes weather data to predict whether a person will play tennis or not. The main goal is to make accurate predictions based on factors such as weather, temperature, humidity, and wind.

classification flask html-css jupyter-notebook kaggle-dataset naive-bayes numpy pandas pickle play-tennis python sklearn

Last synced: 11 Apr 2026

https://github.com/hansalemaos/cythonunique

Fast implementation of unique elements in an array - up to 30x faster than NumPy

faster numpy unique

Last synced: 28 May 2026

https://github.com/nemeslaszlo/lending-club-analysis

Lending Club analysis, with a model that can predict whether or nor a borrower will pay back their loan.

binary-classification classification data-visualization keras matplotlib numpy pandas seaborn tensorflow

Last synced: 12 Apr 2026

https://github.com/kr1shnasomani/sentimentscope

Sentiment analysis on movie review using TensorFlow and GloVe embeddings

deep-learning keras matplotlib natural-language-processing neural-networks numpy pandas scikit-learn tensorflow

Last synced: 12 Apr 2026

https://github.com/rayyan9477/machine-learning-driven-backorder-prediction-system

Experience a state-of-the-art Django web application designed to predict product backorders with exceptional accuracy. This platform leverages advanced machine learning techniques, incorporating pre-trained Random Forest Classifier, Decision Tree, and LGBM models.

matplotlib notebook numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/shahinuralambhuiyan/learning-ml

Start learning Machine Learning basic from CodeBasics Channel

anaconda jupyter-notebook kaggle-dataset matplotlib numpy pandas pip python

Last synced: 12 Apr 2026

https://github.com/prashver/stock-price-prediction-lstm

Using Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM), this project predicts Nestle India stock prices. The dataset spans from May 18, 2018 to May 20, 2022, and results are visualized using matplotlib.

keras lstm-model matplotlib numpy pandas recurrent-neural-network

Last synced: 12 Apr 2026

https://github.com/kavayk29/quora-duplicate-question-pair

This project improves information retrieval by detecting duplicate question pairs in the Quora dataset using data exploration, text preprocessing, feature engineering, and models like Random Forest and LSTM, aiming to streamline question-answering.

beautifulsoup4 bilstm gensim keras lstm matplotlib numpy pandas pytorch random-forest seaborn sklearn tensorflow xgboost

Last synced: 21 Jan 2026

https://github.com/dhruvsrikanth/pynn

Purely NumPy based API for building deep neural networks.

deep-learning deep-learning-framework machine-learning numpy python pytorch

Last synced: 09 May 2026

https://github.com/datarohit/imdb-reviews-sentiment-analysis

This is a NLP - Sentiment Analysis Project built using Bernoulli-Naive-Bayes Algorithm to Predict is the IMDB Movie Review is Positive or Negative.

bernoulli-naive-bayes classification count-vectorizer multinomial-naive-bayes nltk numpy pandas porter-stemmer regex sentiment-analysis sklearn

Last synced: 29 Apr 2026

https://github.com/pranav-bot/ml-dl-models-from-scratch

This contains the most commonly used machine learning and deep learning models written from scratch without the use if any libraries except Numpy which is used for calculations

deep-learning descion-tree from-scratch linear-regression logistic-regression machine-learning naive-bayes neural-networks numpy perceptron pytorch qlearning random-forest regression reinforcement-learning support-vector-machines svm

Last synced: 15 Apr 2026

https://github.com/nazli-d/binary-classification-using-cnn

This project utilizes a CNN model to classify cat and dog images through training and testing processes. The model is created using the Keras library on the TensorFlow backend.

binary-classification cat-and-dog-classifier cnn convolutional-neural-networks flatten keras max-pooling numpy opencv python relu-activation sigmoid-function tensorflow

Last synced: 13 Apr 2026

https://github.com/abtaaahi/smartxi

SmartXI — A Streamlit app to recommend football teams based on budget, formation, and style, plus predict player market values using machine learning models. Easy setup and interactive visualizations included.

machine-learning machine-learning-algorithms numpy pandas plotly python random-forest streamlit

Last synced: 13 Apr 2026