An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/harmanveer-2546/supply-chain

Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.

customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis

Last synced: 10 Apr 2026

https://github.com/otivedani/hedge_hog

Histogram of Oriented Gradients, full numpy

feature-extraction histogram-of-oriented-gradients hog-features numpy

Last synced: 13 Apr 2026

https://github.com/zion-dev1/e-commerce-analysis

Short machine learning project which analyses customer behaviour 🔎

analysis clustering ecommerce machine-learning numpy pandas sklearn

Last synced: 05 Jul 2025

https://github.com/tasninanika/k-nearest_neighbors

K-Nearest Neighbors (KNN) is a simple and powerful machine learning algorithm used for classification and regression.

knn matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/pvlbzn/kot-or-not

Logistic regression (shallow neural network) which predicts whether input image is a cat or not.

classification-algorithm image-classification logistic-regression machine-learning neural-network numpy

Last synced: 01 Mar 2025

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/parmeetbhamrah/air-quality-india-analysis

Exploratory data analysis of real-time air quality data from Indian cities using Python, Pandas, Matplotlib, and Seaborn.

air-quality data-analysis eda exploratory-data-analysis government-data india matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/jayanthsd2003/dms_sarathi_-driver-monitoring_system-

This is a project which can be seen on TATA Trucks and other premium edition cars based and focussed on the safety and monitoring driver Consciousness state

dlib dlib-face-detection numpy opencv-python pygame python pyttsx3 scipy

Last synced: 13 Apr 2026

https://github.com/ngirimana/decision-trees-and-random-forest-

For this project we will be exploring publicly available data from [LendingClub.com](www.lendingclub.com). Lending Club connects people who need money (borrowers) with people who have money (investors). Hopefully, as an investor you would want to invest in people who showed a profile of having a high probability of paying you back. We will try to create a model that will help predict this. Lending club had a [very interesting year in 2016](https://en.wikipedia.org/wiki/Lending_Club#2016), so let's check out some of their data and keep the context in mind. This data is from before they even went public. We will use lending data from 2007-2010 and be trying to classify and predict whether or not the borrower paid back their loan in full. You can download the data from [here](https://www.lendingclub.com/info/download-data.action) or just use the csv already provided. It's recommended you use the csv provided as it has been cleaned of NA values. Here are what the columns represent: * credit.policy: 1 if the customer meets the credit underwriting criteria of LendingClub.com, and 0 otherwise. * purpose: The purpose of the loan (takes values "credit_card", "debt_consolidation", "educational", "major_purchase", "small_business", and "all_other"). * int.rate: The interest rate of the loan, as a proportion (a rate of 11% would be stored as 0.11). Borrowers judged by LendingClub.com to be more risky are assigned higher interest rates. * installment: The monthly installments owed by the borrower if the loan is funded. * log.annual.inc: The natural log of the self-reported annual income of the borrower. * dti: The debt-to-income ratio of the borrower (amount of debt divided by annual income). * fico: The FICO credit score of the borrower. * days.with.cr.line: The number of days the borrower has had a credit line. * revol.bal: The borrower's revolving balance (amount unpaid at the end of the credit card billing cycle). * revol.util: The borrower's revolving line utilization rate (the amount of the credit line used relative to total credit available). * inq.last.6mths: The borrower's number of inquiries by creditors in the last 6 months. * delinq.2yrs: The number of times the borrower had been 30+ days past due on a payment in the past 2 years. * pub.rec: The borrower's number of derogatory public records (bankruptcy filings, tax liens, or judgments).

artificial-intelligence decision-tree-classifier machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn

Last synced: 11 Apr 2026

https://github.com/fearless-spider/stooq-dax-seasonal

Parsing seasonality from time series data can often be useful in data analytics. It helps with analyzing seasonality for decision making as well as for more accurate forecasts. Python can be used to separate out these trend and seasonal components.

numpy pandas python statistics

Last synced: 13 Apr 2026

https://github.com/mahnoorsheikh16/Credit-Card-Default-Prediction

This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.

chi-square-test encoding hiplot imblearn json knn-imputer matplotlib numpy pandas pca-analysis pillow plotly robust-scalar scipy seaborn sklearn smote streamlit ttest visualization

Last synced: 01 Mar 2025

https://github.com/nafisalawalidris/tools-for-data-science

It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.

arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development

Last synced: 11 Apr 2026

https://github.com/aymen016/image-processing-algorithms

Explore Python-based image processing from scratch with core algorithms like interpolation, convolution, and filtering!

matplotlib numpy pillow python

Last synced: 13 Apr 2026

https://github.com/itancio/realestateprice

This is an application of deep learning to train and predict real estate price dataset in King County, USA.

datat-visualization machine-learning numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/Krisseck/hki-traffic-predict

Keras neural network to predict traffic in Helsinki

avoindata helsinki keras numpy open-data python traffic

Last synced: 13 Jul 2025

https://github.com/chaymabghozzi/somnolence

Détection de somnolence

dlib numpy opencv pygame python

Last synced: 13 Apr 2026

https://github.com/rara-ch/data-analysis-portfolio

This repository to store my data analytics projects, showcasing my skills in SQL and Python.

data-analysis mathematics matplotlib numpy pandas portfolio probability python seaborn sql statistics

Last synced: 12 Mar 2025

https://github.com/neerajcodes888/diwali-sales-analysis

An open-source repository for sales data analysis. Dive into insightful trends, metrics, and visualizations to empower data-driven decision-making. Ideal for data analysts, business professionals, and enthusiasts seeking comprehensive sales insights. Clone, customize, and contribute to enhance your sales analytics journey.

data-science-projects data-visualization numpy pandas-dataframe python3 sales-analysis seaborn-plots

Last synced: 26 Mar 2025

https://github.com/camilajaviera91/clustering-first-approach

This code will help to understand, in a simple way, how the clustering model works. To achieve this, we will input certain variables (strings and integers), then plot them to visualize and analyze how the values are positioned.

array cluster clustering dictionaries matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/imswappy/ads-eda-predictor

Interactive Streamlit app for marketing campaign analytics and prediction. Includes dashboards, EDA, econometrics tests (ADF, cointegration, OLS diagnostics), ML pipelines with preprocessing, CV, and persistence. Predict outcomes with Linear/Ridge/Lasso/Random Forest regressors.

adf breusch-pagan cointegration lasso-regression linear-regression matplotlib-pyplot numpy pandas random-forest ridge-regression seaborn sklearn

Last synced: 02 May 2026

https://github.com/avs123a/image-color-spaces-morphology-demo

Simple demo that shows extracting RGB and HSV channels, converting from RGB to HSV and reverse converting. There are 4 main morphological operations used: erosion, diletion, opening and closing image

demo image-processing jupyter-notebook numpy opencv python3

Last synced: 05 May 2026

https://github.com/gauravsakure02/python_libraries

Data Analysis using Python libraries

matplotlib numpy pandas-library streamlit

Last synced: 14 Apr 2026

https://github.com/imaginesamurai/advanced-vision-system

A Python-based camera handling utility that provides seamless control over video capture with advanced zoom functionality. Features include adjustable resolution, smooth digital zoom controls, and efficient frame processing, ideal for computer vision applications.

camera computer-vision cv2 cv2-library night-vision night-vision-camera night-vision-goggles numpy opencv opencv-python python python3 real-time realtime stable supoort thermal thermal-camera thermal-images video

Last synced: 16 Apr 2026

https://github.com/carlthome/ipytoolbox

A collection of IPython magics and other utilities

ipython jupyter matplotlib numpy

Last synced: 18 Jan 2026

https://github.com/jabhij/eda_experiments

In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.

ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis

Last synced: 14 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_08

Модуль 8. Глибоке навчання. Tensorflow.

classification data-science matplotlib minst numpy pandas python sklearn tensorflow

Last synced: 14 Apr 2026

https://github.com/smusab9152/pokemon_data_analysis

This repo that explores and analyzes a dataset of Pokémon attributes. The analysis includes data cleaning, exploratory data analysis (EDA), and visualizations .

analytics data-analysis data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas pokemon python seaborn statistical-analysis

Last synced: 02 May 2026

https://github.com/dimatura/imgutils

Miscellaneous Numpy and PIL utilities for images.

image numpy pil pillow python

Last synced: 19 Jan 2026

https://github.com/cyberoctane29/unicorn-companies-analysis

This project explores unicorn companies, private startups valued at over $1 billion, using Python for data analysis. It covers industry trends, geographic distribution, and investment patterns through EDA, including data cleaning, handling missing values, datetime transformations, and visualizations to uncover key insights.

data-analysis eda numpy pandas python

Last synced: 02 May 2026

https://github.com/kwanit1142/sars-cov-2-detection-radiology-based-multi-modal-multi-task-framework

Keras-based Implementation for "SARS-CoV-2 Detection: Radiology based Multi-modal Multi-task Framework" (Accepted in 45th IEEE EMBC 2k23)

gans keras matplotlib numpy opencv os pandas pillow scipy sklearn tensorflow transfer-learning wasserstein-distance

Last synced: 14 Apr 2026

https://github.com/nirmalyabag20/loan-status-prediction-using-machine-learning

This project focuses on predicting the loan status (approved or not approved) based on various applicant details. The goal is to develop a machine learning model that accurately classifies whether a loan should be approved, helping financial institutions make informed lending decisions.

matplotlib numpy pandas python scikit-learn seaborn support-vector-machine

Last synced: 19 Jan 2026

https://github.com/4thel00z/numpy-serializer

numpy-serializer is a nice and high-level way to serialize numpy.arrays while preserving their shape and type.

bytes deserialization deserialize deserializer msgpack numpy raw serialize serializer serializers

Last synced: 27 Jan 2026

https://github.com/avinesh-masih/employee-report-generator

A Python-based tool that reads employee and purchase data from Excel, generates personalized PDF reports using FPDF, and organizes them into folders. Includes a progress bar for tracking report generation.

excel flask fpdf matplotlib numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/shsiena/Evolutionary-Dots

A simple pygame genetic-algorithm project simulating evolution of dots to find a path around obstacles to a goal.

evolutionary-algorithm genetic-algorithm machine-learning mutation numpy pathfinding pygame python3

Last synced: 14 Oct 2025

https://github.com/ayaanjawaid/google_playstore_data_analysis

This project provides an in-depth analysis of Google Play Store apps and user reviews, focusing on understanding app performance, user sentiment, and key trends in app categories. Using Python, I performed data cleaning, feature engineering, and exploratory data analysis (EDA) on app data and reviews.

data-analysis eda html numpy pandas-dataframe plotly python vizualisation

Last synced: 24 Feb 2026

https://github.com/nissanjk/linear-solver

A Python desktop application to solve systems of linear equations using various numerical methods.

anaconda3 fpdf imageio matplotlib numpy python3 tkinter

Last synced: 16 May 2026

https://github.com/ola-kaznowska/ai-digit-recognition

AI digit recognition. Artificial neural network for digit recognition

ai artificial-intelligence machine-learning neural-network numpy tensorflow

Last synced: 14 Apr 2026

https://github.com/cedricstrolz/digitalisierung-statistik-ws24

Hier handelt es sich NICHT um Lösungen sondern Lernunterlagen

matplotlib numpy pandas python statistics

Last synced: 25 Feb 2026

https://github.com/sultanazhari/analyze-client-behavior

Megaline Telecomunication Company want to know how their customer habbits in using their services. As a data Analyst we're provided Customer Habbits and suggest to determine the options that need to be taken

matplotlib-pyplot numpy python3 scipy seaborn telecommunications

Last synced: 15 Apr 2026

https://github.com/g0r0kh/forecast

layouts statistical analys for forecasting

matplotlib numpy pandas

Last synced: 30 Jan 2026

https://github.com/abhihirekhan/abhi-s-data-science-portfolio

A list of data science projects completed by me for academic, self learning, and creative purposes

datascience ipynb-jupyter-notebook jupyter jupyter-kernels jupyter-notebook keras-tensorflow machine-learning numpy pandas portfolio python tensorflow

Last synced: 31 Jan 2026

https://github.com/simranjeet97/quotes-analysis

Kaggle Dataset on Quotes Analysis and Visualization With Python, Pandas and MatplotLib Using Jupyter Notebook.

data data-science datavisualization jupyter-notebook kaggle kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python quotes quotes-application

Last synced: 15 Apr 2026

https://github.com/nemeslaszlo/heart-disease

Heart disease classification project with different models (LogisticRegression, KNeighboursClassifier, RandomForestClassifier) and detailed reports.

classification knearest-neighbor-classifier logistic-regression mathplotlib numpy pandas randomforest-classification scikit-learn seaborn

Last synced: 15 Apr 2026

https://github.com/fsubject/rotating-cube

Rotating Cube is a 3D engine made from scratch

3d 3d-engine 3d-graphics from-scratch numpy pygame python

Last synced: 31 Jan 2026

https://github.com/zolabar/interactive-calculus

Interactive Calculus with SymPy and Jupyter Widgets

ipywidgets jupyter-notebook numpy plotly python sympy voila-dashboard

Last synced: 15 Apr 2026

https://github.com/mohd-faizy/learn_numpy

NumPy (short for Numerical Python) is a powerful Python library used for working with arrays, matrices, and numerical computations.

array-manipulations numpy numpy-arrays numpy-matrix numpy-neural-network numpy-python numpy-tutorial

Last synced: 26 Feb 2026

https://github.com/rickydoan/machine-learning-risk-model-prediction-classification

This project leverages machine learning to provide insights into loan and credit risk. By analyzing user-provided financial data, it predicts the likelihood of loan default, generates a credit score, and assigns a risk rating. Designed to assist financial institutions and individuals in making informed decisions

classification joblib machine-learning numpy pandas python sklearn-library streamlit

Last synced: 27 Feb 2026

https://github.com/sralter/sustainability_insights

A data analysis project that derived insights from an emissions dataset sourced from Climate TRACE.

duckdb eda matplotlib numpy pandas tableau

Last synced: 12 Feb 2026

https://github.com/muhammadabyaz/facial-recognition-based-attendance

Facial Recognition Based Attendance System

numpy open-cv pandas pyqt python3 qml supabase

Last synced: 12 Feb 2026

https://github.com/huaxiaz/honor_engineering_analysis

This repository is adapt from the course materials for Honors Engineering Analysis at Northwestern University. The course is designed for Engineering first-year undergraduate students to learn about linear algebra and its applications.

complex-numbers complexity degree-distribution echelon-form erdos-renyi-model fractals iterative-methods jacobi-iteration jupyter-notebook linear-algebra markov-chain networkx networkx-tutorial numpy numpy-matrix pagerank-algorithm python random-graphs randomgraph

Last synced: 14 Feb 2026

https://github.com/giuliotaralli/machine-learning-exercises

Practical exercises from the Machine Learning course

graphviz matplotlib numpy pandas python scipy sklearn

Last synced: 15 Feb 2026

https://github.com/bestmahdi2/uni__dataminningstackoverflowproject

A university project related to data mining lesson on StackOverflow website data with Python language

cart csv data-mining logistic-regression matplotlib mlp naive-bayes nltk numpy pandas python scikit-learn scipy seaborn stackoverflow svc textblob tqdm xgboost

Last synced: 16 Feb 2026

https://github.com/swimshahriar/machine-learning

This repository contains the machine learning models that I have worked on during learning "Machine Learning".

numpy pandas python3 sklearn

Last synced: 16 Apr 2026

https://github.com/dataspieler12345/dash-by-plotly-lib-project

The Dash web application framework, is used to create browser-based interactive data visualization interfaces with Python.

bootstrap dash numpy pandas plot python

Last synced: 18 Apr 2026

https://github.com/texnoforge/texnomagic

TexnoMagic library for digital Magic

gmm magic numpy python recognition scikit-learn scipy

Last synced: 03 Mar 2026

https://github.com/asadullah-imran/student-performance-data-analysis

This is a very simple exploratory data analysis of Student Performance in a class. data is taken from Kaggle.

datanalysis matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/mohd-mb2205153/data-science-project

A data analysis project using Python to study climate patterns and genocide-related datasets from Gaza and the West Bank.

climate-change data-science matplotlib numpy palestine python3

Last synced: 05 Mar 2026

https://github.com/itdaniher/snr

Lin1 Finale

chirpz dsp numpy python svd sympy

Last synced: 17 Apr 2026

https://github.com/uni-creator/handwritten_number_recognition_system

A Handwritten Number Recognition System built from scratch using Deep Learning from Scratch. The model is trained on digit images and can classify handwritten numbers with high accuracy.

computer-vision deep-learning handwritten-digit-recognition image-processing implementation-from-scratch minst nueral-networks numpy pandas pytorch

Last synced: 17 Apr 2026

https://github.com/rhazra-003/fake_news_detector

A Machine Learning model to detect fake news with more than 95% accuracy

fake-news numpy pandas scikit-learn

Last synced: 18 Apr 2026

https://github.com/sreejabethu/cost-of-living-index-globally

This Streamlit app is a data visualization tool that allows users to explore and compare the cost of living indices across different countries. The app takes in a dataset of cost of living indices for various countries and provides an interactive interface for users to select specific indices and countries to visualize.

bargraphs costofliving costoflivingindex csv datavisualization index matplotlib-pyplot numpy pandas-dataframe plotly python seaborn streamlit-webapp

Last synced: 19 Apr 2026

https://github.com/dwarkadhish2005/weather-predictor

Built a model based on collected data which takes various parameters and predicts the weather

classifier matplotlib numpy pandas sklearn xgboost

Last synced: 20 Apr 2026

https://github.com/salmandeveloperz/student-performance-prediction-and-grade-classification

A Machine Learning Project that predicts student grade and performance from a Dataset. Copule of Libraries are used for data pre-processing, training model, heatmaps, trees and other Algorithms.

ann classification machine-learning metplotlib numpy pandas regression-models skilearn supervised-learning tensorflow unsupervised-learning

Last synced: 20 Apr 2026

https://github.com/gunjangyl/rainfall_prediction_system

The Rainfall Prediction System is a machine learning-based web application that forecasts rainfall based on weather parameters like precipitation, temperature, and wind speed.

bootstrap css django html javascript jupyter-notebook numpy pandas python

Last synced: 20 Apr 2026

https://github.com/nikhilroxtomar/introduction-to-numpy

A list of jupyter notebook explaining the various functions and properties of Numpy.

numpy numpy-tutorial

Last synced: 21 Apr 2026

https://github.com/charmve/numpycnn

Building Convolutional Neural Networks From Scratch using NumPy

cnn computer-vision deep-learning deep-learning-tutorial machine-learning numpy project python tutotrial

Last synced: 22 Apr 2026

https://github.com/samaid/game-of-life

Conway's Game of Life implemented using array libraries and JIT compilers for CPU and GPU

dpnp numba numba-dpex numpy opencv-python python

Last synced: 25 Apr 2026

https://github.com/iusztinpaul/ibm-assets-recommendation-engine

Deployable recommendation system for predicting the fittest IBM articles for a given user.

ibm machine-learning numpy pandas python recommendation-engine recommendation-system svd-matrix-factorisation

Last synced: 27 Apr 2026

https://github.com/akash-rajak/image-resizer

Python Script to resize image to any dimension.

cv2 gui image-resizer numpy os python3 random resizeimage tkinter

Last synced: 27 Apr 2026

https://github.com/iliatrofimov/linearsystems_solvers

Several methods of solving sistems of linear equasions

algebra-lineal numpy

Last synced: 28 Apr 2026

https://github.com/kritimbist/365-days-of-github-challenge-ai-machine-learning

This repository is part of my 365 Days Challenge: AI × Machine learning, where I combine my passion for Machine Learning 🤖 to learn, build, and document projects every single day for one year.

data-science data-visualization deep-learning machine-learning matplotlib numpy python scikit-learn

Last synced: 28 Apr 2026

https://github.com/aryansk/movie-recommendation-system

A sophisticated movie recommendation system that suggests films based on user ratings and collaborative filtering techniques. Get personalized movie suggestions based on viewing patterns and user preferences! 🎯

machine-learning machine-learning-algorithms matplotlib numpy pandas python recommender-system seaborn

Last synced: 29 Apr 2026

https://github.com/tynoee/movies-data-anlysis

This project explores a dataset of movies sourced from IMDb (via Kaggle) to uncover trends and insights about movie production, popularity, ratings, and more.

jupyter-notebook kaggle-dataset matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/rakibhhridoy/exploratorydataanalysis-python

Exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.

ab-testing chitest data-science eda exploratory-data-analysis ftest hypotheses hypothesis-testing inferential-statistics numpy pandas python statistical-analysis statistics statsmodels ttest

Last synced: 30 Apr 2026

https://github.com/vidhi1290/ai-ml-cartoonifier

Welcome to Cartoonify, a fun and interactive project that converts real photos into cartoonified masterpieces! This project guides you through the entire process, from loading the image to displaying the final cartoon version.😛

bilateral-filter cartoonify-ai edge-detection matplotlib numpy opencv skimage tensorflow

Last synced: 30 Apr 2026

https://github.com/sralter/term_deposit_marketing

Predicting which customers will most likely purchase a type of financial product, achieving a time savings of over 93%.

matplotlib numpy optuna pandas seaborn sklearn

Last synced: 30 Apr 2026

https://github.com/anarya22/heart-disease-classification

Predicting heart disease using machine learning. This notebook looks into various python base ML and DS libraries in an attempt to build a machine learning model capable of predicting whether or not someone has heart disease based on their medical attributes.

data-cleaning data-visualization machine-learning matplotlib numpy pandas scikit-learn

Last synced: 01 May 2026

https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba

First assignment for the course Data Mining @CSE.UOI

data-analysis data-science numpy scipy seaborn statistics

Last synced: 01 May 2026