An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/pratishtha-abrol/principlecomponentanalysis

Principle Component Analysis with Numpy and Python

numpy principal-component-analysis python

Last synced: 03 May 2026

https://github.com/xaxm007/nn-numpy

A Basic Logistic Regression Neural Network using NumPy and calculus.

critical-indulgence logistic-regression neural-network numpy study

Last synced: 19 May 2026

https://github.com/realtristan/micronet

Neural network library from scratch

neural-networks numpy python pytorch

Last synced: 03 May 2026

https://github.com/garcane/Income-Prediction-ML

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 24 Oct 2025

https://github.com/nemeslaszlo/lending-club-analysis

Lending Club analysis, with a model that can predict whether or nor a borrower will pay back their loan.

binary-classification classification data-visualization keras matplotlib numpy pandas seaborn tensorflow

Last synced: 12 Apr 2026

https://github.com/duoan/machine-learning-notebook

A notebook repository for tracking learning machine learning notebook.

data-analysis decision-tree ensemble-model gbdt machine-learning numpy pandas xgboost

Last synced: 18 Jun 2026

https://github.com/nissanjk/linear-solver

A Python desktop application to solve systems of linear equations using various numerical methods.

anaconda3 fpdf imageio matplotlib numpy python3 tkinter

Last synced: 16 May 2026

https://github.com/alisonmitchell/boston-housing

Investigation of the Boston housing dataset to evaluate, train and test a regression model to predict house prices.

data-science machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn

Last synced: 10 Apr 2026

https://github.com/reddyprasade/bicycle-sharing-system-in-us

A bicycle-sharing system, public bicycle system, or bike-share scheme, is a service in which bicycles are made available for shared use to individuals on a short term basis for a price or free. Many bike share systems allow people to borrow a bike from a "dock" and return it at another dock belonging to the same system. Docks are special bike racks that lock the bike, and only release it by computer control. The user enters payment information, and the computer unlocks a bike. The user returns the bike by placing it in the dock, which locks it in place. Other systems are dockless. For many systems, smartphone mapping apps show nearby available bikes and open docks.

dataanalysis jupyter maplotlib numpy pandas python r

Last synced: 29 Aug 2025

https://github.com/kentlouisetonino/sw-statistics-calculator

A CLI program that compute mean, median, and percentiles.

numpy os python3 statistics

Last synced: 29 Mar 2025

https://github.com/dudynets/numerical-methods-labs-tkinter

Labs for Numerical Methods course at Ivan Franko Lviv National University.

interpolation labs linear-algebra matplotlib numerical-methods numpy python tkinter university

Last synced: 14 May 2026

https://github.com/nemeslaszlo/loan-data-decision-tree-and-random-forest

This repository covers a "Lending Club" analysis with decision tree and random forest with data visualizations and exploration.

decesion-trees matplotlib numpy pandas random-forest seaborn sklearn

Last synced: 17 Apr 2026

https://github.com/anfany/python3-practice

基于Python3的多种工具实例代码、以及流行库的使用贴士

matplotlib numpy pandas python3 tensorflow tips

Last synced: 04 May 2026

https://github.com/hamdaniqhmqd/project-predict-saham-bbri

Repository Project-Predict-Saham-BBRI is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.

data-visualization numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/brpy/100-prisoners-problem

Simulation of 100 Prisoners Problem using Python

numpy python simulation veritasium

Last synced: 14 Apr 2026

https://github.com/gaurangdave/house_price_predictions

Machine Learning Application to predict House Prices

hands-on learning-by-doing machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/4thel00z/numpy-serializer

numpy-serializer is a nice and high-level way to serialize numpy.arrays while preserving their shape and type.

bytes deserialization deserialize deserializer msgpack numpy raw serialize serializer serializers

Last synced: 27 Jan 2026

https://github.com/harmanveer-2546/heart-failure-prediction

Heart failure is a severe condition in which the heart is unable to pump blood effectively. Early prediction of heart failure can significantly improve patient outcomes. This project aims to build a predictive model using machine learning techniques to identify patients at risk of heart failure.

exploratory-data-analysis feature-engineering jupyter-notebook matplotlib model-training-and-evaluation numpy pandas python sckiit-learn seaborn visualization-of-results

Last synced: 14 Apr 2026

https://github.com/nirmalyabag20/loan-status-prediction-using-machine-learning

This project focuses on predicting the loan status (approved or not approved) based on various applicant details. The goal is to develop a machine learning model that accurately classifies whether a loan should be approved, helping financial institutions make informed lending decisions.

matplotlib numpy pandas python scikit-learn seaborn support-vector-machine

Last synced: 19 Jan 2026

https://github.com/wilfriedmercier/somptimised

An optimised version of sklearn-som with extended functionalities.

machine-learning machine-learning-algorithms numpy python python3 self-organizing-map som

Last synced: 09 Jun 2026

https://github.com/itzzmeakhi/investigate-tmdb-dataset

Tmdb dataset is taken to perform analysis. Messy data is cleaned by performing many steps like dropping null values and rising values and analyzing correlation among them using visualization techniques using matplotlib and seaboarn to derive the answers that i imposed during the early stages of analysis.

csv matplotlib numpy pandas python3 seaborn

Last synced: 03 May 2026

https://github.com/matheussc017/irisclassification_sklearn

Application of Machine Learning technique to classify 3 types of iris flower

data-science machine-learning numpy pandas python

Last synced: 09 Apr 2026

https://github.com/muralisuvvani/airbnb-exploratory-analysis

Exploratory Data Analysis (EDA) on Airbnb NYC dataset to uncover trends, patterns, and insights. Data cleaning and visualization of Airbnb listings in New York City. Analyzing Airbnb NYC 2019 dataset using Python (Pandas, Numpy, Matplotlib, Seaborn)

analytics data-visualization exploratory-data-analysis numpy pandas python3

Last synced: 04 May 2026

https://github.com/mrtaz77/numerical-analysis

Hub of numerical methods, implemented using python and its data analysis libraries like numpy and matplotlib.

interpolation matplotlib-pyplot newton-raphson numpy regression

Last synced: 19 Jun 2026

https://github.com/shreeparab1890/flipkart-laptops-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Laptops listed on Flipkart.

data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 14 Apr 2026

https://github.com/cassiofb-dev/pandasscript

Pandas Script, uma lingaguem para visualizar e gerenciar dados.

compilers lexer numpy pandas parser python

Last synced: 09 Apr 2026

https://github.com/fatma-moanes/deep-learning-labs

My implementation for the labs of the Neural Networks and Deep Learning course that I studied at my university, Zewail City.

classification cnn data-science deep-learning gan keras matplotlib neural-network-architectures neural-networks numpy pandas preprocessing python regression-models regularization seaborn tensorflow time-series-analysis

Last synced: 14 Apr 2026

https://github.com/rahatmoktadir03/market-anomaly-detection

This AI-engineering project involves building a machine learning model to classify financial market conditions and detect potential market crashes. It includes developing investment strategies based on the model’s predictions and designing an AI-driven bot to explain the results to end users.

ai anomaly-detection financial-analysis investment-strategies machine-learning numpy pandas python sklearn

Last synced: 09 Feb 2026

https://github.com/rkarahul/machinelearning

This is for my all machine learning code

analysis computer-vision matplotlib mysql nlp numpy pandas python

Last synced: 11 Apr 2026

https://github.com/mariamabidi/pinn-based-flow-prediction

This repository contains code and experiments for predicting 3D aerodynamic flow around car geometries using Physics-Informed Neural Networks (PINNs) and for analyzing flow features via autoencoder-based clustering.

computer-vision machine-learning neural-network numpy pytorch pyvista scikit-learn

Last synced: 05 Aug 2025

https://github.com/briankopp/fewerbytes

compress numpy arrays. binary and numeric techniques for numpy compression

compress compression numpy python python3 storage

Last synced: 18 May 2026

https://github.com/chirindaopensource/crypto_currencies_interest_rates

End-to-end Python Implementation of Bergault et al.'s (2025) methodology for constructing yield curves without traditional bonds. Implements inverse options replication, robust statistical methods, and closed-form analytical solutions for risk-neutral interest rate discovery in digital asset markets.

academic-research arbitrage-pricing cryptocurrency data-science defi derivatives econometrics financial-engineering interest-rates jupyter-notebook mathematical-finance numpy options-pricing pandas python quantitative-finance ransac risk-management robust-statistics yield-curve

Last synced: 04 May 2026

https://github.com/mattdelaune/saas_cohort_analysis

This project analyzes customer segmentation and behavior using data science and cohort analysis. Key metrics like CRR, NRR, CLR, and CLV are examined through detailed charts, including the cohort layer cake and CLR vs. CLV cost efficiency analysis. Exploratory Data Analysis and systematic data manipulation reveal actionable insights.

anaconda git jupyter-notebook matplotlib numpy pandas python saas-data sales-data sales-data-analysis-project seaborn

Last synced: 03 Jan 2026

https://github.com/ejw-data/proj-nba-eda

NBA exploratory data analysis project with Pandas and Tableau

nba numpy pandas python tableau

Last synced: 14 Apr 2026

https://github.com/vbharadwaj-bk/python_cpp_tutorial

Code for a Tutorial on Writing C++ Extensions for Python.

cpp numpy pybind11 python

Last synced: 10 May 2026

https://github.com/heisenberg141/probability-based-boundary-detection

This Repository consists of implementation of probability based boundary detection algorithm which gives more accurate results than canny edge detection.

boundary-detection canny-edge-detection computer-vision edge-detection gabor-filters gaussian-filters image-gradients image-processing kmeans-clustering lm-filters numpy opencv python3 sobel-filter texton-maps

Last synced: 04 May 2026

https://github.com/dkealvaro/mnist

Achieved 75% accuracy in the MNIST dataset using a simple cosine similarity approach under 50 lines of code, with no training

cosine-similarity machine-learning mnist numpy

Last synced: 18 May 2026

https://github.com/mchenryspagg/analysis-of-car-repair-shop-operations

This project presents a comprehensive data analysis of a car repair shop's operations, utilizing an extended dataset of sales receipts.

dataanalysisusingsql datavisualization matplotlib-pyplot mysql mysql-database mysqlconnector numpy pandas python salesanalysis seaborn sql

Last synced: 07 Jan 2026

https://github.com/gandhasiri-rahul-mohan/random-forests-q2-company_data

About the data: Let’s consider a Company dataset with around 10 variables and 400 records.

data-science machine-learning numpy pandas python random-forest seaborn-plots

Last synced: 07 Apr 2026

https://github.com/cusyio/datenverarbeitung-mit-numpy

Kurs zum Schreiben und Optimieren eigener Programme zur Datenverarbeitung mit Numpy.

numpy

Last synced: 08 Jun 2026

https://github.com/mgobeaalcoba/datacademy

Iré subiendo los distintos desafíos del reto de Platzi llamado Datacademy

anaconda deepnote google-colab jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/lexxai/goit_python_ds_hw_08

Модуль 8. Глибоке навчання. Tensorflow.

classification data-science matplotlib minst numpy pandas python sklearn tensorflow

Last synced: 14 Apr 2026

https://github.com/epsoft/tensorflow-variables

Tensorflow Variables

numpy tensorflow variables

Last synced: 04 May 2026

https://github.com/svetlanam/data-scripts

Simple scripts for data cleaning, etl transformations and data reorganisations

data-cleaning data-transformation dimensions etl facts keboola numpy pandas python3 snowflake sql star-schema

Last synced: 04 May 2026

https://github.com/himel-sarder/complete-numpy

A comprehensive guide to mastering NumPy with practical examples and applications in machine learning. Perfect for learners and developers looking to deepen their knowledge of numerical computations in Python.

data-science dataanalysis machine-learning ml numpy numpy-exercises numpy-library numpy-python numpy-tutorial pythonlibrarires

Last synced: 04 May 2026

https://github.com/kostrykin/blas-benchmark

A benchmark comparison of different BLAS backends for NumPy.

benchmark blas cvxpy mkl numpy openblas python

Last synced: 04 May 2026

https://github.com/manjillama/machine-learning-101

Introduction to machine learning, conda environment, Jupyter Notebook, Pandas, NumPy and Matplotlib.

conda-environment jupyter-notebook machine-learning matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/apoorva-info/cognitive-load-using-blink-rates

Analyzing blink rates and task performance to measure cognitive load during digital span tasks using a custom Python tool.

matplotlib numpy pandas python

Last synced: 08 Apr 2026

https://github.com/abtaaahi/smartxi

SmartXI — A Streamlit app to recommend football teams based on budget, formation, and style, plus predict player market values using machine learning models. Easy setup and interactive visualizations included.

machine-learning machine-learning-algorithms numpy pandas plotly python random-forest streamlit

Last synced: 13 Apr 2026

https://github.com/florianwoelki/mnist_fashion_cnn

This is a classification problem based on the mnist fashion dataset. I've used a CNN to classify the images.

classifier cnn fashion keras mnist numpy pandas tensorflow

Last synced: 08 Apr 2026

https://github.com/kumartusha/data_scientist-ai-ml-_training_anudip_foundation

Anudip Foundation Data Analytics Training 📊 A repository containing projects, assignments, and coursework from the Anudip Foundation Data Analytics training program. This includes work on: 🧹 Data Cleaning & Preprocessing 📉 Data Visualization & EDA 📊 Statistical Analysis

excel matplotlib numpy pandas powerbi python scipy sql

Last synced: 10 Apr 2026

https://github.com/quinta0/pycolors

A python program to help people with color blindness recognize color in an image

colorblindness colors image-processing numpy pandas-dataframe python tkinter-gui

Last synced: 05 May 2026

https://github.com/shreeparab1890/movie-recommender-system

This notebook is trying to build a model which will recommend the movie based on given movie and genre. In this we use Popularity Based Recommendation, Content Based Recommendation and Collaborative Filtering based Recommendation.

bag-of-words cosine-similarity matplotlib numpy pandas python scikit-learn sklearn vectorization

Last synced: 09 Apr 2026

https://github.com/yessasvini23/deepfake_immunization_toolkit

🛡️ AI-powered toolkit to detect deepfakes, educate users, and verify content authenticity using federated learning and blockchain. Built for election security, media integrity, and digital literacy.

blockchain matplotlib numpy opencv python pytorch scikit-learn

Last synced: 11 Apr 2026

https://github.com/khaymanii/fake_news_prediction_model

This model was built using python and logistic regression algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 05 May 2026

https://github.com/jabhij/eda_experiments

In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.

ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis

Last synced: 14 Apr 2026

https://github.com/paulo-santos-ds/rotatividade_de_clientes

A operadora de comunicações InternetGO está interessada em prever a rotatividade de seus clientes (churn). Se for identificado que um usuário está planejando trocar de operadora, a empresa poderá oferecer códigos promocionais e opções de planos especiais para evitar a perda desse cliente.

catboost numpy pandas pyplot python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/subh888999/-online-food-ordering-behavior-analysis-projects

This project analyzes online food ordering behavior to uncover key insights on customer preferences, restaurant performance, and factors influencing order volume. Through data-driven analysis, it aims to help optimize pricing, delivery times, and customer experience for food delivery platforms and restaurants.

matplotlib numpy pandas python seaborn statistics

Last synced: 05 May 2026

https://github.com/kshitijshrivastava1903/neural_network_analysis_on_lending_club-dataset

Used tensorflow's neural network model to predict whether or not a person pays back a loan on the basis of his historical data and personal details of 3.9 lakh people like interest rate, employment details, address, etc.

deep-learning dropout-keras earlystopping keras-tensorflow matplotlib-pyplot neural-networks numpy pandas seaborn tensorflow-models

Last synced: 18 Feb 2026

https://github.com/faisal-khann/comprehensive_banking_portfolio

This analysis explores the banking customer dataset to uncover key patterns in account holdings, loan portfolios, income profiles, and transactional behavior. By integrating both financial and demographic variables, we examine customer engagement, lending trends, and risk exposure across different segments.

data-manipulation eda matplotlib numpy pandas powerbi python seaborn

Last synced: 20 Apr 2026

https://github.com/lbirkert/digit-recognition

[from-scratch] feed forward neural network that can recognize handwritten digets

feedforward-neural-network machine-learning numpy python

Last synced: 19 May 2026

https://github.com/gauravsakure02/python_libraries

Data Analysis using Python libraries

matplotlib numpy pandas-library streamlit

Last synced: 14 Apr 2026

https://github.com/myounus-codes/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression

In this project I have cleaned the data for the model. Project Google Colab Link: https://colab.research.google.com/drive/1vQY-XEFJSdEkW2PQOSf1j13Yk8L-XXNw?usp=sharing

algorithms data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python

Last synced: 05 May 2026

https://github.com/devwithzain/tensorflow-learning

Learning Tensorflow

numpy tensorflow

Last synced: 19 May 2026

https://github.com/intscription/python-programs

Python basics-advance

numpy pandas scikit-learn

Last synced: 05 May 2026

https://github.com/shanmukhsrisaivedullapalli/smsspamclassification

SMSSpamClassification is a machine learning project aimed at accurately classifying SMS messages as either spam or ham (non-spam). It employs natural language processing techniques to extract relevant features from the text data and utilizes various classification algorithms to build a robust spam detection model.

jupyter-notebook numpy pandas pickle python3 sklearn spam-classification spam-detection

Last synced: 11 Apr 2026

https://github.com/varun-khorgade/churnshield-customer-retention-predictor

Built an ML-based classification model to predict customer churn. Applied data preprocessing, feature engineering, and ensemble algorithms to improve prediction accuracy and help businesses implement retention strategies.

classification-algorithm datapreprocessing f1-score feature-engineering hyperparameter-tuning logistic-regression matplotlib model-evaluation numpy pandas python ran roc-auc scikit-learn seaborn xgboost

Last synced: 07 May 2026

https://github.com/willprice/arsenal

Personal arsenal of python utilities

data-science machine-learning numpy pandas personal-tools python

Last synced: 09 Apr 2026

https://github.com/camilajaviera91/clustering-first-approach

This code will help to understand, in a simple way, how the clustering model works. To achieve this, we will input certain variables (strings and integers), then plot them to visualize and analyze how the values are positioned.

array cluster clustering dictionaries matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/heshanera/paclassifier

Online Passive Aggressive Binary Classifier

binary-classifier numpy passive-aggressive python

Last synced: 06 May 2026

https://github.com/tasninanika/mammographic-masses-analysis-dt

This project uses a Decision Tree Classifier to predict whether a detected mammographic mass is benign (0) or malignant (1) based on input features.

decision-tree-classifier numpy pandas pyhton3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/mchenryspagg/prosper-loan-project

A data analysis project that entails using the data from a fictional loan company known as Prosper to perform exploratory data analysis using univariate, bivariate and multivariate visualizations to produce insights that answers questions asked from the data

datastorytelling exploratory-data-analysis exploratory-data-visualizations jupyter-notebook matplotlib-pyplot numpy pandas python seaborn-plots

Last synced: 13 Apr 2026

https://github.com/simran1002/strip-analyzer

The Strip Color Identification is used to analyze images of urine strips uploaded by users. Using computer vision techniques, particularly leveraging OpenCV, the interface identifies and extracts the colors present on the strip.

flask multer nodejs numpy opencv python reactjs

Last synced: 10 Apr 2026

https://github.com/msthamizh/bizcardx-extracting-business-card-data-with-ocr

Developing a Streamlit application that enables users to upload business card images and extract key details using easyOCR. The app allows users to manage the extracted data (view, update, delete) and securely stores it in a MySQL database. It also highlights extracted text directly on the business card image using Pillow.

easyocr mysql numpy pandas pillow python regular-expression streamlit

Last synced: 10 Apr 2026