An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/v-mayya/python-sales-data-analysis

Group project run by CFG, completed with one other team member, to analyze spreadsheets of fake sales data using Python

analysis data matplotlib numpy python

Last synced: 29 Apr 2026

https://github.com/tynoee/movies-data-anlysis

This project explores a dataset of movies sourced from IMDb (via Kaggle) to uncover trends and insights about movie production, popularity, ratings, and more.

jupyter-notebook kaggle-dataset matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/hernandemonteiro/host_national_search_2015

Statistical analysis of the Pesquisa Nacional por Amostra de Domicílios (PNAD, Brazil's National Household Sample Survey)

colab-notebook matplotlib numpy pandas pnad python3 pythonnotebook seaborn

Last synced: 29 Apr 2026

https://github.com/ralstonraphael/water_access_ml_project

This project analyzes life expectancy data from the World Health Organization (WHO), sourced from Kaggle. The dataset spans 183 countries across 6 regions, covering metrics such as life expectancy, mortality rates, healthcare access, and socioeconomic factors.

data-science machine-learning numpy pandas python

Last synced: 29 Apr 2026

https://github.com/prodev717/neurogen

A simple Python library to build and train deep neural networks (SLP and MLP). It includes functions for forward propagation, backpropagation, loss calculation, and model saving/loading.

ai backpropagation deep-learning machine-learning mlp neural-networks numpy python slp

Last synced: 30 Apr 2026

https://github.com/li-plus/tinynn

An educational PyTorch-like neural network framework based on NumPy

autograd deep-learning machine-learning neural-network numpy python

Last synced: 30 Apr 2026

https://github.com/rakibhhridoy/exploratorydataanalysis-python

Exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.

ab-testing chitest data-science eda exploratory-data-analysis ftest hypotheses hypothesis-testing inferential-statistics numpy pandas python statistical-analysis statistics statsmodels ttest

Last synced: 30 Apr 2026

https://github.com/akshaypatra/expression_detector

The Expression Detector is an AI-based system that identifies and classifies human facial expressions in real time. It uses computer vision and deep learning techniques to analyze facial features and determine the emotional state of a person, such as happiness, sadness, anger, surprise, fear, disgust, or neutrality.

keras-tensorflow neural-network numpy opencv-python python3

Last synced: 30 Apr 2026

https://github.com/hklyousfi/cyberxnoteloom

Hybrid piano synthesizer with ADSR envelopes, resonance modeling, and MIDI rendering

adsr-envelope digital-audio hybrid-synth midi-processor numpy piano-synthesizer python

Last synced: 30 Apr 2026

https://github.com/torahappy/takasa

OpenStreetMap shortest(?) path problem solver that takes elevation differences into account

numpy openstreetmap scipy shortest-path

Last synced: 30 Apr 2026

https://github.com/zazi2002/neuroscience-reinforcement-learning

Neuroscience of Learning, Memory, Cognition project, containing a Jupyter notebook implementing a reinforcement learning approach for a neuroscience-related task.

deep-learning matplotlib numpy pytorch reinforcement-learning

Last synced: 30 Apr 2026

https://github.com/sralter/term_deposit_marketing

Predicting which customers will most likely purchase a type of financial product, achieving a time savings of over 93%.

matplotlib numpy optuna pandas seaborn sklearn

Last synced: 30 Apr 2026

https://github.com/nmsby/pca-machine-learning-lab

Principal Component Analysis (PCA) implementation and analysis lab for Machine Learning. Features manual PCA implementation, scikit-learn applications, data compression, and feature extraction with detailed visualizations.

data-analysis dimensionality-reduction jupyter-notebook machine-learning numpy pca python scikit-learn visualization

Last synced: 01 May 2026

https://github.com/anarya22/heart-disease-classification

Predicting heart disease using machine learning. This notebook uses various Python-based machine learning and data science libraries in an attempt to build a model capable of predicting whether or not someone has heart disease based on their medical attributes.

data-cleaning data-visualization machine-learning matplotlib numpy pandas scikit-learn

Last synced: 01 May 2026

https://github.com/snehilsanyal/ee524

Course webpage for IIT Guwahati EE524 Machine Learning Lab (Jul-Nov 2020) Session

course-webpage machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 May 2026

https://github.com/khaymanii/big_mart_prediction_model

This model was built using Python and the logistic regression algorithm

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 01 May 2026

https://github.com/epsoft/tensorflow-basics

Tensorflow Basics

basics numpy tensorflow

Last synced: 01 May 2026

https://github.com/macagua/entrenamiento.data_scientist_python

Repository of manuals and resources for the "Data Scientist en Python" training course given by Leonardo J. Caballero G.

data-analytics data-scientist data-visualization numpy pandas-dataframe python37 streamlit

Last synced: 01 May 2026

https://github.com/athari22/house_sales_in_king_count_usa

The project performs a data analysis for a Real Estate Investment Trust that would like to start investing in residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 01 May 2026

https://github.com/cchandre/rg

Renormalization for the break-up of invariant tori in Hamiltonian flows

expm hamiltonian hamiltonian-dynamics invariant-tori kolmogorov-arnold-moser numpy python3 renormalization-group scipy

Last synced: 01 May 2026

https://github.com/scorleos773/three-body-problem-simulator

Three-Body Problem Simulation - A Python-based simulation of the Three-Body Problem with both 2D and 3D visualizations. Uses `matplotlib` and `scipy` to model gravitational interactions and animate orbital trajectories. 🚀🔭

differential-equations matplotlib mplot3d newtonian-mechanics numerical-integration numpy orbital-mechanics python runge-kutta-methods scipy-integrate three-body-problem three-body-simulation

Last synced: 02 May 2026

https://github.com/back2matching/turboquant-vectors

Compress embeddings 6x instantly with TurboQuant. First pip package using Google's TurboQuant (ICLR 2026) for vector search. 71.9% recall vs FAISS PQ 13.3%.

compression embeddings faiss machine-learning numpy quantization rag turboquant vector-search

Last synced: 08 Jun 2026

https://github.com/y-india/project-smart-attendance-system-opencv

AI-powered Smart Attendance System using face, eye, and mask detection. It automates daily attendance with real-time computer vision and CSV logging. See the README for details.

attendance-system automation cctv-cameras cctv-detection numpy opencv opencv-python pandas project projects python python3 real-world-problem-solving real-world-project webcam

Last synced: 02 May 2026

https://github.com/nekruzash/machine-learning-decision-trees

From the CS2023 AI/DS/ML class: a machine learning model trained on heart disease data sets to predict a correlation, using NumPy and pandas.

jupyter-notebook numpy pandas python

Last synced: 02 May 2026

https://github.com/rakibhhridoy/machinelearning-featureselection

Before training a model, the first priority is the data, not the model. The more the data is preprocessed and engineered, the more the model will learn. Feature selection is one of the methods for processing data before feeding it to the model. Various feature selection techniques are shown here.

extratreesclassifier feature-selection gridsearchcv lasso-regression logistic-regression machine-learning numpy pandas pca rfe rfecv scikit-learn selectkbest

Last synced: 02 May 2026

https://github.com/prashver/titanic-survival-prediction

This project tackles the Titanic challenge on Kaggle, predicting passenger survival based on variables like age, sex, and passenger class. The Jupyter notebook covers essential steps of a data science pipeline, including exploratory data analysis, data cleaning, feature engineering, and modeling. The dataset used is the Titanic dataset.

classification-algorithm machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 02 May 2026

https://github.com/oguzhansarigol/expected-goals-xg-data-analysis

This code analyzes the expected goals of football players: from which areas of the field, at what times, with which foot, with which part of the body, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 02 May 2026

https://github.com/iv4n-ga6l/realtime-car-traffic-tracking

Real-time car traffic tracking using the YOLOv8 model

cartracking numpy opencv python ultralytics yolov8 yolov8n

Last synced: 02 May 2026

https://github.com/zazi2002/machine-learning-project

Introduction to Machine Learning project with the goal of improving the classification performance on a dataset by optimizing the number of features and weak learners.

dimentionality-reduction ensemble-learning numpy pca random-forest scikit-learn

Last synced: 02 May 2026

https://github.com/italoseara/cet083

Measures of Position (or quantiles) - CET083

data-science matplotlib numpy pandas portuguese python statistics university

Last synced: 03 May 2026

https://github.com/cs-joy/numpyv1.25.2

@numpy

numpy

Last synced: 03 May 2026

https://github.com/dhruv-26/parkease-pro

ParkEase Pro is an AI-powered smart parking system built with Python, OpenCV, MySQL, and ANPR technology. It automatically calculates parking fees, tracks entry-exit times, and detects empty spots using AI for real-time monitoring.

automatic-number-plate-recognition computer-vision easyocr multiprocessing numpy opencv pickle pil python sql tkinter-gui

Last synced: 09 Jun 2026

https://github.com/realtristan/micronet

Neural network library from scratch

neural-networks numpy python pytorch

Last synced: 03 May 2026

https://github.com/harizonelopez/face-detector

This project is a real-time face detection application using OpenCV's deep learning-based face detector. It captures faces from a webcam feed, detects faces with high accuracy using a pre-trained Caffe model, and saves the detected faces as image files labeled with the user's name.

cdn dnn numpy opencv python3

Last synced: 03 May 2026

https://github.com/saptak625/doublependulum

This repository is my derivation of the equations of motion of a double pendulum using Lagrangian Mechanics and then solving the nonlinear, coupled equations using a Runge-Kutta 45 integration scheme.

differential-equations lagrangian-mechanics matplotlib numerical-integration numpy python scipy vpython

Last synced: 03 May 2026

https://github.com/vidursatija/tensorpy

A simple NumPy autodiff library

autograd machine-learning numpy python

Last synced: 03 May 2026

https://github.com/wilfriedmercier/somptimised

An optimised version of sklearn-som with extended functionalities.

machine-learning machine-learning-algorithms numpy python python3 self-organizing-map som

Last synced: 09 Jun 2026

https://github.com/itzzmeakhi/investigate-tmdb-dataset

The TMDb dataset is analyzed. Messy data is cleaned through steps such as dropping null values and rising values, and correlations among the variables are analyzed with matplotlib and seaborn visualizations to answer the questions I posed in the early stages of the analysis.

csv matplotlib numpy pandas python3 seaborn

Last synced: 03 May 2026

https://github.com/asit-14/python-programming

This repository contains all the programs and resources we will use to learn Python programming. It also includes various modules, Python libraries, and several project examples to enhance our learning experience.

matplotlib modules numpy pandas python tkinter

Last synced: 03 May 2026

https://github.com/utkarsh-284/deep-learning

This repository contains the documents and main project file needed for building a DNN from scratch using NumPy.

data-science deep-learning deep-neural-networks machine-learning mnist mnist-handwriting-recognition neural-networks numpy python3

Last synced: 03 May 2026

https://github.com/simipk12/eda-healthcare-project

Healthcare Insurance Cost Analysis: this project analyzes the Healthcare Insurance dataset to explore factors that influence medical charges. The goal is to identify key drivers of insurance costs, detect outliers, and provide actionable insights for individuals, insurance companies, and policymakers.

eda matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/chirindaopensource/crypto_currencies_interest_rates

End-to-end Python Implementation of Bergault et al.'s (2025) methodology for constructing yield curves without traditional bonds. Implements inverse options replication, robust statistical methods, and closed-form analytical solutions for risk-neutral interest rate discovery in digital asset markets.

academic-research arbitrage-pricing cryptocurrency data-science defi derivatives econometrics financial-engineering interest-rates jupyter-notebook mathematical-finance numpy options-pricing pandas python quantitative-finance ransac risk-management robust-statistics yield-curve

Last synced: 04 May 2026

https://github.com/priyanka7411/customer-churn-prediction

Customer Churn Prediction using machine learning. This project covers the full data science lifecycle—data cleaning, exploratory analysis, feature engineering, model building, and extracting actionable business insights.

jupyter-notebook model numpy pandas python random-forest-classifier scikit-learn-matplotlib-seaborn

Last synced: 04 May 2026

https://github.com/heisenberg141/probability-based-boundary-detection

This repository contains an implementation of a probability-based boundary detection algorithm, which gives more accurate results than Canny edge detection.

boundary-detection canny-edge-detection computer-vision edge-detection gabor-filters gaussian-filters image-gradients image-processing kmeans-clustering lm-filters numpy opencv python3 sobel-filter texton-maps

Last synced: 04 May 2026

https://github.com/megha060199/academic-erp

Python, SQL, Javascript

mysql numpy python

Last synced: 04 May 2026

https://github.com/tingnie/dataanalysis

Notes on Python data analysis libraries

numpy pandas python3

Last synced: 04 May 2026

https://github.com/epsoft/tensorflow-variables

Tensorflow Variables

numpy tensorflow variables

Last synced: 04 May 2026

https://github.com/svetlanam/data-scripts

Simple scripts for data cleaning, ETL transformations, and data reorganisation

data-cleaning data-transformation dimensions etl facts keboola numpy pandas python3 snowflake sql star-schema

Last synced: 04 May 2026

https://github.com/harmanveer-2546/predicting-customer-churn-for-a-telecom-company

Customer churn is a significant issue for large companies. Companies, particularly in telecom, are trying to develop methods for predicting customer churn because it has a direct impact on revenue.

ann autoencoder churn-prediction customer-churn-prediction encoding mlp model-building-and-evaluation numpy os pandas standardization

Last synced: 04 May 2026

https://github.com/kostrykin/blas-benchmark

A benchmark comparison of different BLAS backends for NumPy.

benchmark blas cvxpy mkl numpy openblas python

Last synced: 04 May 2026

https://github.com/md-emon-hasan/learnpython.org

A repository on the Python programming language, covering fundamental concepts, advanced topics, and practical applications for beginners.

exception-handling function generator lambda-functions numpy numpy-library python reduce

Last synced: 04 May 2026

https://github.com/flazefy2/ds-global_cybersecurity_threats

https://www.kaggle.com/datasets/atharvasoundankar/global-cybersecurity-threats-2015-2024

data-science data-visualization numpy python squarify statistics

Last synced: 05 May 2026

https://github.com/winter000boy/my-project-portfolio

Welcome to My Project Portfolio! This repository is a comprehensive collection of my work in Java programming, Python development, Machine Learning (ML), Artificial Intelligence (AI), and Web Development. Organized for ease of access, this portfolio highlights innovative and practical projects tailored to solve real-world challenges.

ai artificial-intelligence artificial-neural-networks bootstrap data-science deep-learning flask-application java-projects machine-learning numpy pandas-python potfolio-site python-projects python3 spring-boot tkinter-gui web-development

Last synced: 05 May 2026

https://github.com/anupam0202/contextual-rag-chatbot

Contextual RAG Chatbot that processes PDF documents using the Google Gemini API

google-generativeai numpy pypdf2 scikit-learn streamlit

Last synced: 05 May 2026

https://github.com/mauriciovazquezm/machinelearning_course_spring2023

Machine Learning course tasks focused on implementing ML algorithms using libraries such as NumPy and pandas.

machine-learning machine-learning-algorithms numpy python r-language

Last synced: 05 May 2026

https://github.com/guidorice/hab-detection-wyvern-hsi

Harmful Algal Bloom (HAB) Detection from Wyvern hyperspectral imagery

gdal geopandas gis hyperspectral jupyter numpy python rasterio

Last synced: 05 May 2026

https://github.com/kiritoind/neural-network-for-cat-image-classification

This script defines essential functions for constructing a deep neural network from scratch for image classification. It includes activation functions like sigmoid and ReLU, data loading from H5 files, parameter initialization, forward and backward propagation for training, gradient descent for parameter updates, and functions for prediction.

classification computer-vision deep-learning deep-neural-networks jupyter-notebook numpy python

Last synced: 05 May 2026

https://github.com/flazefy2/ds-customer_shopping

https://www.kaggle.com/datasets/bhadramohit/customer-shopping-latest-trends-dataset

data-science data-visualization jupiter-notebook numpy python sales shopping squarify statistics

Last synced: 05 May 2026

https://github.com/dan-shields/rocketleagueirl

Rocket League comes to Real Life!

arduino numpy opencv2 python robotics rocket-league

Last synced: 05 May 2026

https://github.com/lintangwisesa/python_recommendation_system

Basic recommendation system using Python & Sklearn

numpy pandas python recommendation-system sklearn

Last synced: 05 May 2026

https://github.com/sarthak-1408/rain-fall-prediction

This repository represents the End to End Machine Learning Project (Rain Fall Prediction in Australia).

heroku heroku-deployment machine-learning numpy pandas rain-fall rain-fall-prediction scikit-learn xgboost-algorithm

Last synced: 05 May 2026

https://github.com/heshanera/paclassifier

Online Passive Aggressive Binary Classifier

binary-classifier numpy passive-aggressive python

Last synced: 06 May 2026

https://github.com/scarblase/homeless-animals-analysis

A data-driven exploration of homeless animal statistics 🐶🐱. Analyze age distribution, shelter dynamics, and adoption patterns using Python, Pandas, and Seaborn.

animals data-analysis data-mining data-science data-science-projects data-visualization matplotlib matplotlib-pyplot numpy pandas plotly python python3 ukraine

Last synced: 06 May 2026

https://github.com/mmsaki/whale_portfolio

Determine which Whale portfolio is performing the best across multiple areas: volatility, returns, risk and Sharpe ratios.

datetime finance jupyter-notebook matplotlib numpy pandas pathlib python quantitative-finance seaborn stock-market trading-strategies

Last synced: 06 May 2026

https://github.com/khaymanii/house-price-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 06 May 2026

https://github.com/kirkalyn13/opensignal_autogenerate_report

Script used to generate results and a summary, including the trends of flagged provinces, from the raw Excel data file.

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/bredalis/numpy

✨ Library to work with arrays ✨

arrays data matrix numpy numpy-arrays numpy-library python

Last synced: 06 May 2026

https://github.com/rudrakshi99/movie-recommendation-system

Movie recommendation system that uses machine learning to predict user ratings for movies.

dataframes machine-learning matplotlib numpy pandas seaborn

Last synced: 07 May 2026

https://github.com/nemeslaszlo/emergency-911-calls

Analysis, with visualizations, of emergency (911) calls (Fire, Traffic, EMS) for Montgomery County, PA.

jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 06 May 2026

https://github.com/abhishek010397/logisticregression

Using Logistic Regression Classifier to Predict Target Using Three Features

csv iloc logistic-regression machine-learning numpy pandas-dataframe python3 sigmoid-function sklearn

Last synced: 06 May 2026

https://github.com/thevarunsharma/similar-word-finder

A web application with Python backend which finds words similar to a given word, using pre-trained GloVe word embeddings.

flask-application machine-learning nlp numpy python word-embeddings

Last synced: 07 May 2026

https://github.com/affromero/pixelcache

A versatile image processing library for Python with built-in support for caching, using Pillow, NumPy, and PyTorch.

image-processing numpy opencv pillow pytorch torchvision

Last synced: 07 May 2026

https://github.com/gehad-ahmed30/data-analysis

This repository offers a guide to data visualization with Pandas and includes practical examples and two complete projects. Learn to preprocess, analyze, and visualize datasets to solve real-world problems effectively.

analysis matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/asut00/machine-learning-program_42ai

Comprehensive Machine Learning path by 42AI: hands-on modules on regression, gradient descent, and real-world ML applications.

linear-regression machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 07 May 2026

https://github.com/sumonta056/bangladeshi-vehicle-number-plate-detection

Image Processing Pipeline: Enhance, rotate, extract features, and segment characters in images for text recognition and enhancement.

candy-edge connected-components edge-detection grayscale-images morphological-analysis numpy opencv python radon-transform scikit-image sobel-edge-detector

Last synced: 07 May 2026

https://github.com/nitsc/spell-from-threebodytrilogy

Implements a pipeline that goes from Gaia stellar data to 3D visualizations, to three-views, to three-view signals, to audio of those signals, and even their inversions. This project demonstrates the feasibility of Luo Ji's (Logic's) “spell” from “The Three-Body Problem” trilogy.

3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization

Last synced: 02 May 2026