An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program

data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

Last synced: 09 Apr 2026

https://github.com/blinksta1ker/co2-prediction

EDA and Supervised ML model using Regression to predict Co2 Emissions from Vehicles

jupyter-notebook machine-learning numpy pandas python regression seaborn sklearn supervised-learning

Last synced: 09 Apr 2026

https://github.com/dpgitaccount/project-customer-segmentation-analysis-with-power-bi

This project segments customers based on their purchasing behavior to identify different target groups. It demonstrates skills in data analysis, clustering, and visualization using Power BI.

kmeans-clustering-algorithm linear-regression numpy pandas powerbi python seaborn segmentation

Last synced: 09 Apr 2026

https://github.com/hansalemaos/npsearchsequence

Sequence / string search with NumPy

fast numpy search sequence string

Last synced: 19 May 2026

https://github.com/klima7/numpynet

Convolutional Neural Network written from scratch using numpy with API similar to tensorflow.

cnn convolution network nn numpy scratch tensorflow

Last synced: 24 Sep 2025

https://github.com/alejoduarte23/fast_fdd

Fast implementation of frequency domain decomposition (FDD) in python with multiple identification techniques

numpy scipy-signal

Last synced: 10 May 2026

https://github.com/aravinda-1402/covid-detection-model-using-chest-x-ray

The objective of this project is to develop a Deep Learning Model to identify the X-Rays of healthy vs. Pneumonia (Corona) afflicted patients using the Chest X-Ray dataset, and use this model to power the AI application to test the Corona Virus in a faster phase.

classification cnn covid flask keras numpy pandas tensorflow

Last synced: 09 Apr 2026

https://github.com/ashar97/advanced-machine-learning

Advanced Machine Learning MOOC Specialisation Assignments

keras numpy python3 tensorflow

Last synced: 09 Apr 2026

https://github.com/0xunkn0wn4m1r/data_engineering_banking_project

🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.

automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning

Last synced: 09 Apr 2026

https://github.com/jjwizardmp/gallery-art-problem

Resolve the Gallery Art Problem implement BFS and Delaunay Triangulation

jupyter jupyter-notebook jupyterlab mathplotlib numpy pandas python python-env python3 scipy shapely

Last synced: 09 Apr 2026

https://github.com/paul-bokelman/nn

Basic neural network in python

machine-learning neural-networks numpy python

Last synced: 09 May 2026

https://github.com/rezapace/machine-learning-sound-detection

Welcome to the Machine Learning Detection Sound project! This project harnesses the power of machine learning to analyze car sounds, enabling the detection of vehicles based on their audio signatures.

colab-notebook googlecolab jupyter-notebook machine-learning numpy pathlib python python3 sound source tensorflow voice voice-recognition

Last synced: 09 Apr 2026

https://github.com/niteshchawla/delhivery-featureengineering

The Data team builds intelligence and capabilities using this data that helps them to widen the gap between the quality, efficiency, and profitability of their business versus their competitors.

feature-engineering hypothesis-testing matplotlib-pyplot numpy pandas-python scipy-stats seaborn

Last synced: 14 May 2026

https://github.com/sunilvarma-l/zomato-data-analysis-using-python

its a Python-based project to analyze and visualize Zomato restaurant data. This project explores various aspects of restaurant data, such as customer reviews, restaurant ratings, locations, and cuisines, with the aim of uncovering insights and trends in the food industry.

googlecollab matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/mrandolph95/grad-projects

Projects completed during my ASU grad program for Social Data Science.

beautifulsoup nltk numpy pandas regression-analysis sckiit-learn seaborn selenium

Last synced: 09 Apr 2026

https://github.com/kiok46/subset-sum-problem

Solving the Subset Sum Problem using Python, Pandas and Numpy.

numpy pandas python subset-sum

Last synced: 05 May 2026

https://github.com/oaerobert/python-for-finance

Implementations of financial mathematics and statistics using the "Python for Finance" textbook. Includes Monte Carlo simulations, predictive stock prices using ML strategies and real world data visualisations.

matplotlib numpy pandas python

Last synced: 09 Apr 2026

https://github.com/waghraj1699/codeclauseinternship_customer_segmentation_tool

Implementing customer segmentation using ML, covering exploring advantages, preprocessing, K-means clustering, and visualization.

data-preprocessing kmeans-clustering machine-learning numpy pandas visualization

Last synced: 08 May 2026

https://github.com/iv4n-ga6l/simple-image-segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 13 May 2026

https://github.com/bagusperdanay7/absa-with-bilstm-undergraduate-thesis

My undergraduate thesis program, Aspect-Based Sentiment Analysis Towards Matket Place Application Review Using Bidirectional Long Short-Term Memory used Python, Keras and Tensorflow

ai aspect-based-sentiment-analysis bilstm deep-learning gensim imbalanced-learning ipython-notebook keras machine-learning matplotlib natural-language-processing nltk numpy pandas python scikit-learn seaborn tensorflow

Last synced: 11 Apr 2026

https://github.com/niteshchawla/aerofit-casestudy

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 09 Apr 2026

https://github.com/mrmalik2512/catsvsdog.github.io

A CNN model integrated with flask backend the project is trained on image data of dogs and cats and integrated with a website predicts the given image is dog or a cat

deep-learning numpy python scikit-learn tensorflow

Last synced: 09 Apr 2026

https://github.com/lefteris-souflas/modern-slavery-analysis

Jupyter notebook using machine learning techniques to explore the complex drivers of modern slavery. Models from a research paper are replicated and evaluated . Actions also include filling missing data, training regression models, and analyzing feature importance.

decision-tree feature-importance grid-search-cv imputation jupyter-notebook lasso-regression linear-regression matplotlib mean-absolute-error numpy pandas preprocessing principal-component-analysis python3 random-forest ridge-regression scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/jahanostg/healthguard_with-smart-ai-diagnosis-hub

This project is a Flask-based platform for patients, donors, and doctors, integrating hospital management in one solution. It uses Machine Learning models for predictive health analysis and Power BI for interactive dashboards. Through this project, I enhanced my skills in full-stack development, applied ML, and healthcare data solutions.

css flask-api google-colaboratory-notebooks html5 js machine-learning mysql numpy pandas pkl-model powerbi python-3

Last synced: 09 Apr 2026

https://github.com/clchinkc/zombie

Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker

algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation

Last synced: 05 Apr 2026

https://github.com/xkomil/datasciencesummerstudy

I want to document in this repository all my studying data science topics

data-science data-visualization machine-learning meteostat numpy pandas seaborn sklearn streamlit-webapp

Last synced: 05 May 2026

https://github.com/ondiekelijah/data-analysis-udacity

Udacity Data Analysis Nano Degree course resources

jupyter-notebook matplotlib numpy pandas python

Last synced: 17 Mar 2025

https://github.com/adijo/multilayeredperceptron

An implementation of a multi-layered perceptron.

deep-learning machine-learning numpy

Last synced: 01 May 2026

https://github.com/jpgiant/training_project

Analyzing whether there is a difference between the average death ages of left handers and right handers using Bayesian Conditional Probability Theorem.

bayesian-statistics data-analysis data-visualization numpy pandas-dataframe python

Last synced: 30 Apr 2026

https://github.com/jalijuhola/amazon-textual-reviews-recommender-

predicting score and recommending using amazon textual reviews

numpy pandas python scikit-learn typescript

Last synced: 09 Apr 2026

https://github.com/ananupam/eda-stock_market

An Exploratory Data Analysis project done on Financial data read from a search engine (yahoo) to predict the trends of stocks of 4 Tech companies.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/josafatburmeister/circle_detection

A Python package for circle fitting.

circle numpy objectdetection pointcloud python

Last synced: 09 Apr 2026

https://github.com/vipinchaudhary31122002/valuevanguard

ValueVanguard is a machine learning project for accurate house price predictions. Using advanced algorithms and real-world data, it empowers users to analyze and forecast property values efficiently. Perfect for developers, data enthusiasts, and real estate professionals. πŸ πŸ“ˆ

machine-learning matplotlib numpy pandas python sklearn streamlit

Last synced: 12 Apr 2026

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 09 Mar 2026

https://github.com/gitluffywd/advanced_video_stabilization_and_roi_pixel_analysis

This repository offers tools for advanced video stabilization and pixel analysis, enhancing your video processing capabilities. Explore features like optical flow tracking and ROI selection to dive deeper into your video data. πŸ› οΈπŸŽ₯

data-structures debugging english-language git github html markdown numpy opencv project-management python

Last synced: 20 Jan 2026

https://github.com/iwitaly/aws-lambda-docker-builder

Docker image for building packages for AWS Lambda with Python environment

aws aws-lambda docker docker-image numpy python python3

Last synced: 09 Apr 2026

https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project

In this course, learned and solved assignments on SQL and Python. Final capstone project was on analyzing "Stock Market Data". Achieved 100% score in every assignment.

data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql

Last synced: 09 Apr 2026

https://github.com/1ayanabil1/100-days-of-python-bootcamp

Join me on my journey to code in Python every day for 100 days! 🐍 This challenge is designed to sharpen my programming skills, explore Python libraries, and build cool projects along the way.

data-structures data-structures-and-algorithms data-visualization django flask machine-learning matplotlib numpy pandas python seaborn web-development

Last synced: 09 Apr 2026

https://github.com/ravi0529/e-commerce-annual-spend-model

A basic Linear Regression model for predicting annual customer's spending

jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn scipy

Last synced: 09 Apr 2026

https://github.com/abhisek-13/multi-commodity-price-predictor

The Multi-Commodity Price Predictor is a machine learning project that predicts the prices of cars, bikes, and laptops. Using advanced techniques, it provides accurate price predictions based on user inputs. The project features an easy-to-use interface built with Streamlit, enabling users to get price predictions quickly and easily.

machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn sklearn-library streamlit

Last synced: 09 Apr 2026

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/erikbrinkman/homeotopy

A python library for computing homeomorphisms between some common continuous spaces

homeomorphism numpy

Last synced: 10 Apr 2025

https://github.com/djpetti/pyunits

Unit-aware Python with dimensional analysis support.

dimensional-analysis numpy python36 python37 python38 scientific-computing unit-conversions

Last synced: 17 Jan 2026

https://github.com/kahleryasla/signal-convolution

implementation of signal convolution in Python using both a custom function and NumPy's built-in convolve function. Also includes a function for recording and playing audio using the sounddevice library.

audio-recorder convolution matplotlib numpy python signal-processing sounddevice

Last synced: 02 Mar 2025

https://github.com/L-Gre/TradeFX-AI

A Python-based project exploring algorithmic trading strategies, including backtesting, real-time data integration, and predictive modeling with TensorFlow and Keras. Key topics include technical indicators, risk management, and leveraging AWS and broker APIs for automated trading.

algorithmic-trading aws backtesting broker-api keras machine-learning matplotlib numpy pandas portfolio-optimisation python real-time-data-integration risk-management technical-indicators tensorflow

Last synced: 26 Sep 2025

https://gitlab.com/Makman2/scipy-steadystate

SciPy extension modules for general purpose steady-state solvers and optimizations for differential state-equations!

numpy optimization scipy steady-state

Last synced: 07 Oct 2025

https://github.com/thchilly/mlds102_py_exercises

Complete exercise sets from MLDS Practical Data Science and Applications course

data-science matplotlib numpy pandas python scikit-learn scipy tensorflow

Last synced: 06 Apr 2026

https://github.com/kumartusha/Data_Scientist-AI-ML-_Training_Anudip_Foundation

Anudip Foundation Data Analytics Training πŸ“Š A repository containing projects, assignments, and coursework from the Anudip Foundation Data Analytics training program. This includes work on: 🧹 Data Cleaning & Preprocessing πŸ“‰ Data Visualization & EDA πŸ“Š Statistical Analysis

excel matplotlib numpy pandas powerbi python scipy sql

Last synced: 27 Sep 2025

https://github.com/srikarveluvali/dataanalysis

The "Dataset - Extraction, Analysis, and Visualization" project is a Python-based data analysis venture that focuses on exploring and interpreting the "Video Game Sales Analysis" dataset.

css data-analysis html javascript matplotlib numpy pandas python seaborn tableau

Last synced: 09 Apr 2026

https://github.com/singhvishal003/bert-sentiment

Sentiment Analysis Model To Detect the Sentiments.

beautifulsoup numpy pandas requests torch torchvision

Last synced: 09 Apr 2026

https://github.com/wasifsohail5/virtual_mouse

Virtual Mouse is an AI-powered, gesture-based control system that replaces traditional peripherals with hand movements. Using MediaPipe for hand tracking and PyAutoGUI for cursor control, it enables seamless clicking, scrolling, and navigation through simple gestures. Experience a futuristic, touchless interaction with your PC! πŸš€πŸ–±οΈ

cv2 mediapipe mediapipe-hands numpy opencv pyautogui python

Last synced: 02 Aug 2025

https://github.com/dimits-ts/computational_statistics

Fundemental ML algorithm mathematics, algebraic and Python implementations, data-based problem solving

applied-mathematics machine-learning numpy regression statistics

Last synced: 14 May 2026

https://github.com/saif-gitreps/face-mask-detection

A CNN classification model scoring 94% accuracy built from scratch using a dataset that detects if an image of a person has Mask on.

cnn-classification image-classification image-processing kaggle keras numpy pandas

Last synced: 09 Apr 2026

https://github.com/magnitopic/ft-linear-regression

Your first implementation of a machine learning algorithm. Predicting the price of a car by it's milage

42school ai ai-algorithm ft-linear-regression linear-regression machine-learning matplotlib numpy pandas prediction-algorithm python

Last synced: 16 Apr 2026

https://github.com/azaz9026/eda

Exploratory Data Analysis (EDA) refers to the method of studying and exploring record sets to apprehend their predominant traits, discover patterns, locate outliers, and identify relationships between variables. EDA is normally carried out as a preliminary step before undertaking extra formal statistical analyses or modeling.

data-cleaning data-visualization encoding machine-learning matplotlib numpy pandas plotly python3 seaborn sklearn-library

Last synced: 15 Apr 2026

https://github.com/tigureis/data-preparation-from-kickstarter-campaigns

Kickstarter Data Prep: A hands-on guide to basic data cleaning and transformation.

data-cleaning data-construction data-integration data-science data-selection numpy pandas

Last synced: 19 Apr 2026

https://github.com/lc-rezende/eqx_boston_dataset

Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.

data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/house-price-prediction-

We all have experienced a time when we have to look up for a new house to buy. But then the journey begins with a lot of frauds, negotiating deals, researching the local areas and so on. So to deal with this kind of issues Today, I prepared a MACHINE LEARNING Based model, trained on the House Price Prediction Dataset.

catboost-classifier linear-regression machine-learning matplotlib numpy pandas python random-forest seaborn svm

Last synced: 09 Apr 2026

https://github.com/docsallover/lbw-detection-in-cricket

LBW Detection in Cricket: A Deep Dive with OpenCV & NumPy

ball-detection cricket cvzone data-science lbw machine-learning numpy opencv python python3

Last synced: 08 May 2026

https://github.com/prarthana-singh/heart-attack-prediction-model

A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.

classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn

Last synced: 25 Jun 2025

https://github.com/hansalemaos/colorcountcython

This module provides functionality to count unique colors in an image represented as a NumPy array and, optionally, group their coordinates. It includes a Cython-based implementation for improved performance.

colors colour colours count cv2 numpy

Last synced: 01 May 2026

https://github.com/neofelis2x/gl3wxpyg

A playground and testing repo to implement OpenGL 3.3 in a WxPython interface using pyglet as an OpenGL-wrapper.

3d-graphics numpy opengl pyglet wxpython

Last synced: 11 May 2026

https://github.com/akshaypatra/cardiovascular_disease_detection

AI-driven ECG classification model that detects cardiovascular abnormalities such as arrhythmia and atrial fibrillation using a hybrid CNN-LSTM deep learning approach.

keras matplotlib numpy pandas python3 scikit-learn seaborn tensorflow wfdb

Last synced: 14 Apr 2026

https://github.com/mayankmittal29/duplifinder-quora-clone-catcher

An advanced system for detecting semantically duplicate question pairs using cutting-edge NLP techniques. Combines traditional ML models (XGBoost, SVM, Random Forest) with deep learning architectures (BiLSTM, Siamese Networks, Transformers) and contextual embeddings (BERT, RoBERTa). Features engineered using token similarity, fuzzy matching, and em

bert bilstm cross-validation eda fastext fuzzy-matching glove numpy pandas python3 quora-question-pairs random-forest roberta seaborn stemming svm tf-idf transformers word2vec xgboost

Last synced: 15 Apr 2026

https://github.com/chandkund/titanic-survival-prediction

Welcome to the Titanic Survival Prediction project! 🌊 This project predicts passenger survival on the Titanic using machine learning models. By analyzing features like class, gender, and age, we aim to uncover the factors that influenced survival rates during this historic event. Dive in to explore the insights and models developed!

data-science machine-learning matplotlib numpy python seaborn sklearn visualization

Last synced: 07 Apr 2026

https://github.com/kanika300393/loan_prediction

This project implements a Loan Prediction system using Support Vector Machine (SVM). It includes data preprocessing, visualization of features like income and education, and model evaluation. The goal is to predict loan approval based on the dataset. Clone the repo to explore the code and improve the model.

data-science machine-learning numpy pandas python scikit-learn svm-classifier

Last synced: 09 Apr 2026

https://github.com/sibendud/csi_2024_dataengineering

Internship on Data Engineering where below topics are applied skills that are used to complete the given tasks through out 8 weeks including the project.

adf adls azure-pipelines databricks docker ipynb json numpy pandas-python parquet-avro pipeline pyspark python sql sql-server

Last synced: 07 Mar 2026

https://github.com/hoyirul/flask_test

This is a Python programming using flask and has been set up with the MVC concept and can connect to the database

flask mysql numpy python3 restful-api

Last synced: 07 Apr 2026

https://github.com/h12m54am/numpy-matplotlib

Best for people who are using it for mathematical and statistical work. Not really a tutorial, but rather a bunch of examples that can help you understand Numpy and Matplotlib

jupyter-notebook learning-by-doing numpy pandas python

Last synced: 18 Apr 2026

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 07 Apr 2026

https://github.com/simon2k/stock-price-prediction-evaluation

This project is indented to present a small evaluation of different types of regression models for predicting stock prices for AAPL.

evaluation machine-learning numpy pandas predicting-stock-prices scikit-learn

Last synced: 07 Apr 2026

https://github.com/abolfazl-younesi/levy-walk

This repository contains a Python implementation of the LΓ©vy Walk simulation, a random walk model with a power-law step length distribution. The code simulates multiple walkers with boundary conditions and visualizes their trajectories for different values of the power-law exponent.

levy-walks matplotlib numpy python random-walk

Last synced: 02 May 2026

https://github.com/dipakxettri/numpy

This Repo is dedicated on learning NumPy a first step of learning DataScience/Machine Learning!

numpy python

Last synced: 18 May 2026

https://github.com/akshatkmistry/parkinsons_disease_predictor-voice_measures

This project implements a machine learning system to detect Parkinson's disease using voice measurements. The application uses a Random Forest classifier trained on voice feature data to predict the likelihood of Parkinson's disease with high accuracy (94%).

machine-learning matplotlib numpy pandas random-forest-classifier seaborn sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/manikantasanjay/emotion_prediction_kaggle_cnn

Real time Facial Emotion Recognition System with CNN Model Architecture making use of Transfer Learning, Data Augmentation, and Regularization Techniques.

cnn convolutional-neural-networks dataset deep-learning jupyter-notebook matplotlib numpy pandas python3 pytorch transfer-learning

Last synced: 11 Apr 2026

https://github.com/satheesh-meadi/real_time_financial_risk_dashboard

Financial Risk Analysis Dashboard πŸš€. An interactive Streamlit dashboard designed for analyzing and visualizing portfolio performance. Features include CAPM analysis, portfolio optimization, efficient frontier visualization, and real-time stock data to help optimize investments.

numpy pandas plotly plotly-express python3 scikit-learn streamlit yfinance

Last synced: 05 Apr 2026

https://github.com/alex-ber/alpine-python2

Alpine Linux based based image with Python 2.7 and necessary OS level dependencies (OpenSSL, gcc, ODBC driver, etc).

alpine alpine-linux dockerfile fortran gcc gcc-compiler numpy python python-2 python27

Last synced: 03 Oct 2025

https://github.com/youngermaster/coulomb-superposition-principle

Principio de SuperposiciΓ³n de Coulomb y Ley de Biot-Savart

coulomb-law matplotlib numpy simulation

Last synced: 18 May 2026

https://github.com/asherk7/neural-network-from-scratch

Creating a Convolutional Neural Network from scratch, modeled after ResNet

classification cnn ml neural-network numpy python resnet

Last synced: 18 May 2026