An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/adijo/multilayeredperceptron

An implementation of a multi-layered perceptron.

deep-learning machine-learning numpy

Last synced: 01 May 2026

https://github.com/jpgiant/training_project

Analyzing whether there is a difference between the average death ages of left-handers and right-handers using Bayes' theorem for conditional probability.

bayesian-statistics data-analysis data-visualization numpy pandas-dataframe python

Last synced: 30 Apr 2026

https://github.com/jalijuhola/amazon-textual-reviews-recommender-

Predicting scores and making recommendations using Amazon textual reviews.

numpy pandas python scikit-learn typescript

Last synced: 09 Apr 2026

https://github.com/ananupam/eda-stock_market

An exploratory data analysis project on financial data read from a search engine (Yahoo) to predict stock trends for 4 tech companies.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/josafatburmeister/circle_detection

A Python package for circle fitting.

circle numpy objectdetection pointcloud python

Last synced: 09 Apr 2026

https://github.com/vipinchaudhary31122002/valuevanguard

ValueVanguard is a machine learning project for accurate house price predictions. Using advanced algorithms and real-world data, it empowers users to analyze and forecast property values efficiently. Perfect for developers, data enthusiasts, and real estate professionals. 🏠📈

machine-learning matplotlib numpy pandas python sklearn streamlit

Last synced: 12 Apr 2026

https://github.com/estaheri7/dotawinpredictor

This project aims to predict the outcome of Dota 2 matches based on the heroes picked by both the Radiant and Dire teams

data-science dota2 machine-learning numpy pandas python random-forest sklearn support-vector-machines

Last synced: 09 Mar 2026

https://github.com/gitluffywd/advanced_video_stabilization_and_roi_pixel_analysis

This repository offers tools for advanced video stabilization and pixel analysis, enhancing your video processing capabilities. Explore features like optical flow tracking and ROI selection to dive deeper into your video data. 🛠️🎥

data-structures debugging english-language git github html markdown numpy opencv project-management python

Last synced: 20 Jan 2026

https://github.com/iwitaly/aws-lambda-docker-builder

Docker image for building packages for AWS Lambda with Python environment

aws aws-lambda docker docker-image numpy python python3

Last synced: 09 Apr 2026

https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project

In this course, I learned and solved assignments on SQL and Python. The final capstone project analyzed "Stock Market Data". Achieved a 100% score in every assignment.

data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql

Last synced: 09 Apr 2026

https://github.com/1ayanabil1/100-days-of-python-bootcamp

Join me on my journey to code in Python every day for 100 days! 🐍 This challenge is designed to sharpen my programming skills, explore Python libraries, and build cool projects along the way.

data-structures data-structures-and-algorithms data-visualization django flask machine-learning matplotlib numpy pandas python seaborn web-development

Last synced: 09 Apr 2026

https://github.com/ravi0529/e-commerce-annual-spend-model

A basic linear regression model for predicting customers' annual spending.

jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn scipy

Last synced: 09 Apr 2026

https://github.com/abhisek-13/multi-commodity-price-predictor

The Multi-Commodity Price Predictor is a machine learning project that predicts the prices of cars, bikes, and laptops. Using advanced techniques, it provides accurate price predictions based on user inputs. The project features an easy-to-use interface built with Streamlit, enabling users to get price predictions quickly and easily.

machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn sklearn-library streamlit

Last synced: 09 Apr 2026

https://github.com/manishkumarpatel07/heartattack_risk_prediction

"Heart Attack Risk Prediction" uses machine learning to estimate the likelihood of a heart attack based on user-provided data like physical attributes, symptoms, and medical history. This system enables remote screening, identifying high-risk individuals, and easing medical system burdens by providing early, data-driven health risk assessments.

boruta knn-algorithm matplotlib numpy pandas python scikit-learn

Last synced: 09 Apr 2026

https://github.com/erikbrinkman/homeotopy

A Python library for computing homeomorphisms between some common continuous spaces.

homeomorphism numpy

Last synced: 10 Apr 2025

https://github.com/djpetti/pyunits

Unit-aware Python with dimensional analysis support.

dimensional-analysis numpy python36 python37 python38 scientific-computing unit-conversions

Last synced: 17 Jan 2026

https://github.com/kahleryasla/signal-convolution

Implementation of signal convolution in Python using both a custom function and NumPy's built-in convolve function. Also includes a function for recording and playing audio using the sounddevice library.

audio-recorder convolution matplotlib numpy python signal-processing sounddevice

Last synced: 02 Mar 2025
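
As a rough illustration of the comparison this entry describes, the sketch below checks a hand-written convolution against `np.convolve`. It is not taken from the repository: the function name `convolve_direct` and the sample values are assumptions made for the example, and the audio recording part is left out.

```python
import numpy as np

def convolve_direct(x, h):
    """Full discrete convolution of two 1-D sequences (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    h = np.asarray(h, dtype=float)
    y = np.zeros(len(x) + len(h) - 1)
    for n, x_n in enumerate(x):
        # Each input sample adds a scaled, shifted copy of h to the output.
        y[n:n + len(h)] += x_n * h
    return y

# Illustrative values only, not data from the repository.
signal = [1.0, 2.0, 3.0]
kernel = [0.0, 1.0, 0.5]

custom = convolve_direct(signal, kernel)
builtin = np.convolve(signal, kernel, mode="full")

print(custom)                        # [0.  1.  2.5 4.  1.5]
print(np.allclose(custom, builtin))  # True
```

The loop form keeps the definition visible; for long signals `np.convolve` or an FFT-based routine would normally be preferred.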

https://github.com/L-Gre/TradeFX-AI

A Python-based project exploring algorithmic trading strategies, including backtesting, real-time data integration, and predictive modeling with TensorFlow and Keras. Key topics include technical indicators, risk management, and leveraging AWS and broker APIs for automated trading.

algorithmic-trading aws backtesting broker-api keras machine-learning matplotlib numpy pandas portfolio-optimisation python real-time-data-integration risk-management technical-indicators tensorflow

Last synced: 26 Sep 2025

https://gitlab.com/Makman2/scipy-steadystate

SciPy extension modules for general purpose steady-state solvers and optimizations for differential state-equations!

numpy optimization scipy steady-state

Last synced: 07 Oct 2025

https://github.com/thchilly/mlds102_py_exercises

Complete exercise sets from MLDS Practical Data Science and Applications course

data-science matplotlib numpy pandas python scikit-learn scipy tensorflow

Last synced: 06 Apr 2026

https://github.com/kumartusha/Data_Scientist-AI-ML-_Training_Anudip_Foundation

Anudip Foundation Data Analytics Training 📊 A repository containing projects, assignments, and coursework from the Anudip Foundation Data Analytics training program. This includes work on: 🧹 Data Cleaning & Preprocessing 📉 Data Visualization & EDA 📊 Statistical Analysis

excel matplotlib numpy pandas powerbi python scipy sql

Last synced: 27 Sep 2025

https://github.com/srikarveluvali/dataanalysis

The "Dataset - Extraction, Analysis, and Visualization" project is a Python-based data analysis venture that focuses on exploring and interpreting the "Video Game Sales Analysis" dataset.

css data-analysis html javascript matplotlib numpy pandas python seaborn tableau

Last synced: 09 Apr 2026

https://github.com/singhvishal003/bert-sentiment

A sentiment analysis model for detecting sentiment in text.

beautifulsoup numpy pandas requests torch torchvision

Last synced: 09 Apr 2026

https://github.com/wasifsohail5/virtual_mouse

Virtual Mouse is an AI-powered, gesture-based control system that replaces traditional peripherals with hand movements. Using MediaPipe for hand tracking and PyAutoGUI for cursor control, it enables seamless clicking, scrolling, and navigation through simple gestures. Experience a futuristic, touchless interaction with your PC! 🚀🖱️

cv2 mediapipe mediapipe-hands numpy opencv pyautogui python

Last synced: 02 Aug 2025

https://github.com/dimits-ts/computational_statistics

Fundamental ML algorithm mathematics, algebraic and Python implementations, data-based problem solving

applied-mathematics machine-learning numpy regression statistics

Last synced: 14 May 2026

https://github.com/saif-gitreps/face-mask-detection

A CNN classification model, built from scratch and scoring 94% accuracy, that detects whether the person in an image is wearing a mask.

cnn-classification image-classification image-processing kaggle keras numpy pandas

Last synced: 09 Apr 2026

https://github.com/magnitopic/ft-linear-regression

Your first implementation of a machine learning algorithm. Predicting the price of a car by its mileage.

42school ai ai-algorithm ft-linear-regression linear-regression machine-learning matplotlib numpy pandas prediction-algorithm python

Last synced: 16 Apr 2026

https://github.com/azaz9026/eda

Exploratory Data Analysis (EDA) is the practice of studying and exploring data sets to understand their main characteristics, discover patterns, locate outliers, and identify relationships between variables. EDA is normally carried out as a preliminary step before more formal statistical analysis or modeling.

data-cleaning data-visualization encoding machine-learning matplotlib numpy pandas plotly python3 seaborn sklearn-library

Last synced: 15 Apr 2026

https://github.com/tigureis/data-preparation-from-kickstarter-campaigns

Kickstarter Data Prep: A hands-on guide to basic data cleaning and transformation.

data-cleaning data-construction data-integration data-science data-selection numpy pandas

Last synced: 19 Apr 2026

https://github.com/lc-rezende/eqx_boston_dataset

Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.

data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/harmanveer-2546/house-price-prediction-

Most of us have, at some point, had to look for a new house to buy. The search comes with the risk of fraud, deal negotiation, research into local areas, and so on. To help with these issues, I prepared a machine-learning-based model trained on the House Price Prediction Dataset.

catboost-classifier linear-regression machine-learning matplotlib numpy pandas python random-forest seaborn svm

Last synced: 09 Apr 2026

https://github.com/docsallover/lbw-detection-in-cricket

LBW Detection in Cricket: A Deep Dive with OpenCV & NumPy

ball-detection cricket cvzone data-science lbw machine-learning numpy opencv python python3

Last synced: 08 May 2026

https://github.com/prarthana-singh/heart-attack-prediction-model

A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.

classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn

Last synced: 25 Jun 2025

https://github.com/hansalemaos/colorcountcython

This module provides functionality to count unique colors in an image represented as a NumPy array and, optionally, group their coordinates. It includes a Cython-based implementation for improved performance.

colors colour colours count cv2 numpy

Last synced: 01 May 2026

https://github.com/neofelis2x/gl3wxpyg

A playground and testing repo to implement OpenGL 3.3 in a WxPython interface using pyglet as an OpenGL-wrapper.

3d-graphics numpy opengl pyglet wxpython

Last synced: 11 May 2026

https://github.com/akshaypatra/cardiovascular_disease_detection

AI-driven ECG classification model that detects cardiovascular abnormalities such as arrhythmia and atrial fibrillation using a hybrid CNN-LSTM deep learning approach.

keras matplotlib numpy pandas python3 scikit-learn seaborn tensorflow wfdb

Last synced: 14 Apr 2026

https://github.com/mayankmittal29/duplifinder-quora-clone-catcher

An advanced system for detecting semantically duplicate question pairs using cutting-edge NLP techniques. Combines traditional ML models (XGBoost, SVM, Random Forest) with deep learning architectures (BiLSTM, Siamese Networks, Transformers) and contextual embeddings (BERT, RoBERTa). Features engineered using token similarity, fuzzy matching, and em

bert bilstm cross-validation eda fastext fuzzy-matching glove numpy pandas python3 quora-question-pairs random-forest roberta seaborn stemming svm tf-idf transformers word2vec xgboost

Last synced: 15 Apr 2026

https://github.com/chandkund/titanic-survival-prediction

Welcome to the Titanic Survival Prediction project! 🌊 This project predicts passenger survival on the Titanic using machine learning models. By analyzing features like class, gender, and age, we aim to uncover the factors that influenced survival rates during this historic event. Dive in to explore the insights and models developed!

data-science machine-learning matplotlib numpy python seaborn sklearn visualization

Last synced: 07 Apr 2026

https://github.com/kanika300393/loan_prediction

This project implements a Loan Prediction system using Support Vector Machine (SVM). It includes data preprocessing, visualization of features like income and education, and model evaluation. The goal is to predict loan approval based on the dataset. Clone the repo to explore the code and improve the model.

data-science machine-learning numpy pandas python scikit-learn svm-classifier

Last synced: 09 Apr 2026

https://github.com/sibendud/csi_2024_dataengineering

Data engineering internship in which the skills listed below were applied to complete the assigned tasks, including the project, over 8 weeks.

adf adls azure-pipelines databricks docker ipynb json numpy pandas-python parquet-avro pipeline pyspark python sql sql-server

Last synced: 07 Mar 2026

https://github.com/hoyirul/flask_test

A Python project using Flask, set up with the MVC pattern, that can connect to a database.

flask mysql numpy python3 restful-api

Last synced: 07 Apr 2026

https://github.com/h12m54am/numpy-matplotlib

Best for people who are using it for mathematical and statistical work. Not really a tutorial, but rather a bunch of examples that can help you understand NumPy and Matplotlib.

jupyter-notebook learning-by-doing numpy pandas python

Last synced: 18 Apr 2026

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 07 Apr 2026

https://github.com/simon2k/stock-price-prediction-evaluation

This project is intended to present a small evaluation of different types of regression models for predicting stock prices for AAPL.

evaluation machine-learning numpy pandas predicting-stock-prices scikit-learn

Last synced: 07 Apr 2026

https://github.com/abolfazl-younesi/levy-walk

This repository contains a Python implementation of the Lévy Walk simulation, a random walk model with a power-law step length distribution. The code simulates multiple walkers with boundary conditions and visualizes their trajectories for different values of the power-law exponent.

levy-walks matplotlib numpy python random-walk

Last synced: 02 May 2026
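
To make the idea of a power-law step length distribution concrete, here is a minimal sketch that is not taken from the repository. It draws step lengths with density proportional to `l**(-mu)` for `l >= l_min` by inverse-transform sampling and records only the turning points of a single 2-D walker. The function name `levy_walk`, its parameters, and the clipping boundary are all assumptions; the repository's own boundary conditions are not stated in the description.

```python
import numpy as np

def levy_walk(n_steps, mu=2.0, l_min=1.0, box=100.0, seed=0):
    """Turning points of a 2-D walk with power-law step lengths (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    # Inverse-transform sampling: u is uniform on [0, 1), so 1 - u lies in (0, 1].
    u = rng.random(n_steps)
    lengths = l_min * (1.0 - u) ** (-1.0 / (mu - 1.0))
    # Directions are drawn uniformly on the circle.
    angles = rng.uniform(0.0, 2.0 * np.pi, n_steps)
    pos = np.zeros((n_steps + 1, 2))
    for i in range(n_steps):
        step = lengths[i] * np.array([np.cos(angles[i]), np.sin(angles[i])])
        # Assumed boundary rule: clamp the walker to the square [-box, box]^2.
        pos[i + 1] = np.clip(pos[i] + step, -box, box)
    return pos, lengths

path, lengths = levy_walk(1000, mu=2.0)
print(path.shape)  # (1001, 2)
```

Smaller values of `mu` (closer to 1) give heavier tails and therefore more frequent long jumps, which is the effect a comparison across exponents would show.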

https://github.com/dipakxettri/numpy

This repo is dedicated to learning NumPy, a first step toward learning data science and machine learning!

numpy python

Last synced: 18 May 2026

https://github.com/akshatkmistry/parkinsons_disease_predictor-voice_measures

This project implements a machine learning system to detect Parkinson's disease using voice measurements. The application uses a Random Forest classifier trained on voice feature data to predict the likelihood of Parkinson's disease with high accuracy (94%).

machine-learning matplotlib numpy pandas random-forest-classifier seaborn sklearn streamlit

Last synced: 09 Apr 2026

https://github.com/manikantasanjay/emotion_prediction_kaggle_cnn

Real time Facial Emotion Recognition System with CNN Model Architecture making use of Transfer Learning, Data Augmentation, and Regularization Techniques.

cnn convolutional-neural-networks dataset deep-learning jupyter-notebook matplotlib numpy pandas python3 pytorch transfer-learning

Last synced: 11 Apr 2026

https://github.com/satheesh-meadi/real_time_financial_risk_dashboard

Financial Risk Analysis Dashboard 🚀. An interactive Streamlit dashboard designed for analyzing and visualizing portfolio performance. Features include CAPM analysis, portfolio optimization, efficient frontier visualization, and real-time stock data to help optimize investments.

numpy pandas plotly plotly-express python3 scikit-learn streamlit yfinance

Last synced: 05 Apr 2026

https://github.com/alex-ber/alpine-python2

Alpine Linux-based image with Python 2.7 and necessary OS-level dependencies (OpenSSL, gcc, ODBC driver, etc).

alpine alpine-linux dockerfile fortran gcc gcc-compiler numpy python python-2 python27

Last synced: 03 Oct 2025

https://github.com/youngermaster/coulomb-superposition-principle

Coulomb's superposition principle and the Biot-Savart law

coulomb-law matplotlib numpy simulation

Last synced: 18 May 2026

https://github.com/asherk7/neural-network-from-scratch

Creating a Convolutional Neural Network from scratch, modeled after ResNet

classification cnn ml neural-network numpy python resnet

Last synced: 18 May 2026

https://github.com/dineshdhamodharan24/data_science_final_project

Customer Insights & Recommendation System: Harnessing Decision Tree, Logistic Regression, and Random Forest models for behavior analysis. Utilizing EasyOCR and Python Imaging Library for image information extraction. Employing NLTK for sentiment analysis on textual data

classification final-project guvi-projects image-processing nltk-python numpy ocr pandas recommendation-system sentiment-analysis sklearn-library streamlit text-processing worldcloud

Last synced: 02 May 2026

https://github.com/dmarks84/coursework_project_data-analysis-apache-spark

Project for IBM Data Engineering & Python course on ETL & Big Data -- Read in data, wrote to SQL database and performed queries, performed statistical analysis and issued reports

apache-sprk automation dag data-modeling eda elt etl numpy pandas pipelines python sql statistics visualization

Last synced: 11 Apr 2026

https://github.com/jcardonamde/food-sales-predictions

A project that seeks to predict the sales of a food distributor by applying machine learning algorithms and data analysis.

data-science machine-learning machine-learning-algorithms numpy pandas pipelines python seaborn sklearn

Last synced: 05 Apr 2026

https://github.com/audrbar/py_site

Website for displaying data science projects, built with Streamlit

matplotlib numpy pandas python3 requests streamlit

Last synced: 11 Apr 2026

https://github.com/shoaib1522/data-science-in-python

"Unlock the power of data science with my curated collection of Python projects—your gateway to insights, analysis, and hands-on learning! 🌟"

analytics confusion-matrix data-science kaggle-dataset matplotlib mlp-classifier notebook numpy pandas python regression streamlit

Last synced: 04 Apr 2026

https://github.com/ramyacp14/sentimentanalysis

Implements a sentiment analysis model to determine the emotional tone behind text, helping understand attitudes, opinions, and emotions in online mentions.

machine-learning natural-language-processing nltk numpy pandas python scikit-learn

Last synced: 07 Apr 2026

https://github.com/sk-g/mnist_beginners

Model search in traditional machine learning algorithms (non DL) and DL starter codes on MNIST dataset. This is a good starter code for beginners trying to learn about curse of dimensionality, overfitting and other concepts in general

keras machine-learning machine-learning-algorithms mnist mnist-beginners mnist-classification mnist-dataset numpy overfitting python pytorch pytorch-implmention resnet resnet-50 scikit-learn scikitlearn-machine-learning sklearn tensorflow

Last synced: 09 Apr 2026

https://github.com/jainish-prajapati/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 30 Dec 2025

https://github.com/myahninsi/credit_card_fraud_detection

This repository is for the Neural Networks and Deep Learning Course - Assignment 1, focusing on credit card fraud detection. The project utilizes a machine learning model to predict whether a transaction is fraudulent using a synthetic credit card dataset.

matplotlib numpy pandas pickle python scikit-learn seaborn streamlit

Last synced: 09 Apr 2026

https://github.com/maxbleu/f1analisys

A Python repository for analyzing and visualizing Formula 1 data. Includes tools for race statistics, performance comparisons, and custom visualizations.

data-science f1 f1-analisis f1api fastf1 formula1-analysis formula1-data-analysis matplotlib-python motorsport numpy pandas python seaborn

Last synced: 12 Sep 2025

https://github.com/aasmirnov-webdev/data_science_projects

A collection of all completed study projects from the Yandex.Practicum "Data Science Specialist" course.

bert catboost data-science database lgbm mashine-learning matplotlib numpy pandas python pytorch scikit-learn scipy seaborn sql xgboost

Last synced: 06 Apr 2026

https://github.com/hellenilda/ml-housing-price-prediction

A simple project for predicting house prices using linear regression.

artificial-intelligence machine-learning matplotlib numpy python sklearn streamlit

Last synced: 05 Apr 2026

https://github.com/andreazoccatelli/light_permanova

A lightweight implementation of PERMANOVA based on Euclidean distance from centroid

computervision numpy permanova pytorch statistics tabular-data

Last synced: 08 May 2026

https://github.com/philcs/electromag

Electromagnetism, Université Laval, 2017

matplotlib numpy python

Last synced: 18 May 2026

https://github.com/bonifatius94/backprop

Deep Learning Reference Implementation

deep-learning numpy reference-implementation

Last synced: 18 Apr 2026

https://github.com/coder5omkar/LendingClubCaseStudy

Performed exploratory data analysis (EDA) on the loan dataset from the Lending Club Case Study to identify the key factors influencing loan defaults.

bivariate-analysis data-science exploratory-data-analysis matplotlib numpy pandas seaborn univariate-analysis

Last synced: 21 Jul 2025

https://github.com/AsuquoAA/Energy-_GDP-_and_Scimago_Rankings_Analysis-2006-2015

Analysis of top 15 countries in energy, renewables, and GDP using three global datasets.

data-cleaning numpy pandas

Last synced: 21 Jul 2025

https://github.com/AsuquoAA/Big_4_Sports_Teams_and_City_Population_Analysis-2018-

Analysis of sports teams' win/loss ratios vs. metro area populations across NFL, NBA, MLB, and NHL.

data-cleaning-and-preprocessing numpy pandas

Last synced: 21 Jul 2025

https://github.com/ireneflorez/sf-kindergarten-lottery

Data analysis reviewing data and findings on the public school kindergarten lottery in San Francisco

ipynb-jupyter-notebook jupyter-notebook numpy pandas python sql

Last synced: 17 Feb 2026

https://github.com/ashishsingh789/bcg_virtual_internship

This repository showcases my BCG X virtual internship project on customer churn analysis for PowerCo, covering business understanding, EDA, feature engineering, and modeling using Python and machine learning.

data-manipulation data-science dataanalysis datavisualization eda machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/exclowd/gg

A library for creating terminal based games in python.

numpy python

Last synced: 18 May 2026

https://github.com/scikit-plots/lightnumpy

A lightweight version of NumPy or CuPy (or similar functionality).

cupy lightnumpy numcpp numpy python

Last synced: 17 Feb 2026

https://github.com/paulomppatricio/projeto_challenge_telecomx-br_parte-2

TelecomX-BR Challenge project, Part 2: Data Science track of the ONE (Oracle Next Education) program, in partnership with Alura.

data-science imblearn joblib machine-learning matplotlib modelos-preditivos numpy pandas python scipy seaborn sklearn statsmodels xgboost

Last synced: 12 Apr 2026

https://github.com/muzahid0408/student_marks_analysis_using_numpy

This project analyzes a student marks dataset to calculate key statistical measures such as mean, median, standard deviation, and more.

datasets jupyter-notebook numpy numpy-arrays pythoh

Last synced: 04 May 2026