An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/hasanyahya101/pareto-tutorial-python

This tutorial demonstrates how to create Pareto charts using Python libraries like pandas, NumPy, and Matplotlib. All code for this is given in the Jupyter Notebook.

chart charts code-blocks ipynb jupyter-notebook license markdown matplotlib mit numpy pandas pareto py python

Last synced: 07 Feb 2026

https://github.com/adamouization/python-machine-learning-data-science-notes

:orange_book: Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/jbalooshie/movies-etl

Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.

data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/giihcastro/trabalho-fisica

3D VPython Electromagnetic Particle Motion Simulator

numpy python vpython

Last synced: 15 Apr 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/khalidelboray/image_processing

Notes about image processing

image-processing numpy python3

Last synced: 11 May 2026

https://github.com/jimmymugendi/bulding-a-decision-tree-to-predict-customer-churn

This repo desribes bulding a decision tree to predict customer churn in a given organisation

accuracy-score decision-tree-classifier matplotlib-pyplot numpy pandas-dataframe scikit-learn

Last synced: 07 May 2026

https://github.com/naveen88112/final_education

Student Performance Prediction This project examines the student performance data, pre-processes the features, and implements machine learning methods (Random Forest) for the forecasting of final grades. The data is subjected to exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/arnie1x/massive-dataset-lab

This project involves basic data manipulation with JSON files, focusing on tasks related to data processing of the MASSive dataset and file management.

data-science numpy pandas pandas-dataframe python3

Last synced: 07 May 2026

https://github.com/khinthandarkyaw98/python_for_engineers

This particular Python notebook is designed to provide Engineers with an opportunity to practice scientific computations.

engineering numpy python scientific-computing youtube

Last synced: 16 Apr 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/mustafaozvardar/image-classification-webapp

This project features an AI model for diagnosing pneumonia from chest X-ray images. Users can upload a chest X-ray image through the app, and the model will classify it.

artificial-intelligence computer-vision keras numpy python streamlit streamlit-webapp tensorflow

Last synced: 27 Feb 2026

https://github.com/karthikarajagopal44/numpy-beginner-to-advanced

This repository is designed to help you master NumPy, the fundamental package for scientific computing with Python. Whether you are just getting started or looking to refine your skills with advanced techniques, this repo covers a comprehensive range of topics and functionalities. It covers the range of topics from beginner to advance.

data-science numpy numpy-library numpy-tutorial python

Last synced: 16 Apr 2026

https://github.com/ywatanabe1989/scitex-io

Universal scientific data I/O with plugin registry — save/load 30+ formats with one API. Part of SciTeX.

cli csv data-io hdf5 mcp numpy openscience pandas plugin-registry python research scientific-computing scitex

Last synced: 07 Jun 2026

https://github.com/bharatsharma07/ipl-analysis

This project provides insights into the IPL's historical data, helping to identify trends, player performance, and team strategies.

data-visualization ipl-data-analysis numpy pandas python

Last synced: 16 Apr 2026

https://github.com/aciluca/credit-approval-system

A machine learning application to predict credit card approval using Python, TensorFlow, and Docker.

data-science keras machine-learning numpy pandas python sklearn tensorflow

Last synced: 11 Apr 2026

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 16 Apr 2026

https://github.com/arserchung/ml_cnn

Basis CNN intro& project of recognize to cat or dog

glob keras keras-models numpy opencv os sklearn tensorflow

Last synced: 13 Feb 2026

https://github.com/anibalalpizar/python-machine-learning-example

This code reads and preprocesses a dataset for classification using pandas, numpy, matplotlib and scikit-learn. The dataset is split into three parts for training, validation and testing. The data is then scaled and optionally oversampled for balanced classes.

machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/rakesh3596/tutedude_projects

It consist of 3 major project 1:Covid19 Analysis 2:Rainfall Prediction 3:Tumor Detection

csv-files matplotlib numpy pandas plotly python sklearn-library

Last synced: 16 Apr 2026

https://github.com/4strium/predict-french-tv-audience

🎬 A beginner's machine learning project whose aim is to predict the audience for films on TV in France, based on previous data provided by the CNC and Médiamétrie.

machine-learning numpy pandas python tv xgboost

Last synced: 07 May 2026

https://github.com/andreashovaldt/simple-neural-network

A simple neural network, created with numpy, based on Stochastic Gradient Descent, used for classifying the MNIST dataset

mnist-classification neural-network numpy stochastic-gradient-descent

Last synced: 16 Apr 2026

https://github.com/navid-kalaei/numpy-neural-network

Neural network implemented in Numpy to predict bike sharing on real data

deep-learning neural-network numpy python

Last synced: 07 May 2026

https://github.com/reddyprasade/bird-classifications-problem

Birds Classification by using Scikit-learn and Scikit-image

matplotlib numpy pandas python-3 sckiit-learn sckit-image

Last synced: 08 May 2026

https://github.com/awaisfraaz/python-libraries

Here’s a repository description for a project where youA repository dedicated to exploring and experimenting with Python’s main libraries, featuring sample code, documentation, and practical examples for learning and reference.

matplotlib numpy pandas python seaborn

Last synced: 15 Feb 2026

https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/jessicahora/studies-on-linear-algebra

Repositorio com Estudos sobre Algebra Linear.

linalg linear-algebra matplotlib-pyplot matrix numpy python scipy

Last synced: 01 Mar 2026

https://github.com/matheusafonseca/c111

Este repositório é dedicado ao armazenamento e organização dos códigos desenvolvidos na disciplina C111 - Análise de Dados, oferecida pelo Instituto Nacional de Telecomunicações (INATEL).

data-analysis matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/anujdutt9/reinforcement_learning

Reinforcement Learning using Numpy and PyTorch.

numpy python3 pytorch reinforcement-learning

Last synced: 16 Apr 2026

https://github.com/5hraddha/sda-megaline-the-best-plan

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget.

data-visualization hypothesis-testing matplotlib numpy pandas scipy seaborn statistical-data-analysis

Last synced: 16 Apr 2026

https://github.com/ejw-data/tableau-songs

Brief analysis of songs from 1890 - 2015 using pandas and Tableau

numpy pandas python songs tableau

Last synced: 16 Apr 2026

https://github.com/parbhat-cpp/suicidal-ml

A machine learning/NLP-based system to identify signs of suicidal ideation from user text inputs.

bash cicd classification docker fastapi githubactions jinja2 jupyter-notebook machine-learning natural-language-processing nlp numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/abhishekpsonawane07/loan-repayment-prediction-ml

Loan Approval Prediction uses machine learning to analyze factors like income and credit history, automating and improving the loan approval process. The project ensures accurate, efficient, and user-friendly predictions with a Gradio interface.

artificial-neural-networks confusion-matrix decision-trees gradio-interface k-nearest-neighbours loan-dataset loan-prediction-analysis loan-repayment logistic-regression numpy pandas python random-forest sckiit-learn seaborn support-vector-machines

Last synced: 16 Apr 2026

https://github.com/mugambi645/exploring-ebay-car-sales-data

Exploring ebay car sales dataset

car-sales data-analysis numpy pandas

Last synced: 16 Apr 2026

https://github.com/mashalk1/computer-vision-attendance-system

A computer vision based attendance system, using cv2, firebase, numpy, cvzone and face_recognition library in python

computer-vision cv2 cvzone face-recognition numpy

Last synced: 04 Feb 2026

https://github.com/michaelgermini/python-3d-programming-book

Complete Python & 3D Programming Book: 30 chapters, 120+ examples, real-time ray tracing, ML graphics, OpenGL, NumPy

3d-graphics computer-graphics educational game-development graphics-programming machine-learning numpy opengl python ray-tracing tutorial

Last synced: 16 Apr 2026

https://github.com/laurencewarne/eratosthenes

Random implementations of the sieve of Eratosthenes

algorithm eratosthenes numpy python3

Last synced: 16 Apr 2026

https://github.com/littie28/hazy-frames

Hierarchical coordinate frames - crystal clear transforms

3d coordinate-systems geometry numpy python reference-frames scipy transformations

Last synced: 18 Apr 2026

https://github.com/octavioduarte/datascience

A repository with study notes on Data Science concepts, NumPy, pandas, and an introduction to statistics.

data-science jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/pavanreddy565/infosys_python_for_datascience

Data Science Course Tasks Repository This repository contains tasks and projects completed during the Data Science course from Infosys Springboard. It includes setup instructions for running Jupyter Notebook sessions in a virtual environment.

matplotlib numpy pandas

Last synced: 29 Apr 2026

https://github.com/sundanc/besthousefinder

A Streamlit web application that helps users find the best house based on their preferences.

numpy pandas prediction python streamlit

Last synced: 17 Apr 2026

https://github.com/rohansoni45/ipl-win-predictor

A machine learning model that predicts the outcome of IPL matches based on historical data and player performance. Built using Python and libraries like Pandas, NumPy, and scikit-learn. This project demonstrates the use of classification algorithms to predict match outcomes.

classification data-science ipl machine-learning model-evaluation numpy pandas python sklearn sports-analytics

Last synced: 08 May 2026

https://github.com/hansalemaos/npchessimage

Creates a chessboard pattern as an image.

chessboard cv2 image numpy

Last synced: 05 Mar 2026

https://github.com/miozilla/treebasedmodel

treebasedmodel :evergreen_tree::airplane::dancer: : Invistico Airline | Tree-based Modeling, Machine Learning (ML), Predictive, Decision Making, GridSearchCV, sklearn |

accuracy decisiontreeclassifier exemplar experience f1-score flight hyperparameter logistic-regression matplotlib model numpy pandas plot-tree precision pyplot recall seaborn selection train-test-split tuning

Last synced: 03 Jun 2026

https://github.com/syedzaheerabbas/risk-analytics-with-python

This project focuses on developing a basic understanding of risk analytics in banking and financial services and understand how data is used to minimize the risk of losing money while lending to customers.

eda hypothesis-testing numpy pandas python risk-analysis seaborn

Last synced: 17 Apr 2026

https://github.com/shashwat9kumar/us-accidents-data-analysis

Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle

accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python

Last synced: 17 Apr 2026

https://github.com/kodemartin/np-pd-portfolio

Portfolio of numpy, pandas, and matplotlib solutions

matplotlib numpy pandas

Last synced: 17 Apr 2026

https://github.com/egrizq/bike_analysis

Exploring the bike sharing system to find the patterns behind user behavior by weather, season, hour, and months.

data-science matplot numpy python seaborn

Last synced: 08 May 2026

https://github.com/esraamhmd/intelligent-skin-cancer-image-based-detection-models

This project combines traditional machine learning approaches with advanced deep learning techniques to assist healthcare professionals in early diagnosis and improve patient outcomes.

ai deep-learning ham10000 isic kaggle machine-learning matplotlib numpy pandas python3 pytorch seaborn skin-cancer skin-detection sklearn time torchvision tqdm

Last synced: 02 Apr 2026

https://github.com/broodhoney/blue-book-for-bulldozers

This repository holds the project which solves a regression problem on predicting the futures sales of bulldozers. This is from a kaggle competition.

matplotlib numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/mac2bua/autoalgo

Autonomous algorithm optimization research - 19x speedup achieved through self-driven iterations

algorithm-optimization autoresearch numpy performance python

Last synced: 02 Apr 2026

https://github.com/mnagaharshithrao/diamondpriceprediction

Diamond Price Prediction is a machine learning project that predicts the price of diamonds based on features like carat, cut, color, clarity, and more. Using models such as XGBoost, Random Forest, and Gradient Boosting, the project aims to accurately forecast diamond prices, helping buyers and sellers in the diamond industry make informed decisions

machine-learning numpy pandas sckiit-learn xgboost

Last synced: 17 Apr 2026

https://github.com/kiarashyavari/python_interview

Practice intermediate to advance topics of Python here

decorators generator multiprocessing multithreading numpy pandas python python3

Last synced: 17 Apr 2026

https://github.com/rakibhhridoy/predictingsequence-timeseriesanalysis

Time series data easy handling purposes in TensorFlow and python functional programming. Time Series data is handled and pre-processed by different steps,like making window of the data,splitting data for model purpose is also different than other processing methods. All has combined for easy access.

forecasting functional-programming keras lstm lstm-neural-networks moving-average numpy python tensorflow time-series windowed

Last synced: 02 Apr 2026

https://github.com/christiemj/clock-drift-simulation

A Monte Carlo simulation in Python to model vacuum-induced frequency drifts in atomic clocks, with code and visuals (experimental setup, simulated vs. real-world comparison, and drift graph), as described in "Probing Vacuum-Induced Clock Drifts via Quantum Metrology: A Testable Hypothesis" (DOI: 10.5281/zenodo.15163879). Licensed under GPL 3.0.

allan-variance atomic-clocks cesium-ion clock-drift matplotlib monte-carlo-simulation numpy open-science pandas physics precision-metrology python quantum-metrology research-software scipy simulation vacuum-fluctuations vacuum-noise

Last synced: 17 Apr 2026

https://github.com/mgitrov/cnn-on-cifar-10

A computer vision project aiming to classify random images uploaded by the user.

artificial-neural-networks computer-vision convolutional-neural-networks deep-learning keras matplotlib numpy pil regularization tkinter

Last synced: 17 Apr 2026

https://github.com/harmanveer-2546/wafer-fault-detection

The goal is to eliminate manual work in identifying faulty wafers. Opening and handling suspected wafers disrupts the entire process. False negatives result in wasted time, manpower, and costs.

clustering data-transformation feature-selection machine-learning matplotlib numpy pandas python random-forest roc-auc-curve roc-auc-score seaborn sklearn svc xgboost

Last synced: 11 Apr 2026

https://github.com/neetigyab/pdf_form_parser

Older Test version of designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Apr 2026

https://github.com/noluyorabi/bravais-pearson-interactive

Bravais-Pearson Interactive – A web app for visualizing Pearson correlation. Generate datasets, analyze correlations, and explore regression lines, r², and p-values interactively.

data-visualization interactive-analysis matplotlib numpy pearson-correlation scipy seaborn statistics streamlit

Last synced: 17 Apr 2026

https://github.com/mwoss/mownit

Various tasks from numerical analysis laboratories solved using Python

jupyter-notebook mathematics matplotlib numerical-analysis numpy python scipy

Last synced: 17 Apr 2026

https://github.com/leandro-bertoluzzi/parchis-probabilities

Small weekend project to play around with numpy and help me taking decissions while playing parchís

matplotlib numpy python

Last synced: 08 May 2026

https://github.com/omogbolahan94/movies-budget-revenue-analysis

Data exploration, data cleaning, data analysis and predictive analysis

dataframe linear-regression matplotlib numpy pandas seaborn statistical-analysis

Last synced: 08 May 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/abdul-rafay19/california-housing-price-prediction

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/sundanc/weatherprediction

This project implements a weather prediction system that predicts the temperature based on real-time weather data, including features like humidity, wind speed, and day-related features (day of the week, month

machine-learning machinelearning numpy pandas programming python scikit-learn scikitlearn-machine-learning weather-prediction

Last synced: 18 Apr 2026

https://github.com/ricardorobledo/spamemailclassification

Spam email classification using machine learning (Random Forest, SVC, Logistic Regression, etc.) with data balancing techniques (SMOTE, BorderlineSMOTE, ADASYN). Final calibrated Random Forest model achieves ROC-AUC 0.982 and PR-AUC 0.979 on the Spam Email Classification dataset.

imbalanced-data imbalanced-learning numpy pandas python3 sklearn

Last synced: 05 Apr 2026