An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/carol-neto/sprint-4-statistical-data-analysis

In this project I had the opportunity to test my knowledge by analyzing a phone plan and creating graphs to compare the plans and determine which ones generate the most revenue.

matplotlib-pyplot numpy pandas pytho scipy-stats seaborn statistical-analysis

Last synced: 09 May 2026

https://github.com/raduldev/datasciencepython

Interview Task From CSIR-IGIB, Delhi & maps for journal

biopython colab-notebook counter jupyter-notebook matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/liannekiarra/university-dissertation

Combining Machine Learning Techniques with Multi-Objective Evolutionary Algorithms to Solve Real-World Engineering Problems

artificial-intelligence engineering hypervolume matplot multi-objective-optimization neuralnetwork nsga-ii numpy pareto-front pareto-optimality pymoo python3 sklearn

Last synced: 09 May 2026

https://github.com/gsaini/streamlit-ui-getting-started

Streamlit is an open-source Python framework for data scientists and AI/ML engineers to deliver dynamic data apps with only a few lines of code. Build and deploy powerful data apps in minutes. Let's get started!

codecov github-actions numpy pandas python streamlit unittest

Last synced: 14 Jun 2026

https://github.com/niteshchawla/nc-walmart-casestudy

The management team at Walmart Inc. wants to analyze customer purchase behavior (specifically, purchase amount) against the customer’s gender and various other factors to help the business make better decisions. They want to understand whether spending habits differ between male and female customers.

central-limit-theorem confidence-intervals matplotlib-pyplot numpy pandas python scipy-stats seaborn stats

Last synced: 08 May 2026

https://github.com/anker661/autograd-playground

A simple NumPy- and Python-based automatic differentiation system that supports visualization before and after back-propagation.

auto-differentiation autograd computaion-graph jupyter-notebook numpy python3

Last synced: 09 May 2026

https://github.com/arjunravi26/selenium

This repository contains code from my learning about Selenium for web scraping and automation.

numpy python selenium time undetected-chrome webdriver

Last synced: 09 May 2026

https://github.com/betiniakarandut/portfolio-project

Sukkar and Cornell Integral Approach to Estimate Static Bottom Hole Pressure (SBHP)

numpy pandas python3

Last synced: 09 May 2026

https://github.com/vadimkantorov/fasttsv

TSV parser for Python in pure vectorized NumPy code

numpy parsing simd tsv vectorized-code

Last synced: 09 May 2026

https://github.com/alexgustafsson/word-frequencies

Data and tools to compile word frequencies, trigrams and more for use with NLP, spelling correction etc.

language nlp nltk numpy python python3 sklearn spelling-correction trigram

Last synced: 10 May 2026

https://github.com/faisal-khann/python-practice

This is the section for Python practice, where we engage in programming exercises to improve our Python skills.

numpy numpy-arrays pandas python

Last synced: 10 May 2026

https://github.com/tim-w-james/python-data-reference-code

Python reference code for data visualization, machine learning and SQL

data-science jupyter-lab numpy python

Last synced: 10 May 2026

https://github.com/vviveksharma/barcodescanner

A barcode-scanning application built in Python with the OpenCV computer vision library. It reads the barcode shown to the camera, decodes it, and checks whether the decoded number is listed in the attached database or text file. If it is, the application shows "Permitted" in green; otherwise it shows "Not-Permitted" in red.

numpy opencv python3

Last synced: 10 May 2026

https://github.com/alphacrypto246/air-quality-prediction

The Air Pollution Prediction project forecasts pollution levels using the Air Quality UCI dataset, including variables like CO, NOx, temperature, and humidity. It analyzes temporal and environmental factors to predict air quality trends.

machine-learning numpy pandas scikitlearn-machine-learning sckiit-learn

Last synced: 10 May 2026

https://github.com/1adityakadam/computer-vision-approach-for-quality-inspection-of-steel-sheets

A machine learning and computer vision project that improves steel defect inspection. A segmentation technique partitions steel sheet images into regions, enabling better defect analysis. U-Net is used for segmentation, and MobileNetV2 for classification. Performance is evaluated with the Dice coefficient.

computer-vision machine-learning numpy pandas unet-image-segmentation

Last synced: 10 May 2026

https://github.com/pnguyen215/pycelize

Pycelize is a Flask application designed for processing Excel and CSV files. It provides RESTful APIs for common data operations including extraction, normalization, mapping, SQL generation, and file binding.

excel lib numpy openpyxl pandas py-app pycelize python-service python3

Last synced: 17 Jun 2026

https://github.com/amiegirl/ford_gobike-tripdata

This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 08 May 2026

https://github.com/686f6c61/numeros-primos-gemelos

This project generates and visualizes twin primes in different ways to help understand their patterns and distribution. It includes multiple visualizations and advanced statistical analyses.

numpy primos python scipy

Last synced: 10 May 2026

https://github.com/octavioduarte/datascience

A repository with study notes on Data Science concepts, NumPy, pandas, and an introduction to statistics.

data-science jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/sahaavi/color-detection

Color detection is the process of detecting the name of any color. For humans this is an extremely easy task, but for computers it is not straightforward. Human eyes and brains work together to translate light into color. Light receptors in our eyes transmit the signal to the brain, which then recognizes the color. Since childhood, we have mapped certain lights to their color names. We will use a somewhat similar strategy to detect color names.

color-detection color-detector numpy opencv-python pandas

Last synced: 10 May 2026

https://github.com/dawoudtormos/rain_predicition_ml

This repository contains a machine learning project for rain prediction using historical meteorological data. The goal is to predict whether it will rain the next day based on weather measurements like atmospheric water vapor, cloud formation, and past rainfall records.

ai machine-learning machine-learning-algorithms matplotlib numpy seaborn sklearn

Last synced: 10 May 2026

https://github.com/rebeccamorolong/market--basket--analysis

Market Basket Analysis using the Apriori algorithm to predict association rules and frequent itemsets from transactional data. The goal was to uncover product combinations that frequently co-occur in transactions, which can be used for recommendations, promotions, and inventory management.

apriori-algorithm numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/dkm/stlsplit

Tool to split STL in several objects

numpy python stl

Last synced: 17 Jun 2026

https://github.com/dzakarias/waves

Fun with the sine function inspired by Chladni plates

numpy streamlit

Last synced: 10 May 2026

https://github.com/prateekrajsrivastav/question-answering-model

This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.

huggingface-transformers matplotlib nltk numpy pandas seaborn spacy

Last synced: 10 May 2026

https://github.com/esoltys/climate_data_viz

FastAPI-based web application that fetches and visualizes climate data from the Copernicus Climate Data Store (CDS).

apache-echarts copernicus-climate-data-store fastapi numpy

Last synced: 10 May 2026

https://github.com/jupfu/beck-view-movie

Assemble mp4 video from list of images

multithreading numpy opencv python3 threadpool tqdm

Last synced: 10 May 2026

https://github.com/shibam120302/all_about_python

Here I upload Python material from basic to advanced: OOP in Python, DSA using Python, system design, NumPy, pandas, data science, and ML. Follow @shibam120302 and star this repo.

numpy numpy-arrays oops-in-python pandas python python-dsa python-library python3 system-design tabula

Last synced: 10 May 2026

https://github.com/prakashjha1/email-sms-spam-classifier

Many email services today provide spam filters that are able to classify emails into spam and non-spam email with high accuracy.

machine-learning natural-language-processing numpy pandas python

Last synced: 10 May 2026

https://github.com/akansharajput280799/data-exploration-and-visualization-for-streaming-platform

Data analysis and visualization for a streaming platform, providing insights and recommendations to grow its user base.

data-visualization excel exploratory-data-analysis matplotlib numpy python seaborn

Last synced: 08 May 2026

https://github.com/jjerphan/joml

A minimalist numpy-baked Neural Network API

deep-learning fun minimalist neural-network neural-networks numpy python wheel-implementing

Last synced: 10 May 2026

https://github.com/huggon1/ml-algorithm-implementations

Educational implementations for ML, DL, LLM blocks, ViT, and CUDA.

cuda machine-learning numpy pytorch vision-transformer

Last synced: 28 Jun 2026

https://github.com/zuecopio/vco_2425

Computer Vision (VCO) subject practices

cv2 matplotlib numpy opencv pilllow

Last synced: 08 May 2026

https://github.com/mahmood-anaam/grid-world-value-iteration

An implementation of the Value Iteration algorithm for solving the Grid World problem. This project provides a function to compute the optimal value function for a grid-based environment where a robot navigates to maximize rewards while avoiding penalties.

dynamic-programming grid-world numpy optimization-problem python value-iteration-algorithm

Last synced: 10 May 2026

https://github.com/kartmaan/plotly-intervals

Group values from a Pandas Series according to given intervals and represent them graphically with Plotly

bar-chart grouping grouping-plots intervals intervals-chart numpy numpy-arrays orca pandas pandas-series percentile pie-chart plot plotly plotting values

Last synced: 10 May 2026

https://github.com/florasteve/ml-foundations-day2

Day-2 ML foundations: probability/stats refresh and NumPy logistic regression; notebooks with visuals.

data-science jupyter-notebook logistic-regression machine-learning matplotlib numpy statistics

Last synced: 10 May 2026

https://github.com/m1-elmasry/quickmark

Efficient OMR software designed for quick and precise grading of bubble sheet exams.

bubble-sheet image-processing numpy opencv python

Last synced: 10 May 2026

https://github.com/anshpg/exploring-ipl-rivalries-an-analysis-of-matches-from-2008-to-2022

In this project, I delved into an extensive analysis of IPL matches spanning from 2008 to 2022, utilizing a dataset sourced from Kaggle.com. My aim for this IPL season was to explore something novel, steering away from conventional analyses.

csv-files ipl kaggle-dataset matplotlib numpy pandas

Last synced: 10 May 2026

https://github.com/niteshchawla/loantap-logisticregression

Given a set of attributes for an Individual, determine if a credit line should be extended to them. If so, what should the repayment terms be in business recommendations?

confusion-matrix encoding feature-engineering featureimportance hyperparameter-tuning logistic-regression machine-learning numpy pandas-python precision-recall roc-auc-curve sklearn-metrics smote-sampling

Last synced: 10 May 2026

https://github.com/tomdewildt/interactive-and-explainable-ai-design

Code for The Interactive And Explainable AI Design course of my master's degree

jupyter lime numpy pandas python scikit-learn shap

Last synced: 18 Jun 2026

https://github.com/shubhansu-kr/int213-python-programming

Python Programming | Semester 3 | Python | OOPS | Tkinter | Pandas | Numpy

lpu numpy oops-in-python pandas python sem3 tkinter-gui

Last synced: 10 May 2026

https://github.com/sahil210695/convolution-evolution

A tool for seeing how a kernel convolves over an image and what the output looks like after convolution.

cnn convolutional-neural-networks data-science image-kernel image-processing numpy numpy-neural-network python streamlit

Last synced: 10 May 2026

https://github.com/andersoncrs/aprendizaje_no_supervisado_kmeans_customers

This repository contains an analysis of shopping mall customer data using unsupervised learning techniques, specifically K-Means and hierarchical clustering. The goal of the project is to segment customers into homogeneous groups to better understand their behaviors and characteristics.

data-analysis kmeans-clustering matplotlib numpy seaborn visualization

Last synced: 10 May 2026

https://github.com/jatin-s16/neuron_from_scratch

Hey! I am learning to build neural networks from scratch without the use of built-in Python libraries.

deep-learning neural-networks nnfs numpy python

Last synced: 10 May 2026

https://github.com/charlenry/python_math_machine_learning

My practical-work notebooks on Python, NumPy, SimPy, SciPy, Matplotlib, Plotly, Seaborn, and mathematics for machine learning

algebra derivatives functions jupyter linear matplotlib matrix numpy plotly probabilities pyplot python pytorch scipy seaborn sklearn statistics sympy tensorflow time

Last synced: 25 Jun 2026

https://github.com/charlenry/python_data_science

My practical-work notebooks on Python for data science

analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation

Last synced: 25 Jun 2026

https://github.com/susanketsarkar/nn-from-scratch

A pure NumPy implementation of a deep neural network, built for educational purposes and deep learning understanding. This project implements a multi-layer neural network with various features commonly found in modern deep learning frameworks.

deep-learning implementation-of-algorithms neural-network numpy python

Last synced: 10 May 2026

https://github.com/tinaland101/carcinoma-regression-project

The goal of this project is to analyze and visualize the results of a clinical study involving the treatment of squamous cell carcinoma (SCC) in mice. As a senior data analyst, you are tasked with preparing a technical report for Pymaceuticals, Inc.

linear-regression mathplotlib numpy pandas pyhton

Last synced: 08 May 2026

https://github.com/velascort/image_equalizer_gui

Image equalizer using LUT interface.

image-processing lut numpy python scikit-image ttk

Last synced: 10 May 2026

https://github.com/saagnik-mondal/chessengine

This is a simple and efficient chess engine implemented entirely in Python. It allows users to play chess against the computer, using an AI based on a Minimax algorithm with Alpha-Beta pruning for decision-making. The engine is designed to be easy to understand, extend, and play against.

numpy pygame python3

Last synced: 10 May 2026

https://github.com/tnleite/real-estate-opportunities-analysis

This repository presents an analysis of opportunities in the real estate market, combining time series, clustering, and forecasting to identify the states with the greatest growth potential and to guide efficient expansion strategies.

catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression

Last synced: 10 May 2026

https://github.com/shriansh2002/stock-prediction

This project predicts Apple stock prices using linear regression. It's based on historical stock price data and uses Python and popular data science libraries like Pandas, NumPy, Matplotlib, and scikit-learn.

apple datascience finance linearregression machinelearning matplotlib ml numpy pandas prediction python scikitlearn stock-prediction stockmarket

Last synced: 10 May 2026

https://github.com/relostar-devil/census-income-prediction

An end-to-end ML project using 1994 US Census data to classify income (>50K/<50K). The Jupyter Notebook covers data preprocessing, EDA, and model evaluation with multiple classifiers.

matplotlib numpy pandas scikitlearn-machine-learning seaborn

Last synced: 10 May 2026

https://github.com/alain-godo/spotify-popularity

A simple analysis and model for predicting the popularity of songs on Spotify

linear-regression machine-learning numpy pandas random-forest sckiit-learn spotify tree-decision

Last synced: 08 May 2026

https://github.com/jsnevt/regressao_logistica

Probability calculations for an election

matplotlib numpy pandas-dataframe python

Last synced: 11 May 2026

https://github.com/dino-kupinic/pocolocosolver

Algorithmic solution with 3D visualization

algorithms numpy panda3d puzzle pytest python solver

Last synced: 19 Jun 2026

https://github.com/djdurga/predictive_analysis_in_diabetes

This project applies Logistic Regression to predict diabetes in patients using the Pima Indians Diabetes Dataset. It covers the full data science lifecycle — from data imputation and exploration to model training, evaluation, and insights.

matplotlib numpy pandas

Last synced: 08 May 2026

https://github.com/adityarohatgi11/regression_analysis_london_crime

This project examines how police presence impacts crime rates in London using regression analysis, panel data, and a natural experiment from the 2005 London attacks.

data-wrangling numpy pandas regression-analysis statsmodels

Last synced: 11 May 2026

https://github.com/gatuno1/details_meson_numpy_error

Documents for exploration of meson error on numpy installation over PyPy on Windows

meson numpy pypy unicode windows

Last synced: 11 May 2026

https://github.com/jayemscript/lab-to-code

A complete Python learning roadmap for scientists and researchers — covering data science, biology, chemistry, physics, and mathematics with curated libraries, tools, and resources.

bioinformatics chemistry data-science jupyter-notebook machine-learning mathematics numpy pandas physics python research roadmap scientific-computing scikit-learn

Last synced: 19 Jun 2026

https://github.com/monarch1108/customerinsights-kmeans

Understanding customers using KMeans and RFM (recency, frequency, and monetary) analysis

data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn

Last synced: 11 May 2026

https://github.com/baschin1103/machine-learning-linear-regression-tsi

The model is a multiple linear regression from scikit-learn. The aim is to forecast the so-called Total Skill Index (TSI) of a player from hattrick.org. More information: https://wiki.hattrick.org/wiki/Total_Skill_Index. This repository contains the program and the necessary data.

csv linear-regression machine-learning numpy pandas python statistics

Last synced: 11 May 2026

https://github.com/prakashjha1/stock-investment-analysis

The Stock Investment Analysis project can help investors select better-performing stocks.

data-analysis data-science numpy pandas pandas-datareader parallel-programming python

Last synced: 08 May 2026

https://github.com/akashash01/decision_tree

A very specific type of probability tree that helps you make a decision about a process (chance event outcomes, resource costs, utility).

data-science decision-tree-classifier machine-learning numpy pandas-dataframe python sklearn

Last synced: 11 May 2026

https://github.com/dhruvil-26/python-projects

This repository contains Python projects showcasing data analysis and visualization. 1. IMDB Movie Analysis: Analyzing movie trends, genres, and ratings. 2. Loan Default Analysis EDA: Exploring factors contributing to loan defaults.

eda imdb-dataset loan-default-analysis matplotlib numpy pandas python seaborn visualization

Last synced: 08 May 2026

https://github.com/chaganti-reddy/heart_disease_prediction

Machine learning model creation for heart disease prediction

data-science deep-learning machine-learning numpy python

Last synced: 11 May 2026

https://github.com/williyam-m/sentiment_analysis

A Django-based Sentiment Analysis web app leveraging the power of the BERT transformer model to classify text sentiment accurately.

beautifulsoup bert django numpy pandas python torch transformers

Last synced: 20 Jun 2026

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning—it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026

https://github.com/parthds02/customer-segmentation-with-kmeans-clustering

Analyze customer behavior using Python and KMeans Clustering on transactional data. Features RFM analysis, data cleaning, clustering insights, and actionable visualizations to support business decision-making.

data-analysis data-visualization feature-engineering kmeans-clustering numpy pandas vscode

Last synced: 11 May 2026

https://github.com/rphadtare/python_by_example

This repository consists of sample programs for learning and understanding core concepts

duckdb numpy pandas python-os python3

Last synced: 27 Jun 2026

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 May 2026

https://github.com/chdl17/imdb-assignment

This GitHub repository contains a project that performs an analysis of the IMDb movie rating dataset. The data is used to explore the relationships between various features and the movie rating. Visualizations are used to illustrate the results of the analysis. The code is organized into a Jupyter Notebook file and data files.

numpy pandas-dataframe python

Last synced: 08 May 2026

https://github.com/bhuvilol/objdetector_webcam_and_video

Object Detection from Webcam input or video input

cvzone numpy opencv python3 yolov8

Last synced: 11 May 2026

https://github.com/omogbolahan94/google-play-store-app-analysis

Data exploration, data cleaning, interactive data visualisation and insight presentation

matplotlib numpy pandas plotly

Last synced: 11 May 2026

https://github.com/takana671/fluidcube

Particle Movement Using Curl Noise

curl-noise cython numpy panda3d python

Last synced: 22 Jun 2026

https://github.com/omogbolahan94/movies-budget-revenue-analysis

Data exploration, data cleaning, data analysis and predictive analysis

dataframe linear-regression matplotlib numpy pandas seaborn statistical-analysis

Last synced: 08 May 2026

https://github.com/samaalharbi2/100-days-of-code

This repository is a collection of 100 Python projects built as part of the "100 Days of Code: Python Developer Challenge" course.

data-science game-development machine-learning numpy pandas python udemy

Last synced: 08 May 2026

https://github.com/khalidelboray/image_processing

Notes about image processing

image-processing numpy python3

Last synced: 11 May 2026

https://github.com/huzaifa12466/neuralnetscratch

A simple neural network implemented from scratch in Python using NumPy, trained on the Moons dataset

artificial-intelligence artificial-neural-networks backpropagation deep-learning forwardpropagation machine-learning matplotlib neural-network neural-networks numpy

Last synced: 23 Jun 2026

https://github.com/sunnyrao07/water-quality-analysis

A machine learning project that predicts water potability based on chemical and physical attributes, using models like Logistic Regression, Random Forest, and XGBoost.

data-cleaning label-encoding logistic-regression matplotlib model-evaluation numpy pandas pyhton random-forest sckiit-learn seaborn smote standard-scaler xgboost

Last synced: 08 May 2026

https://github.com/ishaan28malik/masters-admission-analyser

Admissions of Indian students for foreign universities.

admission classification machine-learning matplotlib numpy python students

Last synced: 08 May 2026

https://github.com/gerardomacedo/ml-from-scratch

Logistic regression from scratch (NumPy) with tests and CLI; compared vs scikit-learn

ci-cd from-scratch logistic-regresion machine-learning numpy python unittest

Last synced: 23 Jun 2026

https://github.com/daniel-brai/alx-t-x-udacity-data-analytics-nanodegree

A 3-month nanodegree going through the intricacies of data analysis with Python and its various libraries, such as pandas, NumPy, Matplotlib, seaborn, and others

matplotlib numpy pandas python3

Last synced: 08 May 2026

https://github.com/kevinmbeaulieu/machine-learning-practice

ML models & data processing utilities written from scratch as practice.

data-science machine-learning numpy pandas

Last synced: 08 May 2026