An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/m-k-l-s/sudoku

A trivial sudoku solver in Python, using recursion and numpy.

backtracking numpy python recursion solver sudoku

Last synced: 27 Apr 2026

https://github.com/sivasurya681/financial-expense-tracker

The Financial Expense Tracker is a Python-based app built with Flet, Pandas, NumPy, and Matplotlib.pyplot for seamless expense management. It enables real-time tracking, categorization, and data visualization through bar and pie charts. Users can generate summaries, export/import data, and make informed financial decisions easily.

calculator-application differntial expense-tracker flet matplotlib-pyplot numpy pandas python salary-calculator total

Last synced: 09 May 2026

https://github.com/felinjob/ibm-applied-data-science-capstone

Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 11 Apr 2026

https://github.com/saptarshi11/diamlp---diabetes-classification-with-multilayer-perceptron

Diabetes Classification using Multilayer Perceptron (MLP) is a binary classification problem where the goal is to predict whether a patient has diabetes (positive class) or not (negative class) based on a set of biomedical features or risk factors.

artificial-neural-networks binaryclassification machine-learning mlp-classifier numpy seaborn

Last synced: 27 Apr 2026

https://github.com/rokmark0/auto-feature-engineering

🐙 Automated feature engineering toolkit that creates, selects, and transforms features for machine learning on tabular data, boosting model accuracy and speeding workflows.

auto-feature-engineering automl aws-ec2 dask data-science explainable-ml feature-engineering featuretools machine-learning mahinelearning numpy pandas pandas-library python r sagemaker supervised-learning unsupervised-learning

Last synced: 10 Apr 2026

https://github.com/l2nce/datamining-study

Introduction to data mining

data-analysis data-mining matplotlib numpy panda

Last synced: 28 Apr 2026

https://github.com/vasukalariya/neural-network-from-scratch

It's a simple Feedforward Neural Network implemented just using Numpy

artificial-neural-networks classification mnist neural-network numpy

Last synced: 28 Apr 2026

https://github.com/binoydipu/ml-libraries

Machine Learning Libraries Repository

machine-learning numpy

Last synced: 28 Apr 2026

https://github.com/sanjurajveer/market_basket_analysis

Finding pattern of bought objects through market basket analysis

apriori-algorithm association-rules numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/varun-khorgade/salesense-e-commerce-analytics

Analyzed e-commerce datasets to discover customer behavior trends and business insights. Performed EDA, data cleaning, and visualizations to help stakeholders make data-driven decisions.

customer-behavior-analysis dashboards datacleaning exploratory-data-analysis feature-engineering jupyter-notebook matplotlib numpy pandas python seaborn sql time-series-trend visulaization

Last synced: 28 Apr 2026

https://github.com/rajivaleaakash/customer-churn-prediction

A machine learning project focused on predicting customer churn using various data analysis and modeling techniques. The repository includes data preprocessing, feature engineering, exploratory data analysis (EDA), model training, evaluation, and visualization to help businesses identify customers at risk of leaving.

churn-prediction classification customer-churn data-analysis data-science gridsearchcv imblearn machine-learning numpy pandas pyhton randomsearchcv scikit-learn

Last synced: 28 Apr 2026

https://github.com/varakanthamsaihruthikreddy975/avocadopricesanalysis

This data analysis finds out the trends and analysis of avocado data by year

matplotlib-pyplot numpy pandas scipy scipy-stats seaborn sklearn-preprocessing

Last synced: 28 Apr 2026

https://github.com/abdeldjalilchafai/us-flight-delay-eda

Structured EDA on 2015 US flight delay data. Clean, reproducible notebook using a 6-step data analysis framework for real-world datasets.

data-analysis data-cleaning eda exploratory-data-analysis flight-delays kaggle matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/pedronjorge/average-face

Generate an average face from a given set of images by detecting facial landmarks, aligning features, and blending the images using Delaunay Triangulation. Libraries used: Dlib, OpenCV, NumPy

dlib image-generation jupyter-notebook numpy opencv python3

Last synced: 30 Apr 2026

https://github.com/carol-neto/sprint-4-statistical-data-analysis

In this project I had the opportunity to test my knowledge by analyzing a phone plan and creating graphs to compare the plans and determine which ones generate the most revenue.

matplotlib-pyplot numpy pandas pytho scipy-stats seaborn statistical-analysis

Last synced: 09 May 2026

https://github.com/morpheomax/uberpython

Viajes realizados en Uber con Python

numpy pandas python streamlit uber

Last synced: 28 Apr 2026

https://github.com/pnguyen215/pycelize

Pycelize is a Flask application designed for processing Excel and CSV files. It provides RESTful APIs for common data operations including extraction, normalization, mapping, SQL generation, and file binding.

excel lib numpy openpyxl pandas py-app pycelize python-service python3

Last synced: 17 Jun 2026

https://github.com/semanticdata/traffic-studies

Comprehensive traffic analysis dashboard for Crystal, Minnesota, built with Streamlit.

matplotlib numpy pandas plotly python seaborn streamlit

Last synced: 28 Apr 2026

https://github.com/technologicat/pylu

Small nogil-compatible linear equation system solver

cython linear-equations numerical numpy python python2 python27 python3 python34 solver

Last synced: 28 Apr 2026

https://github.com/redinfinitypro/drawimages

Rating: (8/10) The code uses Python's Turtle module to create a canvas image, converting it to grayscale, and then placing dots at each pixel's location based on its intensity.

numpy pil random turtle

Last synced: 28 Apr 2026

https://github.com/betiniakarandut/portfolio-project

Sukkar and Cornell Integral Approach to Estimate Static Bottom Hole Pressure(SBHP)

numpy pandas python3

Last synced: 09 May 2026

https://github.com/babaquero07/escalamiento-imagenes-python

Proyecto del curso Algebra Lineal - UTADEO

matloptlib numpy pillow scipy streamlit

Last synced: 28 Apr 2026

https://github.com/ksprugevics/galton-board

Python implementation of the Galton board

galton-board numpy pygame pymunk python

Last synced: 29 Apr 2026

https://github.com/santiagoenriquega/game_of_life

This repository contains simulates Conway's Game of Life using Pygame for graphics and Numpy for array operations. The game simulates cellular automata where cells evolve based on simple rules.

conways-game-of-life numpy pygame python python3

Last synced: 29 Apr 2026

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/skypse/santander-coders-data_science-course

Curso de Data Science, proposto pelo Satander, utilizando Python!

jupyter-notebook numpy pandas-python python scikit-learn

Last synced: 29 Apr 2026

https://github.com/ksharma67/anomaly-detection-on-temperature-device-failure

A typical anomaly detection task and performing KMeans, PCA, Gaussian distribution, and Isolation Forest.

eda ellipticenvelope feature-engineering gaussian-distribution isolation-forest kmeans-clustering numpy pca python sklearn

Last synced: 11 Apr 2026

https://github.com/alexgustafsson/word-frequencies

Data and tools to compile word frequencies, trigrams and more for use with NLP, spelling correction etc.

language nlp nltk numpy python python3 sklearn spelling-correction trigram

Last synced: 10 May 2026

https://github.com/humankernel/hands-on-ml

Practice of Machine Learning

machine-learning matplotlib numpy pandas sklearn

Last synced: 08 Jun 2026

https://github.com/anas436/python

This is Python Programming Language. Here, the basic problems of Python language have been solved. If you can solve these problems then your basic skills will be developed in this language and you will be suitable for doing good projects.

jupyter-notebook jupyterlab numpy pandas pycharm python3 watson-studio

Last synced: 11 Apr 2026

https://github.com/ujjvalrajput/data-analysis-python-practise

Practise with matplotlib, pandas, and numpy.

matplotlib numpy pandas python

Last synced: 29 Apr 2026

https://github.com/thaisgarcia/analise-exploratoria

Respondendo 10 perguntas de negócio a partir de um dataset sobre uma rede de varejo que comercializa diversos produtos em diversas cidades dos EUA.

matplotlib numpy pandas

Last synced: 29 Apr 2026

https://github.com/alphacrypto246/air-quality-prediction

The Air Pollution Prediction project forecasts pollution levels using the Air Quality UCI dataset, including variables like CO, NOx, temperature, and humidity. It analyzes temporal and environmental factors to predict air quality trends.

machine-learning numpy pandas scikitlearn-machine-learning sckiit-learn

Last synced: 10 May 2026

https://github.com/unknownman/101-numpy

learning numpy by 101 short videos and jupyter notebook

numpy

Last synced: 29 Apr 2026

https://github.com/antonio-f/naive_convnet

Naive Convolutional Neural Network with TensorFlow 2

101 convolutional-neural-networks deep-learning easy matplotlib numpy simple tensorflow2 tutorial

Last synced: 29 Apr 2026

https://github.com/amulya20010418/zomato_data_analysis

Analyze Zomato restaurant data all the world and find the insights by using Python libaries and also visualize the dataset by using Power-bi

cleaning-data-in-python dashboard numpy pand powerbi python visualization

Last synced: 29 Apr 2026

https://github.com/matheusvazdata/retail-sales-forecast-linreg-sklearn

Minimal project for retail sales forecasting using linear regression (scikit-learn).

forecasting linear-regression machine-learning matplotlib numpy pandas scikit-learn

Last synced: 29 Apr 2026

https://github.com/ksharma67/k-means-algorithm-on-the-iris-dataset

Applied the K-Means algorithm on the Iris dataset, and utilized the Silhouette Score method to find the best value of K

eda elbow-method iris-dataset kmeans-clustering matplotlib numpy pandas python seaborn silhouette-score

Last synced: 29 Apr 2026

https://github.com/prithviraj-2003/cognifyz-data-science-internship

🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF

data-analysis data-science data-visualization matplotlib numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/yimethan/basics-of-data-analysis

2023-2 Basics of Data Analysis

data-analysis numpy pandas python

Last synced: 29 Apr 2026

https://github.com/michaelzheng67/farmsa-programming-for-finance-spring2021

Source files for the programming for finance presentation (complements the material covered)

excel jupyter-notebook matplotlib numpy pandas python sql yahoo-finance-api

Last synced: 29 Apr 2026

https://github.com/atchayaah/vois_aicte_oct2025_atchayaa

Comprehensive data science analysis leveraging NYC Airbnb market data to derive actionable insights on optimal pricing models, key performance indicators (KPIs), and factors influencing host trust and guest experience optimization.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/laipching/sprint6_module1

Exploratory Data Analysis with Python (Pandas/Matplotlib/Seaborn). Business questions, metrics and clear visualizations.

data-visualization eda matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/x-nimesh/panoramer-dockerized

Panoramer is a cutting-edge panoramic image stitching web application designed to transform your images into stunning, professional-quality panoramas. Whether you're a seasoned photographer or a hobbyist, Panoramer's intuitive interface and precise alignment tools make it easy for users of all skill levels to create seamless panoramas.

flask image-processing numpy python react

Last synced: 29 Apr 2026

https://github.com/muhammadusman-khan/e-commerce-store-eda

Exploratory Data Analysis on E-commerce store data to uncover insights about sales trends, customer behavior, and product performance using Python libraries like Pandas, NumPy, and Matplotlib/Seaborn.

data-analysis data-science data-visualization e-commerce eda exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/fantaso/data-analysis-and-manipulation-with-pandas

Project consists to analyse a publicly available movie dataset found in https://www.kaggle.com/beyjin/movies-1990-to-2017 and use Python tools like Pandas in order to get some initial insights about the dataset and finally proceeding to clean, transform and save a new version of the dataset in a better structure thinking about storing the data in a database.

matplotlib numpy pandas python3 seaborn

Last synced: 29 Apr 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 30 Apr 2026

https://github.com/ola-kaznowska/sigmanoid-function

Sigmanoid function. Written in Python 3.13.1 with matplotlib library and numpy

mathmatics matplotlib numpy python python3

Last synced: 30 Apr 2026

https://github.com/victor-gp/tfg-h16b

The code for my bachelor's thesis, a numerical exploration of the second part of Hilbert's 16th problem

bachelor-thesis dynamical-systems matplotlib-pyplot numpy odeint

Last synced: 30 Apr 2026

https://github.com/dina-hosny/import-preprocess-and-visualize-a-dataset-project

A simple project to practice importing a dataset, data cleaning and preparation processes, and visualize the results to answer some given questions.

data-cleaning data-engineering data-science data-visualization jupyter-notebook matplotlib numpy pandas python

Last synced: 30 Apr 2026

https://github.com/nikhilsree5/walmartcasestudy

Analysis of Customer Spending Habits at Walmart Inc

clt eda numpy pandas python3 statistics visualization

Last synced: 30 Apr 2026

https://github.com/mrktsm/spam-email-recognizer

Long Short-Term Memory (LSTM) network trained to classify emails as spam or non-spam. It processes email content to make accurate predictions and can be integrated into projects for efficient spam detection and email management.

data-preprocessing keras lstm-neural-network model-architecture nltk numpy pandas performance-evaluation scikit-learn spam-classification-model tenserflow training-the-model

Last synced: 09 Apr 2026

https://github.com/tapishr/numpy-rnn

Implementation of an RNN using numpy library in python

numpy python recursive-neural-network rnn

Last synced: 30 Apr 2026

https://github.com/tsar123/coffee-quality-analysis

machine learning using RandomForestRegressor and dataset with coffee data

coffee ml numpy pandas randomforestregressor regressor-models sklearn

Last synced: 30 Apr 2026

https://github.com/blankscreen-exe/triple-trial-card-maker

Triple Triad Card maker for a game inspired by Final fantasy

card-game numpy opencv python tripletriad

Last synced: 30 Apr 2026

https://github.com/fikri-rouzan/student-stress-levels-classification

Proyek pemodelan machine learning untuk mengklasifikasikan tingkat stres mahasiswa berdasarkan parameter input akademik dan psikologis.

joblib jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn streamlit

Last synced: 08 Jun 2026

https://github.com/rebeccamorolong/market--basket--analysis

Market Basket Analysis using the Apriori algorithm to predict association rules and frequent itemsets from transactional data. The goal was to uncover product combinations that frequently co-occur in transactions, which can be used for recommendations, promotions, and inventory management.

apriori-algorithm numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/yashbhutwala/pong-ai

Deep Q-Learning Networks vs. Policy Gradient Learning in OpenAI Gym's Pong Environment

deep-q-learning numpy openai-gym policy-gradient pong python tensorflow

Last synced: 30 Apr 2026

https://github.com/isaacalves7/nvidia-cuda

👁️‍🗨️📗 It's a repository of Nvidia CUDA programming.

anaconda gpu-programming numba numpy nvidia-cuda python3

Last synced: 30 Apr 2026

https://github.com/johassel/feed_forward_neural_network

Fully-connected feed-forward neural network based on the couse "Python for Machine Learning" at TU Berlin WiSe 24/25

machine-learning neural-networks numpy python

Last synced: 30 Apr 2026

https://github.com/abhiram-epuru/ai-powered-movie-recommender

A Movie Recommendation System using the MovieLens dataset, leveraging collaborative filtering techniques such as KNN (k-Nearest Neighbors) and Singular Value Decomposition (SVD). The project demonstrates a personalized recommendation model capable of suggesting movies based on user preferences and interaction data.

matplotlib numpy pandas python sckiit-learn seaborn

Last synced: 30 Apr 2026

https://github.com/prateekrajsrivastav/question-answering-model

This project is an NLP-based Question-Answering System that leverages machine learning and natural language processing techniques to provide answers to user queries based on a given context or dataset. The system processes textual input, extracts meaningful insights, and returns relevant answers to the user's question.

huggingface-transformers matplotlib nltk numpy pandas seaborn spacy

Last synced: 10 May 2026

https://github.com/lmizner/codecademy_biodiversity

Using knowledge of data analysis to interpret data about endangered animals for the National Park Service

barplot jupyter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/johnrobertsantos/exploratory-data-analysis-using-python

This project represents the culmination of my Python learning journey.

jupyter matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/mogankumar/bird-flock-simulation-with-pyspark

A PySpark-based bird flock simulation modeling alignment, cohesion, and separation forces with a leader guiding the flock. Efficiently handles large-scale simulations and generates a visualized GIF of the flock’s movement.

argparse matplotlib numpy pyspark python3

Last synced: 01 May 2026

https://github.com/iamsaniasingh/heart_disease_prediction

This is my very first machine learning project, where I used a supervised learning algorithm—logistic regression—to predict heart disease. The model was trained and tested entirely on a pre-existing dataset, with no user input involved. The goal was to understand how ML models work and how they can be applied in healthcare predictions.

logistic-regression machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 11 Apr 2026

https://github.com/shibam120302/all_about_python

Here I upload python from basic to advance ,oops in python, dsa using python system design, numpy, pandas, data science, ML also. Follow @shibam120302 and star this repo.

numpy numpy-arrays oops-in-python pandas python python-dsa python-library python3 system-design tabula

Last synced: 10 May 2026

https://github.com/mahmood-anaam/grid-world-value-iteration

An implementation of the Value Iteration algorithm for solving the Grid World problem. This project provides a function to compute the optimal value function for a grid-based environment where a robot navigates to maximize rewards while avoiding penalties.

dynamic-programming grid-world numpy optimization-problem python value-iteration-algorithm

Last synced: 10 May 2026

https://github.com/sandip2512/play-store-app-review-analysis

EDA on Play Store apps and user sentiment reviews

eda heatmaps matplotlib numpy pandas plotly seaborn sentiment-analysis

Last synced: 01 May 2026

https://github.com/mohhasan1/machine-learning

This repository is where I learn and store information about various machine learning libraries, such as Pandas, scikit-learn, and others. It serves as a personal reference for exploring and experimenting with different ML tools and techniques.

machine-learning matplotlib numpy pandas seaborn sklearn

Last synced: 01 May 2026

https://github.com/anshpg/exploring-ipl-rivalries-an-analysis-of-matches-from-2008-to-2022

In this project, I delved into an extensive analysis of IPL matches spanning from 2008 to 2022, utilizing a dataset sourced from Kaggle.com. My aim for this IPL season was to explore something novel, steering away from conventional analyses.

csv-files ipl kaggle-dataset matplotlib numpy pandas

Last synced: 10 May 2026

https://github.com/hansalemaos/a_pandas_ex_image_tools

Uses pandas/numpy/numexpr for operations on pictures - very fast

colors image locate numexpr numpy opencv pictures python

Last synced: 01 May 2026

https://github.com/deusares/funpil

A collection of PIL, Numpy and OpenCV functions to manipulate images

batch image-processing lazy media numpy opencv pil pillow python script

Last synced: 01 May 2026

https://github.com/dhruvv1402/spam-detection-python-

This project is a Spam Detection System built using Python. It classifies SMS messages as spam or ham (not spam) using machine learning techniques.

countvectorizer kaggle-dataset nlp-machine-learning nltk numpy pandas python scikit-learn supervised-machine-learning tf-idf

Last synced: 01 May 2026

https://github.com/sahil210695/convolution-evolution

This is to see how a kernel will convolve over an image and what will be its output after convolution

cnn convolutional-neural-networks data-science image-kernel image-processing numpy numpy-neural-network python streamlit

Last synced: 10 May 2026

https://github.com/rgandhi8/inventory-management-system

This project is an AI-driven Inventory Management System designed to optimize supply chain efficiency by leveraging machine learning techniques for demand forecasting. The system integrates ARIMA and ANNs to predict inventory needs, minimize shortages, and reduce holding costs also enhances decision-making by automating inventory planning.

dashboard inventory-management-system machine-learning numpy pandas python seaborn

Last synced: 08 Jun 2026

https://github.com/lmlk-seal/pyspeed

Intelligent Python Performance Optimization Tool Automatically profile, analyze, and accelerate your Python code with minimal effort.

analyzer customtkinter gui intelligent numpy optimization optimization-tools performance python real-world-problem-solving

Last synced: 23 Jun 2026

https://github.com/voidkeishi/computational-science

This repository contains implementations of algorithms from the course "Computational Science" at HUST

computational-science numpy python

Last synced: 02 May 2026

https://github.com/fabriciocovalesci/sistema-vendas-graficos

Sistema simples de vendas e estoque, com gráficos usando Matplotlib

matplotlib matplotlib-pyplot numpy python-script python3 venv virtualenv

Last synced: 02 May 2026

https://github.com/camsai/jupyterlite

CAMSAI JupyterLite is a lightweight, browser-based environment tailored for AI-driven materials science research. It integrates CAMSAI tools, schemas, and workflows, enabling users to validate data, run simulations, and explore materials science applications without requiring local installation or setup.

chemistry jupyterlite materials materials-design materials-informatics materials-science matplotlib numpy pyodide python scipy

Last synced: 02 May 2026

https://github.com/velascort/image_equalizer_gui

Image equalizer using LUT interface.

image-processing lut numpy python scikit-image ttk

Last synced: 10 May 2026