An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/ryancswallace/npdb

Parallel NumPy-like interface for large n-dimensional arrays on disk.

mapreduce-designpatterns numpy numpy-arrays parallel-computing python

Last synced: 09 Feb 2026

https://github.com/rampal-punia/data-science-toolkit

Your Go-To Resource for Essential Data Science Related Commands, Concepts, Quick Overviews and Useful Functions.

artificial-intelligence data-science keras machine-learning matplotlib nlp nlp-machine-learning numpy pandas pythorch sql tensorflow

Last synced: 09 Feb 2026

https://github.com/hansalemaos/all_against_all

Each item of a list against all others

list numpy

Last synced: 02 Apr 2026

https://github.com/sarowarahmed/predicting-kolkata-house-price

🏠 Predicting Kolkata House Price: A web app powered by a Machine Learning model, built with Numpy, Pandas, Scikit-learn, and Streamlit, to predict house prices in Kolkata. Deployed on Streamlit Cloud for easy access and real-time predictions.

app kolkata linear-regression machine-learning numpy pandas scikit-learn streamlit

Last synced: 26 Feb 2026

https://github.com/sahilk12nayak/data-capstone-projects-using-python-libraries

In this data project we will focus on exploratory data analysis. This project utilizes Python libraries such as NumPy, Pandas, Matplotlib, and Seaborn for data manipulation, analysis, and visualization.

matplotlib numpy pandas seaborn

Last synced: 15 Apr 2026

https://github.com/murugavl/forage_tata_genai_powered_data_analytics

This repository showcases my work from the TATA GenAI Powered Data Analytics Virtual Experience on Forage, where I applied data cleaning, analysis, and visualization techniques to derive actionable business insights.

excel genai matplotlib numpy pandas powerbi python

Last synced: 09 Feb 2026

https://github.com/mahmud-kandawala/simulating-covid-19-python-

This repository contains a Python program that simulates the spread of COVID-19 using Numpy and Matplotlib for calculations and visualization. The simulation offers an interactive way to understand virus dynamics and the impact of epidemiological parameters.

matplotlib numpy python

Last synced: 16 Apr 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/mariam-zaidi/bike-sharing-prediction-model

Build a multiple linear regression model for the prediction of demand for shared bikes.

linear-regression matplotlib numpy pandas rfe seaborn sklearn

Last synced: 09 Feb 2026

https://github.com/harmanveer-2546/statistics-for-machine-learning

Statistical tools help you clean and organize your data. You can identify outliers, manage missing values, and ensure your data is in a format that the ML algorithms can understand.

inline matplotlib matplotlib-styles numpy pandas probability python seaborn statistics

Last synced: 18 Apr 2026

https://github.com/baschin1103/neural-network-building-in-python

In this repository is a programm with a pretrained artificial neural network from Building AI. The network predicts the price of a cabin based on it's attributes.

bias building-ai forward-pass functioning identity linear-combinations neural-network numpy python regression relu

Last synced: 16 Apr 2026

https://github.com/foggysmile/ml_labs_itmo

Лабораторные работы по машинному обучению

bert decision-trees jupyter-notebook machine-learning numpy pandas random-forest xgboost

Last synced: 11 Feb 2026

https://github.com/khinthandarkyaw98/python_for_engineers

This particular Python notebook is designed to provide Engineers with an opportunity to practice scientific computations.

engineering numpy python scientific-computing youtube

Last synced: 16 Apr 2026

https://github.com/cherukuri-thanu/conjoint-analysis-netflix

This repository contains configuration files for performing conjoint analysis on Netflix survey data.

conjoint-analysis logistic-regression matplotlib-pyplot netflix numpy pandas squantify statsmodels

Last synced: 16 Apr 2026

https://github.com/hansalemaos/np_asciiart

Ascii art with numpy

ascii numpy opencv python

Last synced: 16 Apr 2026

https://github.com/tzepart/choosing-oil-region

Choosing and initialisation a machine learning model that will help determine the region where production will bring the greatest profit with the least risk of loss.

matplotlib ml numpy pandas seaborn sklearn

Last synced: 16 Apr 2026

https://github.com/harmonicode/signal_equalizer

Signal Equalizer is a Python-based application designed to process and analyze various types of signal data, including audio and ECG signals. It provides functionalities for reading, processing, visualizing, and enhancing signal data using techniques such as Wiener filtering.

audio-processing numpy pyqt5 scipy signal-processing wiener-filter

Last synced: 13 Feb 2026

https://github.com/lexxai/goit_python_ds_hw_09

Модуль 9. Підбір гіперпараметрів НМ. Глибоке навчання. Tensorflow. Keras.

adam-optimizer data-science google-colab keras keras-tensorflow numpy python

Last synced: 16 Apr 2026

https://github.com/codersales/machine-learning-classification

machine learning jupyter notebooks | data-science | priority | relevant | significant | green-light | 1 | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4 | frequent

authorized classification current decision-tree ensemble-techniques jupyter machine-learning more-than-100-commits more-than-300-commits numpy pandas python3 ranked repository-5 seaborn sklearn stacking sub-critical supervised-learning workstation

Last synced: 07 Mar 2026

https://github.com/ben854719/trade-transfer-workflow-optimizer

This AI portfolio tools securely analyzes asset allocation, sector exposure, and performance. Its graph compares real and predicted close prices, enhanced with sentiment indicators such as “Confident” and “Surprised.” An avatar assistant provides user guidance combining technical accuracy with intuitive explanations.

agentic-ai avatar cryptography emotion-analysis encryption gemini json langgraph langraph machine-learning-algorithms matplotlib mcp-server numpy opencv-python pandas python pytorch rs256 seaborn

Last synced: 15 Feb 2026

https://github.com/type0-1/salary-truth-predictor

A supervised machine learning regression model. Includes problem statement, approach to solution, code, images, dataset, and Jupyter Notebook for interactive analysis.

linear-regression machine-learning matplotlib-pyplot ml numpy pandas polynomial-regression projects scikitlearn-machine-learning support-vector-regression

Last synced: 16 Apr 2026

https://github.com/jessicahora/studies-on-linear-algebra

Repositorio com Estudos sobre Algebra Linear.

linalg linear-algebra matplotlib-pyplot matrix numpy python scipy

Last synced: 01 Mar 2026

https://github.com/sasanka14/water_quality_predictions

Water Quality Prediction - College Project 🌊💧 Predicts water potability (safe/unsafe) using ML models like XGBoost & Random Forest. Features data preprocessing, feature importance, model evaluation, and visualizations. Built with Python, Pandas, Scikit-learn & Seaborn for analysis. 🚀

anaconda jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn xgboost

Last synced: 16 Apr 2026

https://github.com/williyam-m/movie_booking_app

A Django-based movie booking app featuring an admin dashboard with CRUD functionality for movies, theatres, screens, shows, and bookings. Includes an ML-powered movie recommendation system.

django machine-learning numpy orm pandas python sckiit-learn sqlite3

Last synced: 16 Apr 2026

https://github.com/5hraddha/sda-megaline-the-best-plan

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget.

data-visualization hypothesis-testing matplotlib numpy pandas scipy seaborn statistical-data-analysis

Last synced: 16 Apr 2026

https://github.com/hygull/trypandas

A repository containing Python pandas and other data science related sources and other resources used to work with pandas, numpy, matplotlib etc.

matplotlib numpy pandas python3

Last synced: 16 Apr 2026

https://github.com/costrouc/xnd-notebooks

A collections of notebooks that demonstrate xnd, ndtypes, and gumath features

binder gumath ndtypes notebooks numba numpy xnd

Last synced: 16 Apr 2026

https://github.com/magnusrodseth/disaster-tweets

📚 Assignments in the course IT3212 - Data Driven Software at NTNU. Our task is to classify whether a tweet is related to a disaster or not.

adaboost jupyter-notebook logistic-regression numpy pandas python python3 random-forest support-vector-machines xgboost

Last synced: 16 Apr 2026

https://github.com/anas436/student-performance-analysis

In this project I have constructed a Machine Learning System which will analyis students performance with about their academic records. Note that, this project will work with any students recods which you want to provide.

data-analysis jupyter-notebook matplotlib numpy pandas python3 seaborn

Last synced: 16 Apr 2026

https://github.com/iankitnegi/python_projects

Data Analyst Toolkit: A comprehensive collection of Python scripts and notebooks designed for data analysis tasks. Features data cleaning, visualization, statistical analysis, and machine learning models. Ideal for analysts seeking efficient, reproducible workflows.

matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/moonmoonsamal/meesho_order_financial_analysis

Generating insights from Meesho sales data (Oct-Nov)

cleaning-dataset eda numpy pandas visualization

Last synced: 16 Apr 2026

https://github.com/sahiltiwariiii/dssp

Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 27 Mar 2026

https://github.com/shreeparab1890/indian-cricketer-classifier

This notebook is trying to bulia a model which will predict a Indian Cricketer based on the given image. In this project we have handled 8 Indian Cricketers and build a model to classify the given image between this 8 Cricketers.

image-classification matplotlib numpy opencv pandas python random-forest-classifier scikit-learn sklearn streamlit

Last synced: 01 Apr 2026

https://github.com/michaelgermini/python-3d-programming-book

Complete Python & 3D Programming Book: 30 chapters, 120+ examples, real-time ray tracing, ML graphics, OpenGL, NumPy

3d-graphics computer-graphics educational game-development graphics-programming machine-learning numpy opengl python ray-tracing tutorial

Last synced: 16 Apr 2026

https://github.com/laurencewarne/eratosthenes

Random implementations of the sieve of Eratosthenes

algorithm eratosthenes numpy python3

Last synced: 16 Apr 2026

https://github.com/kganitis/bam-engine

High-performance Python implementation of the macroeconomic agent-based BAM model using ECS architecture

agent-based-modeling computational-economics macroeconomics numpy python

Last synced: 04 Mar 2026

https://github.com/shriansh8619/eda_customer_behavior

This project analyzes Nielsen transaction data using Python to uncover sales trends, customer preferences, and purchasing patterns. It provides insights to optimize inventory, create targeted promotions, and improve store performance. The goal is to help supermarkets enhance strategies and boost profitability based on data-driven insights

matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/bishopce16/school_district_analysis

The school board requested an analysis on the various performance metrics for the school district.

data-analysis jupyter-notebook numpy pandas python visual-studio-code

Last synced: 16 Apr 2026

https://github.com/jay6430/python-for-data-science

This Repository includes all the Python concepts that are necessary for Data science.

csv data-visualization datascience-machinelearning html-css ipynb-jupyter-notebook matplotlib numpy pandas python3 xml

Last synced: 04 Mar 2026

https://github.com/eckelsjd/copier-numpy

Template for a numpy-based scientific computing Python package.

copier-template numpy pdm python

Last synced: 05 Mar 2026

https://github.com/kunalkumar2001/data-analyst-python-and-its-library

Python (Basics to Advance) and Its Library for Data Analyst

dataanalyst matplotlib numpy pandas python seaborn

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npchessimage

Creates a chessboard pattern as an image.

chessboard cv2 image numpy

Last synced: 05 Mar 2026

https://github.com/gautam-cs/data_science

In this project, I was trying to get most of the relevent information of the data using library like numpy, pandas. I have worked on data like aadhar data, Olympics data

aadhar-data csv data-science json numpy olympics-data pandas python35

Last synced: 17 Apr 2026

https://github.com/yanxue06/housing-price-predictor

Python-based California housing price predictor

jupyter numpy pandas python seaborn

Last synced: 06 Mar 2026

https://github.com/ricobuilds/ml-roadmap

Opinionated roadmap to machine learning in 2023

conda huggingface machine-learning matplotlib numpy pandas python pytorch

Last synced: 06 Mar 2026

https://github.com/neerajcodes888/a-novel-used-car-price-prediction-model-based-on-lindenoise

Welcome to the LinDenoise Repository! LinDenoise offers a smart solution for cleaning noisy data in regression tasks. Integrated seamlessly within the widely-used scikit-learn framework, it effortlessly enhances data quality while improving predictive accuracy

car-price-prediction deep-learning ipynb-notebook machine-learning numpy pandas python3 visualization

Last synced: 06 Mar 2026

https://github.com/kheriberto/knn_project

This is a simple project that uses dummie data to practice and demonstrate my knowledge of the KNN algorithm.

data-analysis knn-classifier numpy python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/amirmohammadgholampour/mall-customer-segmentation

Project for segmenting customers in a shopping mall using the Clustering algorithm.

numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/satyas567/weatherdataanalysis

Comprehensive Weather Data Analysis with Python: Explore trends, visualize patterns, detect outliers, and predict temperature using humidity and wind speed

jupyter-notebook linear-regression matplotlib numpy pandas python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/amanimagdi/data-flux-fullstack

This repository is for a full stack application provides in-depth analysis and manipulation of various data types (tabular data, RGB images and textual data), Flask on BE and ReactJs on FE.

flask javascript numpy pandas python reactjs tailwindcss

Last synced: 02 Apr 2026

https://github.com/pranavv34/customer-segmentation

Predictive modeling and customer segmentation project using neural networks to forecast sales and categorize customers for targeted marketing in the online retail sector.

keras matplotlib numpy pandas python seaborn sklearn tensorflow

Last synced: 02 Apr 2026

https://github.com/loong64/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

autograd deep-learning gpu loong64 loongarch64 machine-learning neural-network numpy python tensor

Last synced: 02 Apr 2026

https://github.com/kernel-loophole/numerical-computing

Numerical Computing use python(Numpy,metplotlib).

numpy python quantum-computing

Last synced: 17 Apr 2026

https://github.com/kiarashyavari/python_interview

Practice intermediate to advance topics of Python here

decorators generator multiprocessing multithreading numpy pandas python python3

Last synced: 17 Apr 2026

https://github.com/orliluq/inmersion-datos-python

Desarrollar modelos de machine learning para predecir la probabilidad de incumplimiento crediticio de los clientes, utilizando diferentes algoritmos de clasificación (Regresión Logística, Árboles de Decisión, Random Forest, Naive Bayes).

colab-notebook numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/felixamaladhas/amazon-reviews-sentiment-analysis

This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.

matplotlib numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/rakibhhridoy/predictingsequence-timeseriesanalysis

Time series data easy handling purposes in TensorFlow and python functional programming. Time Series data is handled and pre-processed by different steps,like making window of the data,splitting data for model purpose is also different than other processing methods. All has combined for easy access.

forecasting functional-programming keras lstm lstm-neural-networks moving-average numpy python tensorflow time-series windowed

Last synced: 02 Apr 2026

https://github.com/nikhilgugwad/sentiment-analysis

Sentiment analysis for the Kannada language to classify Kannada sentences into different emotions.

numpy pandas scikit-learn

Last synced: 17 Apr 2026

https://github.com/lmizner/codecademy_nba_trends

Investigate whether there is an association between two variables

chi2-contingency jupyter-notebook matplotlib numpy pandas pearson-correlation pyplot python seaborn

Last synced: 17 Apr 2026

https://github.com/tashi-2004/deep-learning-grid-world-q-learning

Deep Learning Grid World Q-Learning . Implement Q-learning in a 5x5 grid where an agent navigates obstacles and rewards. Train the agent with varying learning rates, visualize its progress, and see Q-values as heatmaps. Run the script to start training and view results. Contributions are welcome!

agent-based-modeling artificial-intelligence deep-learning deep-q-learning exploitation exploration machine-learning machine-learning-algorithms matplotlib-pyplot numpy python q-learning q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms state-value-function training

Last synced: 17 Apr 2026

https://github.com/prahaladhchandrahasan/object-localization

this is a blue colour detector in keras and tensorflow

computer-vision keras numpy pkl tensorflow

Last synced: 17 Apr 2026

https://github.com/kaushik-puttaswamy/exploratory-data-analysis-using-python

This project applies Exploratory Data Analysis (EDA) using visualizations like box plots, scatter plots, and correlation matrices. It helps identify patterns, select ideal functions via least squares error, and map test data while evaluating deviations.

bokeh expolatory-data-analysis matplotlib numpy pandas python seaborn sqlalchemy

Last synced: 17 Apr 2026

https://github.com/shhiivvaam/regression

Linear Regression Model

linear-regression numpy python

Last synced: 10 May 2026

https://github.com/rosieoh/emergency_dataanalysis

오픈데이터분석-응급의료체계 방안 정책 제안 데이터 분석

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/datamallo/python101

The ressources you need to learn or get better in Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026