An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/johnrobertsantos/exploratory-data-analysis-using-python

This project represents the culmination of my Python learning journey.

jupyter matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/mogankumar/bird-flock-simulation-with-pyspark

A PySpark-based bird flock simulation modeling alignment, cohesion, and separation forces with a leader guiding the flock. Efficiently handles large-scale simulations and generates a visualized GIF of the flock’s movement.

argparse matplotlib numpy pyspark python3

Last synced: 01 May 2026

https://github.com/koki0702/chainer0

A Small Chainer (Chainer From Zero)

chainer deep-learning neural-networks numpy python

Last synced: 01 May 2026

https://github.com/macnianios/fandango

a udemy project for seaborn data visualizations and basic pandas commands

numpy pandas python seaborn seaborn-plots udemy-course-project

Last synced: 01 May 2026

https://github.com/sairupeshl/portfolio-risk-calculator

A portfolio risk engine using Python, integrating the Yahoo Finance API to ingest and preprocess historical multi-asset price data

numpy pandas python yfinance

Last synced: 08 Jun 2026

https://github.com/sandip2512/play-store-app-review-analysis

EDA on Play Store apps and user sentiment reviews

eda heatmaps matplotlib numpy pandas plotly seaborn sentiment-analysis

Last synced: 01 May 2026

https://github.com/mohhasan1/machine-learning

This repository is where I learn and store information about various machine learning libraries, such as Pandas, scikit-learn, and others. It serves as a personal reference for exploring and experimenting with different ML tools and techniques.

machine-learning matplotlib numpy pandas seaborn sklearn

Last synced: 01 May 2026

https://github.com/cbhihe/nn_binary-classifier

NN-based binary classification with logistic regression loss, back-propagation with gradient descent and optional regularization

classifier gradient-descent logistic-regression neural-network numpy python regularization sklearn

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/hansalemaos/a_pandas_ex_image_tools

Uses pandas/numpy/numexpr for operations on pictures - very fast

colors image locate numexpr numpy opencv pictures python

Last synced: 01 May 2026

https://github.com/deusares/funpil

A collection of PIL, Numpy and OpenCV functions to manipulate images

batch image-processing lazy media numpy opencv pil pillow python script

Last synced: 01 May 2026

https://github.com/satyamtripathi8/tools_for_data_science

Introduction to Data Science Tools(Python)

data-analysis-python matplotlib-pyplot numpy pandas

Last synced: 01 May 2026

https://github.com/offthetab/nn_homework

Практические работы по нейронным сетям МИРЭА 3 курс.

gensim ignite matplotlib neural-network nltk numpy pandas scipy-stats seaborn torch torchvision

Last synced: 01 May 2026

https://github.com/dostuffthatmatters/jax-capability-analysis

demo repository containing the experiments for my master's seminar @ TUM

autograd click deep-learning huggingface jax jit-compilation mnist numpy python pytorch

Last synced: 01 May 2026

https://github.com/aniketwithpython/fundamentals-of-scientific-computing

Fundamentals of Scientific Computing (MS1980) Notebook

jupyter matplotlib numpy pandas python

Last synced: 01 May 2026

https://github.com/rgandhi8/inventory-management-system

This project is an AI-driven Inventory Management System designed to optimize supply chain efficiency by leveraging machine learning techniques for demand forecasting. The system integrates ARIMA and ANNs to predict inventory needs, minimize shortages, and reduce holding costs also enhances decision-making by automating inventory planning.

dashboard inventory-management-system machine-learning numpy pandas python seaborn

Last synced: 08 Jun 2026

https://github.com/laszlokorte/canny-notebook

Numpy Canny Edge detector implementation to visualize each step

canny-edge-detection image-processing jupyter-notebook numpy

Last synced: 02 May 2026

https://github.com/ducha-aiki/laf2ell

Utils for LAF to ellipse and back conversion. Also function for drawing LAFs.

ellipse laf local-features numpy visualization

Last synced: 02 May 2026

https://github.com/mgitrov/ann-from-scratch

An artificial neural network implemented from scratch using numpy only.

artificial-neural-networks deep-learning guide numpy

Last synced: 02 May 2026

https://github.com/voidkeishi/computational-science

This repository contains implementations of algorithms from the course "Computational Science" at HUST

computational-science numpy python

Last synced: 02 May 2026

https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Unicorns of India till Sep 2022.

analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 02 May 2026

https://github.com/parham1998/linear_regression_pytorch

Implementation of a simple Linear Regression model with Numpy & PyTorch library

backpropagation gradient-descent linear-regression loss-functions numpy python pytorch

Last synced: 02 May 2026

https://github.com/bharath-tars/streamlit_diabsynth

Predictive Health Analytics for Diabetic Risk Assessment and Personalized Reporting WebApp using Streamlit

deeplearning mahine-learning numpy onrender-deploy pandas seaborn streamlit

Last synced: 02 May 2026

https://github.com/camsai/jupyterlite

CAMSAI JupyterLite is a lightweight, browser-based environment tailored for AI-driven materials science research. It integrates CAMSAI tools, schemas, and workflows, enabling users to validate data, run simulations, and explore materials science applications without requiring local installation or setup.

chemistry jupyterlite materials materials-design materials-informatics materials-science matplotlib numpy pyodide python scipy

Last synced: 02 May 2026

https://github.com/m0saan/python-for-data-analysis

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney,

data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python

Last synced: 02 May 2026

https://github.com/ksharma67/intel-stock-predication-wiith-eda

We are trying to design a model that can predict the price of stock using different methods and algorithms.

eda linear-regression machine-learning-algorithms matplotlib numpy pandas prediction python scaler seaborn skit-learn

Last synced: 02 May 2026

https://github.com/ifte-13/digit-classification-with-mnist

MNIST Digit Classification: Comparative Analysis of Optimization Algorithms

matplotlib mnist-dataset numpy tensorflow

Last synced: 02 May 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/bhavna-kale/cars-eda-project

Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.

data-analysis excel matplotlib numpy pandas python3 searborn streamlit

Last synced: 03 May 2026

https://github.com/krumyakimov/house-price-developments-in-bulgaria

I developed this project for my final exam in the Data Science course at SoftUni.

data-science jupyter-notebook matplotlib numpy pandas scipy seaborn

Last synced: 03 May 2026

https://github.com/raduldev/biopython

Biopython Record Work and few extra coding excercise

bioinformatics biopython gromacs jupyter-notebook matplotlib numpy pandas python3

Last synced: 03 May 2026

https://github.com/kisaa-fatima/foreground-segmentation-using-kmeans-face-recognization-using-knn

Implement a basic version of the interactive image cut-out/segmentation approach called Lazy Snapping. The program uses K-Means Clustering to segment images into foreground and background based on user-provided seed pixels.

face-recognition foreground-segmentation kmeans-clustering knn-classification matplotlib numpy opencv2 pil

Last synced: 03 May 2026

https://github.com/chethannazre/railway-track-obstacle-detection-system

RAILWAY TRACK OBSTACLE DETECTION AND AUTOMATION (RAILTECH AUTOMATION)

automation cv2-library internship numpy ppt project python report

Last synced: 03 May 2026

https://github.com/k-ilyas/ai-projects-handwritten-digit-recognition-titanic-survivor-analysis

AI Projects: Handwritten Digit Recognition & Titanic Survivor Analysis using scikit-learn

machine-learning numpy pandas sickit-learn

Last synced: 03 May 2026

https://github.com/hansalemaos/a_pandas_ex_fastloc

Up to 25 times faster than df.loc by combining np.select and numexpr.evaluate (works with utf-8)

dfloc fast iloc loc numexpr numpy pandas python

Last synced: 03 May 2026

https://github.com/ebrahimar/ai-voice-cloner-xtts-v2

A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.

ai-voice coqui-tts deep-learning json multilingual-tts numpy pydub python speech-synthesis streamlit text-to-speech tts-model voice-cloning xtts-v2

Last synced: 03 May 2026

https://github.com/nak-dt/deep-decision-forest

A custom ensemble tree model inspired by Deep Forest

ai machine-learning machine-learning-algorithms numpy pandas python sklearn

Last synced: 03 May 2026

https://github.com/prathicashettym/machinelearningalgorithms

Contains mini projects and implementation using ML algorithms

jupyter-notebook linear-regression matplotlib numpy pandas panfas python seaborn

Last synced: 03 May 2026

https://github.com/sadmanca/imdb-scraper

Scrapes IMDb's movie database and outputs the data to CSV files.

beautifulsoup data-scraping imdb numpy pandas python requests

Last synced: 03 May 2026

https://github.com/jsnevt/decisiontreecls

Data Science - Árvores aleatórias - RandomForestClassifier

decision-tree-classifier matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analysed TMDB dataset which contains around 11000 movies details. We analyzed to find some interesting facts about the dataset.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/furk4nbulut/uygulamalarla-makine-ogrenmesi-ve-derin-ogrenme-atolyesi

Bu repository, Manisa'da gerçekleştirilen BTK Akademi Uygulamalı Makine Öğrenmesi ve Derin Öğrenme Atölyesi'ne ait eğitim sürecini kapsamaktadır. Atölyede katılımcılar, ileri düzey makine öğrenmesi ve derin öğrenme teknikleriyle ilgili teorik ve pratik bilgiler edinmektedir.

matplotlib numpy pandas scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/jonad/finding_donors

Predicting income with UCI Census Income Dataset using supervised machine learning algorithms

numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 03 May 2026

https://github.com/zongxr/dcic2025-riverwaterprediction

闽江,福建省最大独流入海河流,流域降水丰沛。在新型电力系统和新阶段水利高质量发展的环境下,如何缓解流域防汛压力,提升水资源利用率是水电企业高质量发展的重中之重。目前,基于传统水文学原理已实现未来9小时甲级精度的洪水预报,但存在遇见期与预见精度上仍有提升空间,因此,通过结合人工智能技术实现流域入库流量的精准预测,对保障水库安全调度、防洪及清洁能源稳定供应具有重大战略意义。

lightgbm machine-learning numpy pandas sklearn time-series xgboost

Last synced: 03 May 2026

https://github.com/muskanmi/data_analysis_python

Data analysis on students result dataset using python libraries.

boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn

Last synced: 03 May 2026

https://github.com/ankitgmishra/machinelearning

Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.

artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/alvii147/discretetimelib

Python library for analysis of discrete time signals and systems

numpy pandas scipy signals-and-systems sympy

Last synced: 03 May 2026

https://github.com/schukark/intervalpylib

Interval solver of systems of equations to find the workspace area of a robot

krawczyk matplotlib numpy python3 robotics solving-algorithm sympy systems-of-equations visualization

Last synced: 03 May 2026

https://github.com/nicolashaubricht/analise-dados-pib-per-capita

Este projeto tem como objetivo a análise de dados sobre as variações do PIB per capita em diversos países ao longo dos anos.

numpy pandas plotly python

Last synced: 03 May 2026

https://github.com/atchayaah/home-value-insights-kc

Data-driven project predicting King County housing prices using EDA, regression models, and ML techniques, developed as part of IBM’s Data Analysis with Python course on Coursera.

joblib matplotlib numpy pandas pickle python scikit-learn seaborn

Last synced: 03 May 2026

https://github.com/grachale/convolution_image

The implementation of a function that takes an image (RGB or gray one) and a filter (kernel) and applies a convolution between them.

convolution image-processing numpy python

Last synced: 03 May 2026

https://github.com/akashparley/ipl-data-analysis

The IPL Data Analysis project focuses on extracting valuable insights from IPL match data using various data analytics techniques. By analyzing historical match outcomes, player performances, team comparisons, and venue statistics, the project visualizes trends and patterns through graphs like bar charts, line graphs, and scatter plots.

analytics data-visualization excel metplotlib numpy pandas python seaborn tableau-dashboards

Last synced: 04 May 2026

https://github.com/sohamrider786/house_price_predictor

A machine learning project to predict house prices based on various property features using advanced regression techniques. This project demonstrates end-to-end model development, including data preprocessing, feature engineering, model training, evaluation, and visualization.

machine-learning matplotlib-pyplot numpy pandas random-forest-regression sklearn-library

Last synced: 04 May 2026

https://github.com/anas436/ibm-cloud-data-analysis-of-cars

Exploratory Data Analysis of Cars in Python

ibmwatson matplotlib numpy pandas python3 seaborn

Last synced: 04 May 2026

https://github.com/dr-rompecabezas/medical-data-visualizer

[freeCodeCamp] Medical Data Viz Jupyter Notebook built with Matplotlib and Seaborn

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/bpkaur/whats-in-a-name

Exploring dataset of first names of babies born in the US in order to uncover interesting stories

data-analysis datacamp numpy pandas python3

Last synced: 04 May 2026

https://github.com/timothyjan/intro-machine-learning-polynomial-regression

We use NumPy to implement vectorized linear and polynomial regression models and compare their performance using separate training and test sets.

numpy

Last synced: 04 May 2026

https://github.com/jakeoeding/rippy

A Python seam carving implementation

numpy python3 seam-carving

Last synced: 04 May 2026

https://github.com/jalijuhola/neurcomputing-course

Some of tasks from neurocomputing course

mypy numpy python

Last synced: 04 May 2026

https://github.com/damisparks/become_data_analyst

Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.

data data-analysis data-visualization matplotlib numpy pandas-tutorial

Last synced: 04 May 2026

https://github.com/mchenryspagg/investigate_a_dataset

This is a data analysis project that demonstrates the student's ability to use python data analysis libraries such as pandas, numpy and pyplot in matplotlib to investigate a dataset and answer specific questions from the dataset, thus demonstrating skills in data cleaning, data wrangling, and exploratory data analysis.

data-analysis datetime descriptive-analysis descriptive-statistics exploratory-data-analysis numpy pandas pyplot python visualization

Last synced: 04 May 2026

https://github.com/miozilla/fescarefine

fescarefine :skier::mount_fuji::guide_dog: : Refine & Test ML Models # Features Scaling # Normalization # Datasets

gradient-descent linear-regression matplotlib model-testing mse norm numpy ols overfit pandas scatter seaborn sklearn standardization statsmodels underfit

Last synced: 04 May 2026

https://github.com/florianwoelki/salary_linear_regression

This project is for salary prediction using linear regression without any machine learning library.

jupyter linear-regression machine-learning matplotlib numpy pandas python regression

Last synced: 04 May 2026

https://github.com/cheese23456/ai-based_stock_analysis_and_portfolio_optimisation

📈 Analyze stocks and optimize portfolios with AI-driven insights, technical indicators, and clear investment signals for informed decision-making.

ai asset-allocation financial-analysis huggingface-spaces numpy pandas portfolio-optimisation python scipy-optimize streamlit

Last synced: 04 May 2026

https://github.com/arjunravi26/deep_learning

This repo contains codes and notes to learn deep learning

ann autoencoders cnn deep-learning deepseek-r1 lstm numpy rnn-tensorflow tensorflow transformer

Last synced: 04 May 2026

https://github.com/jonad/titanic_exploratory_analysis

Exploratory Data Analysis on Titanic Dataset

jupyter-notebook numpy pandas python3

Last synced: 04 May 2026

https://github.com/sumaiyyaf/movie-industry-analysis

This project focuses on the acquisition and in-depth analysis of movie data to generate strategic recommendations, guiding our hypothetical company's successful entry into the movie industry.

exploratory-data-analysis numpy pandas python visualization webscraping

Last synced: 04 May 2026

https://github.com/florasteve/ml-foundations-day1

Day-1 ML foundations focused on linear algebra: vectors, dot products, norms, angles, projections, and basic matrix operations—implemented in a Jupyter notebook with NumPy/Matplotlib, clear 2D visuals, a self-quiz, and a brief reflection. Emphasizes how these math primitives map to core ML/AI ideas (similarity, least squares, geometric transforms).

ai data-science education jupyter-notebook linear-algebra machine-learning matplotlib numpy python3 terminal-mac tutorial visualization vscode

Last synced: 04 May 2026

https://github.com/sugha22/data_analysis

Actively upskilling in data science with hands-on learning during a Green Internship focused on environmental sustainability. Used Pandas and NumPy for data preprocessing and cleaning, and created visual dashboards in Excel and Tableau. Gained experience in interpreting sustainability metrics and communicating insights through data storytelling and

data-cleaning data-collection data-storytelling data-transformation excel numpy pandas preprocessing sustainability-metrics tableau-dashboards visualization

Last synced: 04 May 2026

https://github.com/linsanity03/algorithmic_trading

Trading strategy based on the intersection of short-term and long-term moving averages.

jupyter-notebook matplotlib-pyplot numpy pandas

Last synced: 04 May 2026

https://github.com/secary/maths7027

Mathematical Foundations of Data Science

latex mathematics numpy pandas

Last synced: 04 May 2026

https://github.com/saifalibaig/crop-yield-prediction

🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.

eda feature-encoding feature-selection kaggle-dataset lasso-regression linear-regression matplotlib-pyplot numpy onehot-encoding pandas python3 ridge-regression seaborn simpleimputer xgboost-regression

Last synced: 04 May 2026

https://github.com/pedasoft-consult/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 04 May 2026

https://github.com/shamiul5201/selfie_app_features_application

OpenCV project to build skills in image processing: features include Cartoonify and Pencil Sketch effects in a Jupyter Notebook, blemish removal in a Python script, and chroma keying for videos. Perfect for exploring creative and practical computer vision techniques.

computer-vision numpy opencv python

Last synced: 04 May 2026

https://github.com/0nom4d/308reedpipes

Mathematical EPITECH Project - This project computes a representation of a reedpipe.

coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy

Last synced: 04 May 2026

https://github.com/swarup-ranjan-mondal/chat-analyzer

A streamlit app to analyze whatsapp personal and group chats.

matplotlib numpy pandas python seaborn streamlit

Last synced: 04 May 2026

https://github.com/jakubfr4czek/apartment-prices-analysis

This repository contains an analysis of apartments prices in Krakow. It was made for "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course i took at AGH University of Science.

agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning

Last synced: 04 May 2026

https://github.com/manuethomas/credit-default-risk-analysis-eda

This repository contains the detailed EDA analysis of Home Credit Group Dataset. The analysis aims to find demographic and financial factors associated with higher or lower default risks, providing actionable insights for risk mitigation and improved lending practices

bivariate-analysis correlation-analysis data-preprocessing exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas seaborn univariate-analysis

Last synced: 04 May 2026

https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies

Web scraping the top 250 movies in IMDB site.

data-analysis numpy pandas python

Last synced: 04 May 2026

https://github.com/hanzopgp/randomkagglesolving

Trying to build models to resolve some interesting Kaggle datasets thanks to deep learning.

artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow

Last synced: 04 May 2026

https://github.com/dyrits/parite

OPENCLASSROOMS | PERFECTIONNEZ-VOUS EN PYTHON

matplotlib numpy pandas python

Last synced: 04 May 2026

https://github.com/aqueeqazam/machine-learning-using-scikit

This repository contains all of the algorithms used to train the machine learning models using the Scikit library.

numpy scikit-learn

Last synced: 04 May 2026

https://github.com/pabs-code/img-cartoonizer-using-opencv

A streamline app using 3 ways to cartoonized an image using OpenCV and Python.

bilateral-filtering color-quantization edge-detection edge-enhancement laplacian-edge-detection numpy opencv python

Last synced: 04 May 2026

https://github.com/dmkk01/mlp-python

Implementation of a multilayer perceptron using Pytorch and Numpy libraries

mlp numpy pytorch

Last synced: 05 May 2026

https://github.com/srevenant/data-science-alpine

A docker container for data science, using alpine linux and python3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026