An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/hasanyahya101/pareto-tutorial-python

This tutorial demonstrates how to create Pareto charts using Python libraries like pandas, NumPy, and Matplotlib. All code for this is given in the Jupyter Notebook.

chart charts code-blocks ipynb jupyter-notebook license markdown matplotlib mit numpy pandas pareto py python

Last synced: 07 Feb 2026

https://github.com/sharkb8t/credit-risk-classification

Demonstrates my abilities to use Jupyter Notebook with scikit-learn to train and evaluate a machine learning model.

jupyter-notebook numpy pandas pathlib python scikit-learn

Last synced: 15 Apr 2026

https://github.com/coueghlani/nlp

Proyecto de Procesamiento de Lenguaje Natural y Análisis de Datos

mineria-de-datos nlp nlp-machine-learning nltk numpy procesadores-de-lenguajes sklearn spacy

Last synced: 08 Feb 2026

https://github.com/carterbox/libimage

Provides large (2k) test images as NumPy arrays.

images numpy python

Last synced: 15 Apr 2026

https://github.com/puureya2/catalogue-price-scraper

Scrape & store product prices with Selenium & BeautifulSoup4. Visually represent data with MatPlotLib,Personal Project

beatifulsoup4 chrome-webdriver matplotlib numpy pandas python python-csv selenium webscraping

Last synced: 16 Apr 2026

https://github.com/karthikarajagopal44/numpy-beginner-to-advanced

This repository is designed to help you master NumPy, the fundamental package for scientific computing with Python. Whether you are just getting started or looking to refine your skills with advanced techniques, this repo covers a comprehensive range of topics and functionalities. It covers the range of topics from beginner to advance.

data-science numpy numpy-library numpy-tutorial python

Last synced: 16 Apr 2026

https://github.com/hansalemaos/np_asciiart

Ascii art with numpy

ascii numpy opencv python

Last synced: 16 Apr 2026

https://github.com/shakhthi/mlproject

This project is an end-to-end ML solution that predicts math scores from student attributes. It starts with ingesting and preprocessing data, followed by feature engineering to pinpoint key predictors. A robust model is then trained and fine-tuned before being seamlessly deployed to deliver real-time predictions.

jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn sklearn

Last synced: 16 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_09

Модуль 9. Підбір гіперпараметрів НМ. Глибоке навчання. Tensorflow. Keras.

adam-optimizer data-science google-colab keras keras-tensorflow numpy python

Last synced: 16 Apr 2026

https://github.com/vgvr0/analisis-de-datos-con-streamlit-numpy-pandas-y-matplotlib

Sistema completo de análisis y visualización de datos cinematográficos que proporciona insights detallados sobre películas, incluyendo análisis financiero, puntuaciones, tendencias temporales y un sistema de recomendación. Desarrollado con Python y Streamlit, ofrece una interfaz interactiva y amigable para explorar datos de películas.

matplotlib numpy pandas plotly plotly-dash recommendation-system streamlit

Last synced: 16 Apr 2026

https://github.com/codersales/machine-learning-classification

machine learning jupyter notebooks | data-science | priority | relevant | significant | green-light | 1 | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4 | frequent

authorized classification current decision-tree ensemble-techniques jupyter machine-learning more-than-100-commits more-than-300-commits numpy pandas python3 ranked repository-5 seaborn sklearn stacking sub-critical supervised-learning workstation

Last synced: 07 Mar 2026

https://github.com/kavayk29/speech-recognition-using-tdnn-and-data-augmentation

Developed a speech recognition system using TDNN, preprocessing audio, extracting MFCC features, and training the model. Fine-tuning with augmented data (19,000 rows) improved accuracy from 9% to 80% training and 40% validation. Data augmentation proved crucial for enhancing model performance and generalization. Still working to increase the acc.

deep-learning keras-tensorflow numpy os pandas tdnn tensorflow

Last synced: 14 Feb 2026

https://github.com/killervardhan8/gesturedecode

The Sign Language Interpretation project focuses on recognizing and interpreting hand gestures to facilitate communication for individuals who use sign language. This project leverages computer vision and machine learning techniques to accurately identify and translate hand signs into text

csv mediapipe numpy python tensorflow

Last synced: 28 Feb 2026

https://github.com/ben854719/trade-transfer-workflow-optimizer

This AI portfolio tools securely analyzes asset allocation, sector exposure, and performance. Its graph compares real and predicted close prices, enhanced with sentiment indicators such as “Confident” and “Surprised.” An avatar assistant provides user guidance combining technical accuracy with intuitive explanations.

agentic-ai avatar cryptography emotion-analysis encryption gemini json langgraph langraph machine-learning-algorithms matplotlib mcp-server numpy opencv-python pandas python pytorch rs256 seaborn

Last synced: 15 Feb 2026

https://github.com/anujdutt9/reinforcement_learning

Reinforcement Learning using Numpy and PyTorch.

numpy python3 pytorch reinforcement-learning

Last synced: 16 Apr 2026

https://github.com/urvee1810/air-quality-prediction-using-arima

Analyzed Beijing's air quality data using time series analysis and ARIMA modeling to forecast PM2.5 pollution levels. Identified seasonal patterns and correlations between pollutants, revealing insights into urban air quality trends and potential contributing factors.

arima-model dickey-fuller-test matplotlib numpy pandas python seaborn statistical-analysis time-series-analysis

Last synced: 16 Apr 2026

https://github.com/costrouc/xnd-notebooks

A collections of notebooks that demonstrate xnd, ndtypes, and gumath features

binder gumath ndtypes notebooks numba numpy xnd

Last synced: 16 Apr 2026

https://github.com/ejw-data/tableau-songs

Brief analysis of songs from 1890 - 2015 using pandas and Tableau

numpy pandas python songs tableau

Last synced: 16 Apr 2026

https://github.com/khaymanii/calories-burnt-prediction-model

This model was built using Python and XGBoost Regression algorithm

matplotlib numpy pandas python scikit-learn

Last synced: 16 Apr 2026

https://github.com/priyanshul28/ml_eda_regression_energyconsumptionforecasting

An EDA and Machine Learning Time-Series Regression Forecasting exercise on the PMJE Energy Consumption dataset demonstrating time-series analysis and the use of Time-Series Split, XGBoost, etc. The model is optimized using hyperparameter tuning through GridSearchCV. A Rob Mulla guided exercise.

forcasting machine-learning numpy pandas scikit-learn time-series-analysis

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npchessimage

Creates a chessboard pattern as an image.

chessboard cv2 image numpy

Last synced: 05 Mar 2026

https://github.com/syedzaheerabbas/risk-analytics-with-python

This project focuses on developing a basic understanding of risk analytics in banking and financial services and understand how data is used to minimize the risk of losing money while lending to customers.

eda hypothesis-testing numpy pandas python risk-analysis seaborn

Last synced: 17 Apr 2026

https://github.com/vaishnavis03/finlatics_ml_program

This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.

correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn

Last synced: 17 Apr 2026

https://github.com/kodemartin/np-pd-portfolio

Portfolio of numpy, pandas, and matplotlib solutions

matplotlib numpy pandas

Last synced: 17 Apr 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/broodhoney/blue-book-for-bulldozers

This repository holds the project which solves a regression problem on predicting the futures sales of bulldozers. This is from a kaggle competition.

matplotlib numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/niteshchawla/clustering-ml

Analyzing the vast data of learners can uncover patterns in their professional backgrounds and preferences. Allowing Scaler to make tailored content recommendations and provide specialized mentorship.

cluster-analysis clustering hierarchical-clustering k-means-clustering machine-learning numpy pca-analysis visualisation

Last synced: 17 Apr 2026

https://github.com/prajapatsur/data-analytics-1

Wanted to revise all the basic concepts of the data cleaning, transformation and analysis, so worked on this project. Took a little bit help of AI tool to implement some fresh concepts.

matplotlib numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/prashver/end-to-end-model-deployment-on-aws

Student Performance Analysis with Machine Learning analyzes factors impacting student outcomes using a robust machine learning pipeline. Achieving an impressive R2 score, it predicts student performance effectively. With extensive data preprocessing and deployment on AWS Elastic Beanstalk, it ensures scalability and high availability.

amazon-web-services aws-elastic-beanstalk end-to-end-deployment flask machine-learning-algorithms matplotlib numpy pandas scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/felixamaladhas/amazon-reviews-sentiment-analysis

This is a sentiment analysis project that classifies Amazon product reviews as positive or negative using machine learning techniques.

matplotlib numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/rakibhhridoy/predictingsequence-timeseriesanalysis

Time series data easy handling purposes in TensorFlow and python functional programming. Time Series data is handled and pre-processed by different steps,like making window of the data,splitting data for model purpose is also different than other processing methods. All has combined for easy access.

forecasting functional-programming keras lstm lstm-neural-networks moving-average numpy python tensorflow time-series windowed

Last synced: 02 Apr 2026

https://github.com/ffatahillah7/eda-dsf-dibimbing-titanic-accident

Data Science Fair 3.0 Dibimbing Portofolio - Analyctics and Learning from titanic dataset

data numpy pandas python science seaborn

Last synced: 17 Apr 2026

https://github.com/nikhilsree5/aerofitcasestudy

Customer Profiling and Market Segmentation for AeroFit Treadmills: A Data-Driven Approach

customerprofile eda numpy pandas python visualization

Last synced: 17 Apr 2026

https://github.com/theakashshukla/diabetes-prediction

Diabetes Prediction using futuristic tool

ai jupiter-notebook ml numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npfastsortcpp

Sorts int/float 4x faster than numpy by using parallel_sort from C++ (Windows only)

cpp fast numpy python sort

Last synced: 17 Apr 2026

https://github.com/mugambi645/classify-song-genre-from-audio-data

Classify song genres using supervised learning models & PCA for dimensiality reduction

decision-tree-classifier logistic-regression numpy pandas pca-analysis

Last synced: 17 Apr 2026

https://github.com/neetigyab/pdf_form_parser

Older Test version of designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Apr 2026

https://github.com/datamallo/python101

The ressources you need to learn or get better in Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_01

GoIT, Python Data Science, Homework number 1.

data-science google-colab jupyter-nootebook numpy numpy-arrays python

Last synced: 18 Apr 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026

https://github.com/thoughtscript/python_api_2023

Some fun with Flask, Python, and Machine Learning

angular docker docker-compose flask machine-learning mysql numpy

Last synced: 04 Apr 2026

https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation

An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.

clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/perpendicooler/elementary-research-for-steamboat-willie-s-store-in-poland

An elementary research for a company to opening store in a city using gurobi and pulp optimization.

christofides-algorithm gurobipy numpy pandas pulp python3 scikit-learn travelling-salesman-problem

Last synced: 05 Apr 2026

https://github.com/al-ghaly/prosper-loans-analysis

A statistical Analysis Project, to analyze the data of a finance company’s loans Using Python packages (pandas – NumPy – seaborn – matplotlib)

data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics

Last synced: 18 Apr 2026

https://github.com/ahmaruff/kmeans-img-compress

Simple python image compression library using K-Means clustering

color-quantization image-compression image-processing kmeans kmeans-clustering numpy python

Last synced: 18 Apr 2026

https://github.com/znreza/paradoxes

Q-learning algorithm from scratch in python for optimal path finding

grid grid-search matrix numpy python qlearning-algorithm reinforcement-learning

Last synced: 18 Apr 2026

https://github.com/linggarm/lvq-and-elm-with-iris-dataset

Implementation of Learning Vector Quantization (LVQ) and Extreme Learning Machine (ELM) with Iris Dataset

artificial-intelligence deep-learning elm extreme-learning-machine learning-vector-quantization lvq machine-learning numpy python soft-computing

Last synced: 18 Apr 2026

https://github.com/pedro-stanaka/dlnd-projects

Projects from Deep Learning Nanodegree from Udacity

artificial-neural-networks deep-learning machine-learning numpy python3

Last synced: 18 Apr 2026

https://github.com/rhazra-003/modelinglab

Assignment - Solution of Modeling Lab of 7th Semester (Batch of 2023) of NIT Durgapur

jupyter-notebook matplotlib-pyplot numpy pandas python3 shapely

Last synced: 19 Apr 2026

https://github.com/csanri/simplenn

Simple Neural Network for MNIST Classification

machine-learning-algorithms neural-network numpy python

Last synced: 19 Apr 2026

https://github.com/crazy-dot/zomato-data-analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

data-cleaning-and-preprocessing data-manipulation-with-pandas inferential-statistics kaggle-dataset numpy pandas-python python zomato-data-analysis

Last synced: 19 Apr 2026

https://github.com/abdoufermat5/gps_statistiques

Calculs statistiques pour le cloud personnel

geolife gps numpy pandas python statistics trajectory-analysis

Last synced: 20 Apr 2026

https://github.com/shreeya-methuku/predictive_analysis

This project leverages machine learning to perform time-series forecasting on key financial metrics, including sales (income), expenses, cash flow, and profit.

arima financial-forecasting numpy pandas sarima xgboost

Last synced: 20 Apr 2026

https://github.com/montanaz0r/suicide-rate-analysis

Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector

analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate

Last synced: 20 Apr 2026

https://github.com/levieyal/oop-course-exercises

The first exercises in the OOP in Java and python course that took place in the first semester of 2021

docs git github graph-algorithms interfaces numpy python solid-principles unit-testing

Last synced: 20 Apr 2026

https://github.com/separatrixxx/crypto_labs_6_sem

👓 Laboratory work for the 6th semester of MAI on cryptography

cryptography matplotlib numpy python

Last synced: 21 Apr 2026

https://github.com/prince-gh/numpy

Learn Numpy step by step

library numpy prince-gh python

Last synced: 22 Apr 2026

https://github.com/abhinav330/iris-dataset-classic-ml-problem

This repository demonstrates the classification of Iris flowers into three species (Setosa, Versicolor, Virginica) using a Support Vector Machine (SVM) classifier.

classification data-science iris-dataset jupyter machine-learning numpy pandas python sklearn support-vector-machines svm-classifier

Last synced: 06 Jun 2026

https://github.com/haranrk/linear-regression

Implemented Linear Regression and some of it's variants

machine-learning matplotlib numpy python python3

Last synced: 24 Apr 2026

https://github.com/dpb24/datakind-2025

📊 Data Analytics: Identifying Actionable Insights to Improve Financial Inclusion in Kenya

data-analytics data-visualization databricks datakind exploratory-data-analysis financial-data geopandas jupyter-notebook kenya matplotlib numpy python seaborn

Last synced: 24 Apr 2026

https://github.com/nightmachinery/mlp_from_scratch

A simple MLP using only numpy. The computational graph is not automatically tracked, but the network supports arbitrarily many batch axes.

deep-learning deep-neural-networks deeplearning educational mlp mlp-classifier mlp-networks numpy

Last synced: 25 Apr 2026

https://github.com/timothyjan/cartoonify-image-opencv

Application that transforms images into its cartoon using Python, OpenCV(cv2), easyguy, numpy, imageio, and tkinter.

easygui imageio numpy opencv2 tkinter

Last synced: 25 Apr 2026

https://github.com/davewhit3/bill-detector

Bill detection. Usage OCR algorithm to read content

ai numpy ocr-recognition python3

Last synced: 26 Apr 2026

https://github.com/prathush-kumar/fake_news_detection

Developed an intelligent system to identify and classify fake news articles using Natural Language Processing (NLP) and Machine Learning techniques.

classification machine-learning natural-language-processing numpy pandas sklearn-library

Last synced: 26 Apr 2026

https://github.com/abhisingam/brickbreaker

A Terminal-Based game made in Python that emphasises upon the use of OOPs concepts. Python | NumPy | Game Development

game-development numpy python3

Last synced: 26 Apr 2026

https://github.com/m-k-l-s/sudoku

A trivial sudoku solver in Python, using recursion and numpy.

backtracking numpy python recursion solver sudoku

Last synced: 27 Apr 2026

https://github.com/hinanohart/dostosim

Dostoevskian agent-based simulation: seven literary archetypes as a unified reward decomposition. Anti-optimisation, dignity-as-utility, ideological cascade, gambler-ruin — pre-registered H1-H4 with bootstrap CI / BH-FDR / Cliff's delta.

abm agent-based-modeling behavioral-economics bootstrap-ci computational-social-science dostoevsky literature networkx numpy python reproducible-research

Last synced: 27 Apr 2026