An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions that operate on these arrays.
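
As a minimal sketch of what that description means in practice (the array values are made up for illustration), the snippet below builds a two-dimensional array and applies a few of NumPy's vectorized functions to it:

```python
import numpy as np

# A small 2-D array (2 rows, 3 columns); the values are arbitrary.
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

col_means = a.mean(axis=0)   # mean of each column
root = np.sqrt(a)            # element-wise square root
product = a @ a.T            # matrix product, shape (2, 2)
```

Each operation runs over the whole array at once, with no explicit Python loop.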

https://github.com/athul64/exploratory-data-analysis

To preprocess and analyze the given employee dataset, present the findings graphically, and derive meaningful insights to help better understand the company’s workforce.

colab-notebook data-analysis data-visualization matplotlib numpy pandas python seaborn statistical-analysis

Last synced: 25 Feb 2026

https://github.com/mhmudfzli/exploring-mental-health-data

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

catboost lightgbm matplotlib numpy pandas scikit-learn seaborn xgboost

Last synced: 09 Apr 2026

https://github.com/mrtarantoga/mdl-density-histogram

Cython-accelerated MDL histogram density estimation. Implements Kontkanen & Myllymaki's dynamic programming algorithm for optimal variable-width bins, parametric complexity with Ramanujan approximations, and automatic bin count selection. Based on "MDL Histogram Density Estimation" (JMLR 2007).

cython-library mdl numpy pypi-package python3

Last synced: 30 Nov 2025

https://github.com/sarah-ribeiro/curso-pandas-alura

A comprehensive project comprising numerous exercises developed within an Alura course focused on Pandas.

jupyter-notebook matplotlib numpy pandas python

Last synced: 01 Aug 2025

https://github.com/thomasonzhou/minitorch

Rebuilding PyTorch: from autograd to convolutions in CUDA

cuda numba numpy

Last synced: 02 Feb 2026

https://github.com/jswong65/machine_learning_nanodegree

Projects of Udacity Machine Learning nanodegree

machine-learning numpy pandas python scikit-learn scipy

Last synced: 09 Apr 2026

https://github.com/nemeslaszlo/titanic-logistic-regression

This notebook covers a simple analysis with logistic regression on the popular titanic dataset.

cufflinks jupyter-notebook logistic-regression matplotlib numpy pandas plotly seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/aiboy996/npnn

NumPy Neural Network framework

autograd machine-learning neural-network numpy

Last synced: 16 May 2026

https://github.com/Udacity-MachineLearning-Internship/finding_donors

First-stage project in Udacity's 'Intro to Machine Learning with TensorFlow' program, using scikit-learn in Python

csv machine-learning matplotlib numpy pandas python sckiit-learn seaborn sklearn udacity udacity-nanodegree

Last synced: 17 Jul 2025

https://github.com/mahnoorsheikh16/credit-card-default-prediction

This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.

encoding hiplot imblearn json knn-imputer logistic-regression matplotlib numpy pandas pca-analysis plotly scipy seaborn sklearn smote streamlit support-vector-machines timeseries-forecasting visualization xgboost-classifier

Last synced: 06 Apr 2026

https://github.com/ahmednurabdii/data-analytics-portfolio-superstore

My first portfolio project showcasing data cleaning, analysis, and visualization of Superstore sales data.

data-analysis data-visualization jupyter-notebook matplotlib numpy pandas portfolio-project python sales-analysis scipy seaborn superstore-dataset

Last synced: 07 Apr 2026

https://github.com/ewertondrigues02/dados-do-covid-19

Exploratory analysis of COVID-19 data from 22 January 2020 to 27 July 2020 across 188 countries, carried out with tools such as Jupyter Notebook, Google Colab, Excel, and Power BI, and with the libraries Pandas, Matplotlib, and NumPy.

analise-exploratoria colab covid-19 covid19 dataframe folium folium-maps folium-python google-colab jupyter jupyter-notebook numpy pandas python3

Last synced: 02 May 2026

https://github.com/quantum-software-development/qiskit

qiskit circuits on IBM Quantum

numpy qbits-circuits quiskit

Last synced: 30 Jul 2025

https://github.com/nirmalyabag20/wine-quality-prediction-machine-learning

This project analyzes the chemical properties of wines to identify key factors influencing quality. By leveraging machine learning techniques, I aim to develop predictive models that accurately classify wine quality, providing valuable insights for producers and enthusiasts alike.

k-neighbors-classifier logistic-regression machine-learning matplotlib numpy pandas python random-forest seaborn svc

Last synced: 09 Apr 2026

https://github.com/melihcanndemir/3d-fractal-explorer

Interactive 3D visualization tool for exploring the mesmerizing world of Mandelbrot and Julia fractals. Built with Python, OpenGL and PyQt5, offering real-time animation and intuitive controls.

3d-graphics complex-numbers computer-graphics data-visualization educational fractal graphics-programming interactive julia-set mandelbrot math-visualization mathematical-art mathematics numpy opengl pyqt pyqt5 python scientific-visualization visualization

Last synced: 30 Jul 2025

https://github.com/upul/indi

Small Machine Learning library written in Python

algorithms machine-learning numpy python scipy

Last synced: 10 Apr 2026

https://github.com/xaxm007/nn-numpy

A Basic Logistic Regression Neural Network using NumPy and calculus.

critical-indulgence logistic-regression neural-network numpy study

Last synced: 19 May 2026
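
For readers curious what "logistic regression using NumPy and calculus" can look like, here is a minimal sketch. It is not taken from the repository above; the function names `sigmoid` and `train_logistic`, the toy data, and the hyperparameters are all assumptions chosen for illustration. The gradients come from differentiating the mean cross-entropy loss with respect to the weights and the bias.

```python
import numpy as np

def sigmoid(z):
    """Logistic function mapping real values into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def train_logistic(X, y, lr=0.5, epochs=5000):
    """Fit weights and bias with plain batch gradient descent (hypothetical helper)."""
    n_samples, n_features = X.shape
    w = np.zeros(n_features)
    b = 0.0
    for _ in range(epochs):
        p = sigmoid(X @ w + b)              # predicted probabilities
        grad_w = X.T @ (p - y) / n_samples  # d(loss)/dw for mean cross-entropy
        grad_b = np.mean(p - y)             # d(loss)/db
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Toy, linearly separable data: label is 1 when x is large.
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])

w, b = train_logistic(X, y)
preds = (sigmoid(X @ w + b) >= 0.5).astype(int)
```

On this toy data the learned decision boundary falls between x = 1 and x = 2, so `preds` matches the labels.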

https://github.com/santiagoenriquega/custom_ensemble_learning

This repository focuses on building a random forest classifier and regressor as well as a gradient boosted regressor, building them from scratch using only NumPy for faster array processing.

numpy python python3 random-forest random-forest-classifier random-forest-regressor

Last synced: 15 May 2026

https://github.com/xaxtric7/learn-numpy

🧮 Welcome to my NumPy Practice repository! This repo contains my hands-on learning, notes, and code experiments as I dive into the fundamentals and advanced features of NumPy – a powerful library for numerical computing with Python.

numpy pyrhon3

Last synced: 15 May 2026

https://github.com/mr-chang95/ibm_recommendation_system

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

experimental-design ibm-watson machine-learning numpy pandas recommendation-engine recommendation-system udacity-data-science-nanodegree

Last synced: 20 Apr 2026

https://github.com/andystmc/nextflownyc

Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting

Last synced: 07 Apr 2026

https://github.com/dulajkavinda/live-sketching

✍️ Converting a live video stream from a webcam into a sketch using OpenCV and Keras

added image-filtering keras machinelearning matplotlib numpy opencv python

Last synced: 10 Apr 2026

https://github.com/nirmalyabag20/book-recommended-system

A system built using content-based and collaborative filtering algorithms to suggest books. It displays the top 50 books and provides recommendations based on a given book name. Includes functionality to view detailed information about a recommended book, along with related books.

flask html jupyter-notebook machine-learning matplotlib numpy pandas pycharm python seaborn

Last synced: 13 Apr 2026

https://github.com/csirmaz/hypergeometry

Geometric tools for an arbitrary number of dimensions

4d cgi geometry numpy python raytracing

Last synced: 17 May 2026

https://github.com/syedaanif/ds-ml-pyhton

This repo contains learnings about Data Science and Machine Learning using Python

artificial-intelligence data-science machine-learning matplotlib numpy pandas plotly python seaborn

Last synced: 07 Apr 2026

https://github.com/vidhi1290/text-classification-model-with-attention-mechanism-nlp

This Python project utilizes PyTorch to perform text classification with an attention mechanism. Pre-trained GloVe embeddings are processed for word representation, and a custom attention model is trained on consumer complaint data to categorize complaints into product categories.🎯

attention-mechanism deeplearning machine-learning nlp nltk numpy pandas python pytorch scikit-learn text-classification tqdm

Last synced: 06 Apr 2026

https://github.com/rasbeetech/loan-calculator

Tkinter framework application

matplotlib numpy python tkinter

Last synced: 20 Mar 2025

https://github.com/thedevreda/python

My Full Backup Packages & Files Guide for Python 👷‍♂️

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/huichuanli/play-with-linear-algebra

Use Python 3 and NumPy to implement my personal linear algebra library

numpy python3 scipy

Last synced: 07 May 2026

https://github.com/painterqubits/datalogs

Python package for logging data from scientific experiments.

json netcdf numpy python xarray

Last synced: 17 Feb 2026

https://github.com/xmcp/waver

Wave file generator

numpy tkinter wavefile waveform

Last synced: 17 May 2026

https://github.com/arpandatta011/facial_image_recognition

This project introduces facial expression recognition and surveys recent research to identify an effective and efficient method for it.

cv2 keras matplotlib numpy pandas tensorflow

Last synced: 09 Apr 2026

https://github.com/hansalemaos/cythoncartesian2

Cartesian Product for NumPy - 40x faster than NumPy + itertools.product

algorithm cartesian cartesian-product cython fast numpy

Last synced: 17 May 2026
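
For context on the entry above, the sketch below shows one plain-NumPy way to build a Cartesian product and checks it against `itertools.product`. It is a baseline illustration only, not the repository's Cython implementation, and the helper name `cartesian_product` is an assumption.

```python
import itertools
import numpy as np

def cartesian_product(*arrays):
    """Return every combination of the 1-D inputs as rows of a 2-D array."""
    # "ij" indexing keeps the first array as the slowest-varying column,
    # which matches the ordering of itertools.product.
    grids = np.meshgrid(*arrays, indexing="ij")
    return np.stack(grids, axis=-1).reshape(-1, len(arrays))

pairs = cartesian_product(np.array([1, 2]), np.array([10, 20, 30]))
reference = np.array(list(itertools.product([1, 2], [10, 20, 30])))
```

Here `pairs` has one row per combination (six rows, two columns) in the same order as `reference`.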

https://github.com/willie-conway/data-science-tools-and-ecosystem

🧠 Explore essential Data Science 🧪 tools, 🧰 libraries, and 🐍 Python arithmetic basics — perfect for beginners!

data-science jupyter keras notebooks numpy pandas python pytorch

Last synced: 27 Jul 2025

https://github.com/sumitgirwal/drinks-dataset-data-analysis

A drinks dataset from Kaggle. Applying analysis using Python, NumPy, Pandas, Matplotlib.

dataanalysis drinks-dataset kaggle matplotlib numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/kentlouisetonino/sw-statistics-calculator

A CLI program that computes the mean, median, and percentiles.

numpy os python3 statistics

Last synced: 29 Mar 2025
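
The statistics named in the entry above can be computed with a few NumPy calls. The following is a minimal sketch, not the repository's code: the function name `summarize`, its default percentiles, and the sample data are assumptions. `np.percentile` uses linear interpolation between data points by default.

```python
import numpy as np

def summarize(values, percentiles=(25, 50, 75)):
    """Return the mean, median, and requested percentiles of a sequence."""
    data = np.asarray(values, dtype=float)
    return {
        "mean": float(np.mean(data)),
        "median": float(np.median(data)),
        "percentiles": {p: float(np.percentile(data, p)) for p in percentiles},
    }

stats = summarize([2, 4, 4, 5, 7, 9])
```

A command-line wrapper would only need to parse the numbers from its arguments and print the resulting dictionary.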

https://github.com/rex-8/mnist-4-layer

Implementing a 4-layer neural network to identify digits from a 28x28 grid using just Python and NumPy

backward-propagation deep-learning forward-propagation loss-functions machine-learning mnist-classification neural-network numpy python relu-network

Last synced: 15 May 2026

https://github.com/tasninanika/heart-disease-analysis

The Heart Disease Analysis project is a comprehensive machine learning study aimed at predicting the presence of heart disease using the Heart Disease UCI Dataset.

knn logistic-regression matplotlib numpy pandas python3 random-forest scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/khaledkamr/image-classification

A deep learning project using CNNs to classify images of fruits and vegetables, built with TensorFlow and Keras, showcasing image recognition and model training.

cnn-classification jupyter-notebook keras numpy python streamlit tensorflow

Last synced: 07 Apr 2026

https://github.com/mthiboust/array2image

Converts a NumPy array to a PIL image.

array data-visualization image numpy pillow plotting python

Last synced: 15 May 2026

https://github.com/doondi30/ghiblify-animeganv3

A creative tool that generates beautiful Ghibli-style artwork from your image input.

colab-notebook jupyter-notebook numpy onnxruntime opencv python

Last synced: 07 Apr 2026

https://github.com/udityamerit/curafind-powered-by-ai

CuraFind AI is a web-based application leveraging Natural Language Processing (NLP) to intelligently recommend medicines. Users can search using symptoms, medicine names, or free-text descriptions, and receive suggestions along with brand substitutes for drugs

ai machine-learning nlp numpy pandas scikit-learn

Last synced: 18 Sep 2025

https://github.com/lunastev/flowmodel

FlowModel is a lightweight and extensible machine learning framework designed for beginners to easily experiment with AI development through a modular plugin architecture.

ai ai-model ai-models beginner-friendly flow-model flow-models flowmodel framework lightweight machine-learning model numpy plugin-architecture python torch torchvision

Last synced: 20 May 2026

https://github.com/abhy-kumar/ahmedabad-weather-projection

This notebook tries to predict Ahmedabad's weather using historical trends

arima-forecasting numpy pandas python

Last synced: 13 Apr 2026

https://github.com/aryan-coder-student/smart-agricultural-monitering-aryanpahari037-

Increase agricultural productivity and sustainability by offering a smart farming assistant that helps farmers make data-driven decisions to optimize resource use and crop management.

hackathon-project machine-learning numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/mchenryspagg/outlier-detection-in-election-data

This project aims to examine possible voting irregularities in Nigeria's 2023 election, focusing on alleged vote manipulations by INEC. It seeks to ensure election transparency by identifying polling units with results that significantly differ from nearby units.

geographic geographical-information-system geospatial-analysis geospatial-data latitude-and-longitude matplotlib-pyplot numpy outlier-detection pandas python seaborn

Last synced: 20 Jan 2026

https://github.com/johndev19/data-visualization-tool

A Python-based application for generating various types of charts and graphs from user-input data.

charts matplotlib numpy python visualization

Last synced: 17 May 2026

https://github.com/haidermalikk/neural-network-circuit-optimizer

The Neural Network Circuit Optimizer, built on my custom digital circuit simulator, uses AI and machine learning to optimize simple digital circuits. It leverages neural networks, circuit simulation, and optimization algorithms to reduce component count, which improves power usage, execution time, and efficiency. *Under development; see README.*

ai jupyter-notebook matplotlib ml numpy pandas python pytorch sqlalchemy tensorflow

Last synced: 10 Apr 2026

https://github.com/msthamizh/singapore-resale-flat-prices-predicting

Developing a Streamlit application that allows users to interactively predict the resale prices of flats in Singapore using machine learning. The application supports regression tasks, enabling users to input various features, such as location, flat type, floor area, and lease start date, and receive predictions for resale prices.

exploratory-data-analysis machine-learning matplotlib numpy pandas python seaborn streamlit

Last synced: 09 Apr 2026

https://github.com/jaffar-hussein/python-pdb-parser-and-analysis

A Python-based parser and analysis tool for Protein Data Bank (PDB) files. The project aims to facilitate the manipulation of PDB files, enabling users to extract and compute various structural properties of proteins, such as counting amino acid residues and calculating distances between atoms.

jupyter-notebook matplotlib numpy pdb pdb-parser python3 seaborn

Last synced: 10 May 2026

https://github.com/zensam7/easy_my_ai

A library for building AI, written ENTIRELY from scratch

genetic-algorithm library machine-learning neural-networks numpy q-learning

Last synced: 14 Feb 2026

https://github.com/chaitanya1436/student_performance_analysis

A project focused on analyzing college student performance using data on department, assessment scores, and performance labels. Implemented in Google Colab, the analysis includes data preprocessing, feature scaling, and exploratory data analysis to uncover insights and prepare the data for further analysis or modeling.

ata-preprocessing data-preparation exploratory-data-analysis feature-scaling google-colab numpy pandas scikit-learn

Last synced: 07 Feb 2026

https://github.com/uhstray-io/pyrizon

Data Collection, Analysis, Mapping, Pipelining & Transformation, & API using Python

api data-engineering etl numpy pandas plotly python pytorch raw-data scikit-learn seaborne sql sqlite tensorflow

Last synced: 09 Apr 2026

https://github.com/sourabh-kumar04/numpy-basic

Numpy-Basic is a structured learning repo covering NumPy from basics to advanced. It includes arrays, indexing, reshaping, filtering, vector ops, angle functions, stats, and .npy file handling. Each concept is explained with code, examples, and Matplotlib visualizations in both light and dark modes. Ideal for students and data learners.

data-analysis data-science data-visualization learning learning-resources machine-learning matplotlib numerical-computing numpy python python-library python-programming

Last synced: 10 May 2026

https://github.com/sumitgirwal/diabetes-linear-regression-ml

A simple diabetes machine learning test model, built with linear regression in Python using Matplotlib, NumPy, and sklearn. The diabetes dataset is built into the sklearn library. You first train the model, then test it to see how closely the output approximates the expected result, and adjust the model if needed.

linear-regression matplotlib numpy python3 sklearn

Last synced: 18 Apr 2026

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/upul/chocolate-quality-analysis

This repository contains a Jupyter notebook that describes how to use basic machine learning tools such as scikit-learn, Pandas, and NumPy for building models.

machine-learning numpy pandas predictive-analytics scikit-learn

Last synced: 04 May 2026

https://github.com/octcarp-courses/sustech_cs303-ai_f23_works

(49.8/50) Course code of CS303 2023 Fall: Artificial Intelligence, SUSTech. Taught by Prof. Ke TANG.

numpy pytorch sustech

Last synced: 08 May 2026

https://github.com/iamshatha/disease-symptom-analysis

The data file defines the relationship between different symptoms and possible diseases

analysis dataset disease matplotlib numpy python symptoms

Last synced: 07 May 2026

https://github.com/intelpython/mkl_umath

Package implementing NumPy's UFuncs based on SVML and MKL VML

mkl numpy python

Last synced: 22 Jun 2025

https://github.com/gavineshwar/assignment_plagarism_checker

Assignment Plagiarism Checker

fitz flask html-css numpy python

Last synced: 02 Feb 2026

https://github.com/ansub/mask-detection

A Mask Detection Application using Keras/TensorFlow and OpenCV.

keras numpy opencv python tensorflow

Last synced: 10 Apr 2026

https://github.com/mousazourob/stockdataresampler

Python script that resamples an instrument's minute-by-minute candlestick data to any interval

alpha-vantage-api numpy pandas python

Last synced: 07 May 2026

https://github.com/nehalmr/qiksit-fall-fest-hackathon-2024

The Qiskit Hackathon is a unique opportunity for students to dive into the fascinating world of quantum computing. Participants will collaborate in teams to tackle a series of challenging problems that test their creativity, problem-solving skills, and understanding of quantum algorithms. This hackathon aims to inspire innovation in QC.

cpython hackathon ibm ibm-qiskit ibm-quantum-challenge matplotlib numpy python python3 qbits qiskit qiskit-tutorial quantum-algorithms quantum-computing quantumcircuits quantumregister

Last synced: 11 May 2026

https://github.com/dhakalnirajan/python-for-research

“Python for Academic Research” is a program launched by NRCC to provide short training in the python programming language. This course will teach students to use python in research and implement such learnings in their practical classes. Mr. Kshitiz Regmi will be teaching the course.

data-science numpy pandas plotly python python3

Last synced: 21 Mar 2025

https://github.com/akku-1206/zomato_scrapping

Zomato Restaurant Data Scraper. This script uses Selenium to scrape restaurant information from the Zomato NCR delivery page.

chromewebdriver numpy pandas python selenium

Last synced: 04 May 2026

https://github.com/yogeshnile/flight-delay-prediction

In this repo I developed a flight delay predictor using machine learning.

flight-delay-prediction jupyter-notebook machine-learning numpy pandas python3 seaborn sklearn

Last synced: 13 Feb 2026

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project that develops a classification model for assigning a given customer profile to one of two risk categories (safe or not safe). The final classifier used for this project is the CatBoost classifier. Deployed on AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026

https://github.com/shriram-vibhute/data-analysis

This repository offers a comprehensive collection of data analysis techniques using NumPy, Pandas, Matplotlib, and Seaborn.

data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn

Last synced: 02 Aug 2025

https://github.com/Shriram-Vibhute/RealEstateForecast-Price-Prediction-System

This project aims to predict house prices. It involves several key stages, including data preprocessing, feature engineering, model selection, and evaluation. The goal is to develop a model that provides accurate and reliable price predictions based on the given features.

exploratory-data-analysis feature-engineering feature-transformation hyperparameter-tuning machine-learning matplotlib numpy pandas python random-forest sklearn

Last synced: 02 Apr 2025

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent Project - Kaggle Dataset: I worked on the California Housing dataset, performing data cleaning and preparation, exploratory data analysis, feature engineering, regression model building, and model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 08 Apr 2026

https://github.com/moeeinaali/ce282-la

Solutions to Dr. Rabiee's CE282: Linear Algebra Course (Sharif University of Technology - Spring 2024)

linear-algebra matplotlib numpy pandas python

Last synced: 13 Apr 2026