An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/jonolav95/advent_of_code_2022

Using Python with Pandas and Numpy (www.adventofcode.com).

numpy pandas

Last synced: 06 May 2026

https://github.com/aleksibovellan/ai-2048-game

AI Heuristics in Python for solving the 2048 Game

2048 2048-game 2048-solver ai expectimax game heuristics numba numpy python python3

Last synced: 06 May 2026

https://github.com/voorhs/ml-practice

ML algorithms implementation

aimasters lomonosov-msu machine-learning numpy python

Last synced: 07 May 2026

https://github.com/sriyaroy/coding-challenges

50 Machine Learning & Computer Vision challenges ranging from Easy to Hard to sharpen your ML engineering skills.

coding-challenges computer-vision interview-prep machine-learning numpy python pytorch

Last synced: 07 May 2026

https://github.com/dulajkavinda/breast-cancer-detect

🧬 breast cancer classification using machine learning

added machinelearning matplotlib numpy pandas python

Last synced: 07 May 2026

https://github.com/ai-ahmed/bda-and-ds

Dual Course: EMC2 Data Scientist & Big Data Analytics & IBM Professional Data Science Course

analytics big-data data-science jupyter-notebook jupyterlab linear-algebra mathematics matplotlib ml numpy pandas probability seaborn sklearn statistics

Last synced: 07 May 2026

https://github.com/matteopolak/chatterbox

RNN-based chat message generator.

numpy pandas python rnn tensorflow text-generation

Last synced: 07 May 2026

https://github.com/ayushmankumar7/siamese-network-with-one-shot-learning

Implementation of this paper - https://www.cs.cmu.edu/~rsalakhu/papers/oneshot1.pdf

numpy tensorflow tf-keras

Last synced: 07 May 2026

https://github.com/ebadshabbir/bluff-detection-model-polynomial_regression-

This project predicts employee salaries based on position levels using Linear and Polynomial Regression models. It trains models with degrees 2, 3, and 4 on a dataset of job titles, position levels, and salaries, and predicts the salary for a position level of 6.5. The results are visualized to compare model performance.

jupyter-notebook linear-regression machine-learning matplotlib-pyplot multiple-linear-regression numpy polynomial-regression python

Last synced: 07 May 2026

https://github.com/hackintoshrao/first-neural-network

Simple neural network with a hidden layer, written from scratch using NumPy, for predicting bike-sharing sales

deep-learning neural-networks numpy python

Last synced: 07 May 2026

https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis

This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends and consumer adoption to understand the evolving landscape of the EV industry.

analysis data jupyter-notebook matplotlib numpy python seaborn vscode

Last synced: 07 May 2026

https://github.com/pereira-elizabeth/aah-2d-hermitian

Minimal 2D AAH model: build, diagonalize, and plot spectrum (NumPy/Matplotlib)

condensed-matter numpy physics scipy tight-binding

Last synced: 07 May 2026

https://github.com/kavayk29/audio-classification-using-python-library

An audio classification project using Python libraries: librosa to produce visual representations of the audio files, and NumPy to build arrays of data for manipulation and feature extraction. The extracted features are used to train and test a CNN classification model.

librosa matplotlib-pyplot mfcc-features numpy pandas sklearn-library

Last synced: 07 May 2026

https://github.com/christabelsakyi/sentiment_analysis

This project involves analyzing customer reviews to classify them as positive or negative using Logistic Regression. The workflow includes text preprocessing, feature extraction, training a model, making predictions, and evaluating its performance.

imblearn machine-learning nltk numpy python sklearn

Last synced: 07 May 2026

https://github.com/aimin-nur/dataanalyst-cohortanalysis

A data analyst project to understand customer purchasing patterns over a given time period (Cohort Analysis).

cohort-analysis numpy pandas pytz

Last synced: 07 May 2026

https://github.com/bettenoch/yolo_real_time_object_detector

The YOLO Real-Time Object Detector leverages the ultralytics package to provide robust object detection capabilities. This project is designed to identify and classify objects in images, video streams, and live webcam feeds with high accuracy and efficiency.

numpy object-detection object-tracking pandas python3 real-time-processing ultralytics yolov8

Last synced: 07 May 2026

https://github.com/4strium/predict-french-tv-audience

🎬 A beginner's machine learning project whose aim is to predict the audience for films on TV in France, based on previous data provided by the CNC and Médiamétrie.

machine-learning numpy pandas python tv xgboost

Last synced: 07 May 2026

https://github.com/r-mahesh45/reccomendation-system

This project involves building a recommender system using cosine similarity to measure the similarity between items or users. It leverages numerical data to provide personalized recommendations. The model focuses on extracting meaningful insights and improving decision-making processes.

numpy pandas-dataframe python3 scikitlearn-machine-learning

Last synced: 07 May 2026

https://github.com/momo790-m/startup-profit-prediction

Machine learning project to predict profits of new startups based on R&D, Admin, Marketing, and State data

assumptions business colab-notebook linear-regression machine-learning matplotlib numpy pandas preprocessing seaborn skit-learn sklearn

Last synced: 08 May 2026

https://github.com/alain-godo/spotify-popularity

A simple analysis and model for predicting the popularity of songs on Spotify

linear-regression machine-learning numpy pandas random-forest sckiit-learn spotify tree-decision

Last synced: 08 May 2026

https://github.com/octavioduarte/datascience

A repository with study notes on Data Science concepts, NumPy, pandas, and an introduction to statistics.

data-science jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/niteshchawla/nc-walmart-casestudy

The management team at Walmart Inc. wants to analyze customer purchase behavior (specifically, purchase amount) against the customer’s gender and various other factors to help the business make better decisions. They want to understand whether spending habits differ between male and female customers.

central-limit-theorem confidence-intervals matplotlib-pyplot numpy pandas python scipy-stats seaborn stats

Last synced: 08 May 2026

https://github.com/himanshugoyal77/movie-recommendation

Cosine similarity based Recommendation Engine

nltk numpy pandas requests-library-python tmdb-api vectorization

Last synced: 08 May 2026

https://github.com/antonio-f/simple-k-nearest-neighbors

Simple K-Nearest Neighbors using partial sort

101 k-nearest-neighbors knn numpy partial-sort python tutorial

Last synced: 08 May 2026

https://github.com/egrizq/bike_analysis

Exploring the bike sharing system to find the patterns behind user behavior by weather, season, hour, and months.

data-science matplot numpy python seaborn

Last synced: 08 May 2026

https://github.com/leandro-bertoluzzi/parchis-probabilities

Small weekend project to play around with NumPy and help me make decisions while playing parchís

matplotlib numpy python

Last synced: 08 May 2026

https://github.com/sahil210695/numpy-basics

A brief overview of NumPy basics

data-science numpy numpy-tutorial python

Last synced: 08 May 2026

https://github.com/omogbolahan94/movies-budget-revenue-analysis

Data exploration, data cleaning, data analysis and predictive analysis

dataframe linear-regression matplotlib numpy pandas seaborn statistical-analysis

Last synced: 08 May 2026

https://github.com/chdl17/credit-eda-casestudy

Credit EDA is a GitHub repository designed to provide educational materials and tools for exploring credit risk data. The repository contains notebooks, datasets, and code samples to help users learn and apply these concepts to credit data.

exploratory-data-analysis numpy pandas

Last synced: 08 May 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/nova-land/kalmanfilter-python

Kalman Filter in PyTorch and Numpy

kalman-filter numpy python pytorch

Last synced: 08 May 2026

https://github.com/raghavendranhp/predictive-analytics-in-automotive-manufacturing-testing-time-optimization

Explore machine learning for automotive testing optimization. Predictive analytics to reduce testing time and environmental impact.

elasticnet extratreesregressor fastica model-pipeline numpy pandas pca randomforestregressor svr-regression-prediction xgboost

Last synced: 09 May 2026

https://github.com/jsnevt/series_temporais

Forecasting events that occur seasonally

datetime matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/sameerjain/ai-snake-game

Implemented a Q-learning agent to play and learn the classic Snake game, leveraging PyTorch for the neural network. Visualized using Matplotlib, showcasing the agent's evolving performance over time. Built a UI using Pygame to visualize the snake, the food, and the current score.

anaconda conda matplotlib neural-network numpy pygame python3 pytorch q-learning

Last synced: 09 May 2026

https://github.com/jupfu/beck-view-digitalize

Digitalize 16mm films with Python & reactivex & FT232H

cython ft232h libpng multiprocessing numpy opencv python3 reactivex um232h

Last synced: 09 May 2026

https://github.com/omogbolahan94/lego-analysis

Analysis of LEGO data from 1949 to 2021.

matplotlib merge numpy pandas

Last synced: 09 May 2026

https://github.com/aliyahscoding/tiny-recommender

Tiny item-item recommender in Python using cosine similarity over a toy user–item matrix. Simple CLI to print top-N recommendations.

collaborative-filtering cosine-similarity numpy pandas python recommender-system

Last synced: 09 May 2026

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/moonshallow5/food_vision_mini

A program that can detect whether an image contains pizza, sushi, or steak. Now deployed on Flutter :)

huggingface matplotlib numpy pytorch

Last synced: 09 May 2026

https://github.com/amr-yasser226/pagerank-power-method

Modular Python implementation of the PageRank algorithm via the power method, with discrete‑math foundations, end‑to‑end code, and experimental validation.

discrete-mathematics graph-algorithms markov-chain network-analysis networks numpy pagerank power-method python scipy

Last synced: 09 May 2026

https://github.com/aroojzahra908/machine-learning

This repository contains helpful guides and materials for easily understanding Pandas, NumPy, and Matplotlib libraries, which are essential tools for creating visual representations of data.

algorithms machine-learning-algorithms matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/abdullah2020/hamoye_stageb

This is my Hamoye Stage B project. The project focuses on predicting the energy efficiency of buildings. It implements several machine learning techniques, including but not limited to Linear Regression, LASSO, and Ridge.

eda lasso-regression linear-regression numpy pandas predictive-modeling regression-models ridge-regression rmse rsquare-values

Last synced: 09 May 2026

https://github.com/carol-neto/sprint-4-statistical-data-analysis

In this project I had the opportunity to test my knowledge by analyzing a phone plan and creating graphs to compare the plans and determine which ones generate the most revenue.

matplotlib-pyplot numpy pandas pytho scipy-stats seaborn statistical-analysis

Last synced: 09 May 2026

https://github.com/betiniakarandut/portfolio-project

Sukkar and Cornell Integral Approach to Estimate Static Bottom Hole Pressure (SBHP)

numpy pandas python3

Last synced: 09 May 2026

https://github.com/vviveksharma/barcodescanner

A barcode scanning application built with the OpenCV computer vision library in Python. The main objective of the project is to read a barcode shown to the camera, decode it, and check whether the attached database or text file contains the decoded number as permitted. If it does, the application shows "Permitted" in green; otherwise it shows "Not-Permitted" in red.

numpy opencv python3

Last synced: 10 May 2026

https://github.com/1adityakadam/computer-vision-approach-for-quality-inspection-of-steel-sheets

A machine learning and computer vision project that improves steel defect inspection. A segmentation technique partitions steel sheet images into regions, enabling better defect analysis. U-Net is used for segmentation, and MobileNetV2 for classification. Performance is evaluated with the Dice Coefficient.

computer-vision machine-learning numpy pandas unet-image-segmentation

Last synced: 10 May 2026

https://github.com/sahaavi/color-detection

Color detection is the process of detecting the name of any color. For humans this is an extremely easy task, but for computers it is not straightforward. Human eyes and brains work together to translate light into color. Light receptors in our eyes transmit the signal to the brain, which then recognizes the color. Since childhood, we have mapped certain lights to their color names. We will use a somewhat similar strategy to detect color names.

color-detection color-detector numpy opencv-python pandas

Last synced: 10 May 2026

https://github.com/rebeccamorolong/market--basket--analysis

Market Basket Analysis using the Apriori algorithm to predict association rules and frequent itemsets from transactional data. The goal was to uncover product combinations that frequently co-occur in transactions, which can be used for recommendations, promotions, and inventory management.

apriori-algorithm numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/esoltys/climate_data_viz

FastAPI-based web application that fetches and visualizes climate data from the Copernicus Climate Data Store (CDS).

apache-echarts copernicus-climate-data-store fastapi numpy

Last synced: 10 May 2026

https://github.com/prakashjha1/email-sms-spam-classifier

Many email services today provide spam filters that are able to classify emails into spam and non-spam email with high accuracy.

machine-learning natural-language-processing numpy pandas python

Last synced: 10 May 2026

https://github.com/m1-elmasry/quickmark

Efficient OMR software designed for quick and precise grading of bubble sheet exams.

bubble-sheet image-processing numpy opencv python

Last synced: 10 May 2026

https://github.com/shubhansu-kr/int213-python-programming

Python Programming | Semester 3 | Python | OOPS | Tkinter | Pandas | Numpy

lpu numpy oops-in-python pandas python sem3 tkinter-gui

Last synced: 10 May 2026

https://github.com/tnleite/real-estate-opportunities-analysis

This repository presents an analysis of opportunities in the real estate market, combining time series, clustering, and forecasting to identify states with the greatest growth potential and guide efficient expansion strategies.

catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression

Last synced: 10 May 2026

https://github.com/jsnevt/regressao_logistica

Probability calculations for an election

matplotlib numpy pandas-dataframe python

Last synced: 11 May 2026

https://github.com/monarch1108/customerinsights-kmeans

Understanding customers using KMeans and RFM (recency, frequency, and monetary) analysis

data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn

Last synced: 11 May 2026

https://github.com/baschin1103/machine-learning-linear-regression-tsi

The model is a multiple linear regression from scikit-learn. The aim is to forecast the so-called Total Skill Index (TSI) of a player on hattrick.org. More information: https://wiki.hattrick.org/wiki/Total_Skill_Index. This repository contains the program and the necessary data.

csv linear-regression machine-learning numpy pandas python statistics

Last synced: 11 May 2026

https://github.com/parthds02/customer-segmentation-with-kmeans-clustering

Analyze customer behavior using Python and KMeans Clustering on transactional data. Features RFM analysis, data cleaning, clustering insights, and actionable visualizations to support business decision-making.

data-analysis data-visualization feature-engineering kmeans-clustering numpy pandas vscode

Last synced: 11 May 2026

https://github.com/bhuvilol/objdetector_webcam_and_video

Object Detection from Webcam input or video input

cvzone numpy opencv python3 yolov8

Last synced: 11 May 2026

https://github.com/khalidelboray/image_processing

Notes about image processing

image-processing numpy python3

Last synced: 11 May 2026

https://github.com/daksh90a/wine-quality-analysis

The Wine Quality Analysis project is an AI/ML-based data analysis initiative aimed at predicting and understanding the factors that influence the quality of wine.

matplotlib-python numpy pandas seaborn

Last synced: 12 May 2026

https://github.com/chandkund/movie-recommender-system

Unlocking the magic of personalized movie recommendations! This project uses the Cosine Similarity Algorithm to suggest movies based on user-selected titles, making movie discovery more enjoyable.

cosine-similarity matplotlib nltk numpy pandas-python seaborn

Last synced: 12 May 2026

https://github.com/jsnevt/knn_neighborhood

ML - neighborhood-based classification

kneighborsclassifier matplotlib numpy pandas seaborn

Last synced: 13 May 2026

https://github.com/abideen-olawuwo/gender-detection

Gender Detection app

cv cv2 numpy

Last synced: 14 May 2026

https://github.com/khushi130404/gradient_realm

Gradient_Realm is a Python project exploring regression techniques and optimization methods like Regularization, Batch, Stochastic, and Mini-batch Gradient Descent. It uses scikit-learn and custom implementations for hands-on learning.

batch-gradient-descent mini-batch-gradient-descent numpy stochastic-gradient-descent

Last synced: 14 Jun 2026

https://github.com/tomdewildt/interactive-and-explainable-ai-design

Code for The Interactive And Explainable AI Design course of my master's degree

jupyter lime numpy pandas python scikit-learn shap

Last synced: 18 Jun 2026

https://github.com/jayemscript/lab-to-code

A complete Python learning roadmap for scientists and researchers — covering data science, biology, chemistry, physics, and mathematics with curated libraries, tools, and resources.

bioinformatics chemistry data-science jupyter-notebook machine-learning mathematics numpy pandas physics python research roadmap scientific-computing scikit-learn

Last synced: 19 Jun 2026

https://github.com/huzaifa12466/neuralnetscratch

A simple neural network implemented from scratch in Python using NumPy, trained on the Moons dataset

artificial-intelligence artificial-neural-networks backpropagation deep-learning forwardpropagation machine-learning matplotlib neural-network neural-networks numpy

Last synced: 23 Jun 2026

https://github.com/no0bitah/pdf-highlight-extractor

A Python tool for extracting highlighted text from PDF files while preserving formatting attributes (headers, bold, italic) and removing unwanted line breaks and page breaks. Perfect for integrating with content management systems.

automation crm documentation-tool numpy opencv pdf pdf-document-processor pillow pymupdf pypdfium2 python3 scrapping

Last synced: 23 Jun 2026

https://github.com/micmurawski/cloud-array

cloud-array is an open-source Python library for storing and streaming large NumPy arrays on local file systems and major cloud providers' CDNs.

aws azure big-data bigarray blob-storage cloud data-structures digitalocean-spaces gcp gcp-cloud-storage ibm-cloud-object-storage numpy s3 stream-processing streaming zadara

Last synced: 12 Aug 2025

https://github.com/sanchariii/health-assist-1

Health Assist web app uses the Streamlit framework to help users identify potential diseases. Simply input your symptoms, and the system will predict whether you have diabetes, heart disease, or Parkinson's disease. The system is designed to be quick and convenient, promoting early detection and timely medical intervention.

numpy pandas pickle streamlit-webapp svm-model

Last synced: 13 Aug 2025

https://github.com/muhkartal/xai_dashboard

An interactive AI dashboard for machine learning model analysis and explainability. It supports model training, dataset exploration, feature importance analysis, and SHAP-based explanations for both individual predictions and overall model behavior. Users can compare multiple models, visualize insights, and export results seamlessly.

joblib numpy pandas python sckit-learn shap streamlit xgboost

Last synced: 04 May 2026

https://github.com/mani-prakash-n-r/book_recommendation_system_using_hybrid_technique

The Book Recommendation System uses collaborative and content filtering techniques to suggest books based on user preferences and content similarity.

csv html numpy pandas python sckiit-learn

Last synced: 09 Apr 2026

https://github.com/shoaib1522/credit-card-fraud-analysis

"A comprehensive project on Credit Card Fraud Detection combining Exploratory Data Analysis, Machine Learning, and an interactive Streamlit web app for real-world applicability."

eda epochs mlp-classifier neural-network numpy pandas-python perceptron-learning-algorithm python regression sckiit-learn seaborn streamlit streamlit-webapp tensorflow

Last synced: 09 Apr 2026

https://github.com/kmkurn/text2array

Python library to convert text dataset into arrays.

deep-learning natural-language-processing neural-networks numpy

Last synced: 14 May 2026

https://github.com/mattmoony/ann_mnist

Simple self-written ANN powered by NumPy to classify handwritten digits of the famous MNIST Dataset. ✍️

adam ann deep-learning learning-algorithm machine-learning ml mnist mnist-handwritten-digits momentum neural-net neural-network normalization numpy regularization relu sigmoid tanh

Last synced: 09 Apr 2026

https://github.com/amandeep-gupta19/chatbot

Created a custom chatbot using Langchain. Here's a summary of what I did: Data Extraction: I gathered data about technical courses from the Brainlox website using Langchain’s URL loaders. Embedding Creation & Storage: I converted this data into embeddings and stored it in a vector store for efficient searching. API Development: I built a Flask

data-extraction faiss-vector-database flask-restful langchain numpy scikit-learn vector-database webbaseloader

Last synced: 09 Apr 2026

https://github.com/adi3042/diamond-price-prediction

🔮💎 Explore the Diamond Price Oracle! Predicting diamond values based on your preferences and market trends. Your ultimate tool for informed decisions in the diamond world. Join us for the future of pricing! 💎✨🚀 #DiamondPredictor

anaconda datetime diamond flask functools html ipykernel jupytrnotebooks matplotlib numpy pandas price-prediction readme regression sckiit-learn setuptools venv

Last synced: 09 Apr 2026

https://github.com/quarkgluant/analyse-visualisation-data-with-python

French Udemy course "Analyse et visualisation de données avec Python" on the Python libraries NumPy, pandas, and Matplotlib

matplotlib-tutorial numpy numpy-tutorial pandas pandas-tutorial python python3 udemy

Last synced: 09 Apr 2026

https://github.com/niteshchawla/netflix-case-study

Analyze the data and generate insights that could help Netflix in deciding which type of shows/movies to produce and how they can grow the business in different countries

matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026