An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/chdl17/imdb-assignment

This GitHub repository contains a project that performs an analysis of the IMDb movie rating dataset. The data is used to explore the relationships between various features and the movie rating. Visualizations are used to illustrate the results of the analysis. The code is organized into a Jupyter Notebook file and data files.

numpy pandas-dataframe python

Last synced: 08 May 2026

https://github.com/octavioduarte/datascience

A repository with study notes on Data Science concepts, NumPy, pandas, and an introduction to statistics.

data-science jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/niteshchawla/nc-walmart-casestudy

The Management team at Walmart Inc. wants to analyze the customer purchase behavior (specifically, purchase amount) against the customer’s gender and the various other factors to help the business make better decisions. They want to understand if the spending habits differ between male and female customers:.

central-limit-theorem confidence-intervals matplotlib-pyplot numpy pandas python scipy-stats seaborn stats

Last synced: 08 May 2026

https://github.com/samaalharbi2/100-days-of-code

This repository is a collection of 100 Python projects built as part of the "100 Days of Code: Python Developer Challenge" course.

data-science game-development machine-learning numpy pandas python udemy

Last synced: 08 May 2026

https://github.com/kkkaan/tensor-library-403-hw1

A numpy like library for tensor (gergen) operations.

deep-learning numpy python tensor

Last synced: 08 May 2026

https://github.com/ditikrushna/identify-customer-segments

In this project, Bertelsmann partners AZ Direct and Arvato Financial Solutions have provided two datasets one with demographic information about the people of Germany, and one with that same information for customers of a mail-order sales company. I have looked at relationships between demographics features, organized the population into clusters, and saw how prevalent customers are in each of the segments obtained.

clustering clustering-algorithm numpy pca

Last synced: 08 May 2026

https://github.com/diegodscamara/whisperlive

🎙️ WhisperLive: Real-time audio transcription powered by OpenAI's Whisper model. Convert live speech to text with high accuracy, supporting multiple languages and real-time processing. Perfect for accessibility, content creation, and live captioning.

ai blackhole numpy python wave whisper

Last synced: 09 May 2026

https://github.com/mattkevan/data-science

Notes from learning various data science topics.

data-science matplotlib numpy python3

Last synced: 09 May 2026

https://github.com/ishaj72/analysis_model2

Dataset analysis model using matplotlib and other libraries

machine-learning matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/khaymanii/credit-card-fraud-detection-model

This model was built using python and Logistic Regression Machine Learning algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 09 May 2026

https://github.com/carol-neto/sprint-4-statistical-data-analysis

In this project I had the opportunity to test my knowledge by analyzing a phone plan and creating graphs to compare the plans and determine which ones generate the most revenue.

matplotlib-pyplot numpy pandas pytho scipy-stats seaborn statistical-analysis

Last synced: 09 May 2026

https://github.com/arjunravi26/selenium

This repository contains code from my learning about Selenium for web scraping and automation.

numpy python selenium time undetected-chrome webdriver

Last synced: 09 May 2026

https://github.com/vadimkantorov/fasttsv

TSV parser for Python in pure vectorized NumPy code

numpy parsing simd tsv vectorized-code

Last synced: 09 May 2026

https://github.com/faisal-khann/python-practice

This is the section for python practice, where we engage in programming exercises to improve our Python skill.

numpy numpy-arrays pandas python

Last synced: 10 May 2026

https://github.com/sahaavi/color-detection

Colour detection is the process of detecting the name of any color. Well, for humans this is an extremely easy task but for computers, it is not straightforward. Human eyes and brains work together to translate light into color. Light receptors that are present in our eyes transmit the signal to the brain. Our brain then recognizes the color. Since childhood, we have mapped certain lights with their color names. We will be using the somewhat same strategy to detect color names.

color-detection color-detector numpy opencv-python pandas

Last synced: 10 May 2026

https://github.com/jupfu/beck-view-movie

Assemble mp4 video from list of images

multithreading numpy opencv python3 threadpool tqdm

Last synced: 10 May 2026

https://github.com/jjerphan/joml

A minimalist numpy-baked Neural Network API

deep-learning fun minimalist neural-network neural-networks numpy python wheel-implementing

Last synced: 10 May 2026

https://github.com/m1-elmasry/quickmark

Efficient OMR software designed for quick and precise grading of bubble sheet exams.

bubble-sheet image-processing numpy opencv python

Last synced: 10 May 2026

https://github.com/andersoncrs/aprendizaje_no_supervisado_kmeans_customers

Este repositorio contiene un análisis de datos de clientes de un centro comercial utilizando técnicas de aprendizaje no supervisado, específicamente K Means y clustering jerárquico. El objetivo del proyecto es segmentar a los clientes en grupos homogéneos para entender mejor sus comportamientos y características.

data-analysis kmeans-clustering matplotlib numpy seaborn visualization

Last synced: 10 May 2026

https://github.com/susanketsarkar/nn-from-scratch

A pure NumPy implementation of a deep neural network, built for educational purposes and deep learning understanding. This project implements a multi-layer neural network with various features commonly found in modern deep learning frameworks.

deep-learning implementation-of-algorithms neural-network numpy python

Last synced: 10 May 2026

https://github.com/tnleite/real-estate-opportunities-analysis

Este repositório apresenta uma análise de oportunidades no mercado imobiliário, combinando séries temporais, clusterização e previsões para identificar estados com maior potencial de crescimento e orientar estratégias de expansão eficientes.

catboostregressor cluster-analysis data-science kmeans-clustering lightgbm-regressor machine-learning-algorithms numpy regression-models scikit-learn xgboost-regression

Last synced: 10 May 2026

https://github.com/gatuno1/details_meson_numpy_error

Documents for exploration of meson error on numpy installation over PyPy on Windows

meson numpy pypy unicode windows

Last synced: 11 May 2026

https://github.com/chaganti-reddy/heart_disease_prediction

Machine Model creation for Heart Disease Prediction

data-science deep-learning machine-learning numpy python

Last synced: 11 May 2026

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 May 2026

https://github.com/khalidelboray/image_processing

Notes about image processing

image-processing numpy python3

Last synced: 11 May 2026

https://github.com/chandkund/movie-recommender-system

Unlocking the magic of personalized movie recommendations! This project uses the Cosine Similarity Algorithm to suggest movies based on user-selected titles, making movie discovery more enjoyable.

cosine-similarity matplotlib nltk numpy pandas-python seaborn

Last synced: 12 May 2026

https://github.com/jbizzlefoshizzle/weather_trends_vs_latitude

Pandas project analyzing possible correlation between latitude and max. temperature, wind speed, cloudiness, and humidity

json-api jupyter-notebook matplotlib-pyplot numpy weather-information

Last synced: 13 May 2026

https://github.com/asuquoaa/big_4_sports_teams_and_city_population_analysis-2018-

Analysis of sports teams' win/loss ratios vs. metro area populations across NFL, NBA, MLB, and NHL.

data-cleaning-and-preprocessing numpy pandas

Last synced: 13 May 2026

https://github.com/claby2/ascii-generator

Python program to convert images into ASCII art.

ascii numpy optparse pillow python

Last synced: 13 May 2026

https://github.com/jayemscript/lab-to-code

A complete Python learning roadmap for scientists and researchers — covering data science, biology, chemistry, physics, and mathematics with curated libraries, tools, and resources.

bioinformatics chemistry data-science jupyter-notebook machine-learning mathematics numpy pandas physics python research roadmap scientific-computing scikit-learn

Last synced: 19 Jun 2026

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning—it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026

https://github.com/huzaifa12466/neuralnetscratch

A simple neural network implemented from scratch in Python using NumPy, trained on the Moons dataset

artificial-intelligence artificial-neural-networks backpropagation deep-learning forwardpropagation machine-learning matplotlib neural-network neural-networks numpy

Last synced: 23 Jun 2026

https://github.com/charlenry/python_math_machine_learning

Mes notebooks de travaux pratiques sur Python, NumPy, SimPy, SciPy, Matplotlib, Plotly, Seaborn et les Mathématiques pour le Machine Learning

algebra derivatives functions jupyter linear matplotlib matrix numpy plotly probabilities pyplot python pytorch scipy seaborn sklearn statistics sympy tensorflow time

Last synced: 25 Jun 2026

https://github.com/huggon1/ml-algorithm-implementations

Educational implementations for ML, DL, LLM blocks, ViT, and CUDA.

cuda machine-learning numpy pytorch vision-transformer

Last synced: 28 Jun 2026

https://github.com/akashash01/support_vector_machine

It's an linear model for classification and regression problems. It can solve non-linear problems by creating a hyperplane which separates the data into classes using KERNAL .

data-science datavisualization machine-learning numpy pandas python sklearn svm-classifier

Last synced: 13 Apr 2026

https://github.com/ejw-data/python-bitwise-operators

Difference between bitwise and logical operators when using pandas

numpy pandas

Last synced: 08 May 2026

https://github.com/christabelsakyi/product_purchase_prediction

As a newly hired AI Engineer, your task is to predict customer behavior based on various features such as age, income, and gender. This exercise involves cleaning the data, training a decision tree model, and evaluating the model's performance to understand the key factors influencing customer purchasing decisions.

machine-learning matplotlib-pyplot numpy python seaborn sklearn

Last synced: 08 May 2026

https://github.com/umer-farooq-cs/california-housing-regression

Linear Regression experiments on the California Housing dataset across five phases, using NumPy and scikit-learn only (no pandas). Includes EDA, polynomial features, SGD with scaling, residuals, 5-fold CV, and an LNCS-style report with figures.

california-housing cross-validation deep-learning-for-perception linear-regression numpy polynomial-features reproducible-research residual-analysis sgd sklearn

Last synced: 09 May 2026

https://github.com/raveenaross/bulldozer-price-prediction

Predicting the sale price of bulldozers using machine learning

matplotlib numpy pandas sklearn time

Last synced: 07 Oct 2025

https://github.com/pedasoft-consult/employee-attrition-prediction

This project implements a machine learning pipeline for predicting employee attrition using HR data. The system analyzes various factors that contribute to employee turnover and provides predictions to help HR departments take preventive measures.

joblib numpy pandas python3 sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/ahmrina/unetbraintumorseg

UNetBrainTumorSeg is a 3D Slicer extension named "seg_module" that integrates a 3D UNet trained on BraTS2020 dataset to segment brain tumor area from MRI volumes

3d-slicer-extension monai numpy python pytorch slicer unet vtk

Last synced: 07 May 2026

https://github.com/manjotkaurgill/agritech

Enter details of your soil and weather, and find best suitable crop for farming. With our advanced AI system, you can make informed decisions and optimize your agricultural practices.

flask generative-ai insight-generation machine-learning matplotlib mongodb nextjs numpy pandas python scikit-learn seaborn

Last synced: 12 Apr 2026

https://github.com/davityak03/object-detection-in-images-using-yolo

This project demonstrates object detection using the YOLO (You Only Look Once) model, providing a solution for detecting and labeling multiple objects within images or video streams. This implementation covers image preprocessing, model inference, and visualization of detected objects.

computer-vision matplotlib-pyplot numpy object-detection opencv-python pandas python yolo

Last synced: 18 Jan 2026

https://github.com/harshavardh8n/machine-learning-projects

Machine Learning Projects: A collection of machine learning projects developed as part of the curriculum, utilizing libraries such as NumPy, Pandas, Seaborn, and Matplotlib. These projects demonstrate various algorithms, including Apriori and Decision Trees, showcasing practical applications and data analysis techniques.

apriori-algorithm decision-trees jupyter-notebook machine-learning numpy pandas python

Last synced: 18 Jan 2026

https://github.com/mkdirer/disaster-tweet-nlp-adaboost-transformers

A machine learning project that classifies tweets as disaster-related or non-disaster-related using transformers and the AdaBoost technique. Achieved an accuracy of approximately 80% on the validation set

geopy keras matplotlib numpy pandas python sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/isk-daniar/signal-filter-plt

Signal filter PLT

numpy pillow python

Last synced: 18 Apr 2026

https://github.com/lewis-morris/image_augment

Image augmentation for CNN's. Aiming to increase generalization.

augmentation image numpy python

Last synced: 18 Feb 2026

https://github.com/ahmedabdalkreem/lung-cancer

It help to know the patient have lung cancer or not and show what depenent in this disease that lead to this disease.

matplotlib numpy pandas python sklearn-library

Last synced: 14 Apr 2026

https://github.com/nikhleshshukla123/web-scraping-using-python

scrapes multiple pages of Amazon search results using python.

beautifulsoup4 numpy pandas python

Last synced: 03 May 2026

https://github.com/yappy2000d/pps-ndarray

Multi-dimensional array implementation

cpp numpy

Last synced: 18 Jan 2026

https://github.com/yeisson8a/tallerdataanalysis

Taller de Data Analysis utilizando Python y las librerías Pandas, NumPy, Seaborn, Matplotlib para manipulación de datos de un archivo, así como la librería Statsmodels para regresiones lineales.

analisis-de-datos linear-regression matplotlib numpy pandas python3 seaborn statsmodels

Last synced: 14 Apr 2026

https://github.com/bhavnanahar/coral_project

ChatGPT said: This project analyzes and predicts the health of coral reefs in the Florida Keys by examining coral cover and species richness trends over time.

colab-notebook matplotlib numpy pandas plotly python random-forest scikit-learn seabo

Last synced: 14 Apr 2026

https://github.com/dmarks84/coursework_project_image-text-recognition

Project for University of Michigan Python Programming Specialization -- Read in documents with images and text, and utilized CV libraries/packages to extract specific types of images and text, pairing them together

classification computer-vision image-classification numpy pandas programming python text-classification

Last synced: 14 Apr 2026

https://github.com/sohitbennett/roadsafe

A Deep learning computer vision system for real-time traffic safety monitoring.

computer-vision esrgan keras numpy pandas python scikit-learn tensorflow tesseract-ocr yolov5 yolov8

Last synced: 08 Apr 2026

https://github.com/gurramankit/censusproject_ml_randomforest

The objective of this project is to build a classification model using the Census Income dataset from the UCI Machine Learning Repository. The model predicts whether an individual's income exceeds $50,000 per year, based on their demographic and employment-related attributes.

matplotlib numpy pandas python random-forest scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/farhannirzhor/python_eda_project_on_diwali_sales_analysis

In this project, I was given a sales data. I analyzed their sales data and shared a summary

data-cleaning data-manipulation exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/badranalyst/time-series-analysis-of-global-trends-in-diet-gym-and-finance

This project analyzes global trends in diet, gym, and finance over time using time series data. The analysis is performed using Python libraries like Pandas, Matplotlib, and Seaborn to visualize trends and identify patterns in these sectors across various countries.

data-analysis dataset matplotlib-pyplot numpy pandas python seaborn time-series

Last synced: 14 Apr 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/saylie/cat-dog-classifier-

The Cat-Dog Classifier is a deep learning-based image classification project that can differentiate between images of cats and dogs. It uses a Convolutional Neural Network (CNN) model trained on labeled datasets to predict whether a given image belongs to a cat or a dog.

ai javascript keras numpy opencv python react tensorflow

Last synced: 14 Apr 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/soumyapro/parkinson-disease-prediction

This project predicts Parkinson's disease using machine learning models.

logistic-regression numpy pandas scikit-learn svc xgboost

Last synced: 19 Jan 2026

https://github.com/sudarshanc00/smishing

This project aims to classify text messages to detect potential smishing (SMS phishing) attacks. Using machine learning, the project provides a classifier that can differentiate between legitimate messages and smishing attempts, helping to prevent scams.

nltk numpy pandas python scikit-learn scipy

Last synced: 14 Apr 2026

https://github.com/lorenzorottigni/ml-kyphosis

Machine Learning python bootcamp: decision trees & random forest on kyphosis dataset

decision-trees ipynb machine-learning numpy pandas python random-forest seaborn

Last synced: 14 Apr 2026

https://github.com/sultanazhari/game-trend

Ice as a online store Video Games want to know a new trend of video game to help them prepare a video games which is more profitable to sell. As a data Analyst we're giving a trend of video game by genre, platform, console and the others clasification.

matplotlib-pyplot numpy pandas python3 scipy-stats video-game

Last synced: 11 Apr 2026

https://github.com/jeffson-dsilva/smartphone-price-prediction

Smartphone Price Prediction is a machine learning project that predicts the price category of a smartphone based on its features. This project provides insights into how various specifications, such as RAM, battery capacity, and processor speed, influence the pricing of smartphones.

dataanalysis machine-learning matplotlib modelbuilding numpy pandas price-prediction python seaborn smartphone

Last synced: 14 Apr 2026

https://github.com/sayidmushtaq/reportcard

Tracking and analyzing student grades. It calculates averages, identifies top performers, and generates performance reports with ease.👨🏼‍🎓👨🏼‍🎓

begginer-project numpy reporting-tool

Last synced: 27 Jan 2026

https://github.com/angelarreola/ai_notes

Notas de la materia "Inteligencia Artificial" para su posterior extraccion mediante algun modelo de lenguaje que nos permita dar respuestas personalizadas con base a la informacion presente en este repositorio.

ai matplotlib numpy pandas phaserjs python scikit-learn

Last synced: 21 Jan 2026

https://github.com/szymon-budziak/real_estate_house_prices_prediction

Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.

data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn

Last synced: 21 Jan 2026

https://github.com/wasifsohail5/amusic-ai_powered_musicrecommendationsystem

AMUSIC is an AI-driven music recommendation system that helps users discover personalized songs. Using Python, Streamlit, and Scikit-learn, it offers smart recommendations, advanced search, and interactive music insights. Users can save favorites, create playlists, and export data for a seamless music discovery experience.

joblib k-nearest-neighbours matplotlib minmaxscaler numpy pandas pickle plotly python scikit-learn seaborn streamlit

Last synced: 14 Oct 2025

https://github.com/helabenkhalfallah/art-bloom

ArtBloom is a backend application designed for art enthusiasts and researchers.

analytics backend-api numpy pandas-python python recommendationsystem

Last synced: 22 Jan 2026

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://github.com/hansalemaos/locate_pixelcolor_cython_multi_auto_compile

Detects colors in images 8 x faster than Numpy / Uses Cython - returns a dict

bot c color colors colour colours cpp cython fast find multiprocessing numpy numpy-array opencv rgb rgba search

Last synced: 24 Feb 2026

https://github.com/rohanrony19/movie-recommendation-system

This is a python project where using Pandas library we will find correlation and give the best recommendation for movies.

data-analysis deep-learning knn-algorithm numpy pandas python recommendation-system

Last synced: 14 Apr 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/matkorussovich/student-performance-analysis

Este repositorio contiene un análisis del desempeño académico de estudiantes, realizado como parte del módulo "Introducción al Data Science" en el Máster en Data Science de la Universidad Europea de Madrid.

jupyter-notebook matplotlib-pyplot numpy pandas python

Last synced: 14 Apr 2026

https://github.com/sralter/potential_talents

Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.

lambdarank learning-to-rank lightgbm matplotlib nlp numpy pandas python pytorch ranknet

Last synced: 09 Apr 2026

https://github.com/alansteinbarth/irysy

🪻 Irysy 🌼 eksploracyjna analiza danych – wizualizacje, ML, transparentny kod i autorskie grafiki. Projekt łączący AI, estetykę i praktyczne umiejętności!

ai classification clean-code data-science exploratory-data-analysis feature-engineering github-pages iris-dataset jupyter-notebook machine-learning matplotlib modern-ui numpy open-source pandas portfolio-project python reproducible-research seaborn visualization

Last synced: 17 Oct 2025

https://github.com/mbarbetti/hello-snakemake

Simple implementation of snakemake to sample and plot data

hello-world matplotlib numpy python snakemake

Last synced: 14 Apr 2026

https://github.com/architj6/gesture-volume-control

Easily control your system volume with simple hand gestures! This project leverages OpenCV, MediaPipe, and Pycaw to track hand movements and adjust the system volume dynamically. Say goodbye to keyboard shortcuts and volume buttons—just move your hand to set the volume!

handgesture mediapipe numpy opencv os pycaw python python3 volume-control

Last synced: 03 May 2026