An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/zenithclown/agine

Library for finding Nearest Neighbor or to find if two points on Earth have a Direct Line of Sight.

bearing bearing-calculations distance-calculation euclidean-distance geometry haversine manhattan-distance nearest-neighbor-search numpy pandas rasterio shapely

Last synced: 28 Jan 2026

https://github.com/mike014/soundbot

This Repository will contain the prototype of a ChatBot created for the playback of audio, or everything related to audio in the world of video games and not.

chatbot dsp nlp nlp-library nlp-machine-learning numpy python scipy sound-processing

Last synced: 13 Apr 2026

https://github.com/bdurga26/fake-news-detection

This project aims to find whether the given news is real or fake by using Machine learning Algorithms.

ds ml numpy pandas python sklearn

Last synced: 14 Apr 2026

https://github.com/sawaira-iqbal/data-visualization-project-on-car-sales-data

Explore Sales Data Visualization with Interactive Charts & Insights! 📊 Uncover trends and patterns to drive smarter automotive decisions.

bivariate-analysis data-science data-visualization interactive-visualizations matplotlib multivariate-analysis numpy plotly python seaborn univariate-analysis

Last synced: 12 Apr 2026

https://github.com/yousefmostafa125/image-processing

An interactive image processing web app built with Streamlit and OpenCV. The app allows users to upload an image, apply various processing steps, and download the results individually or as a ZIP file.

numpy opencv-python python streamlit

Last synced: 03 May 2026

https://github.com/kwasi-dankwa/game-sales

This project investigates why Nintendo’s global market share fluctuated between 2010–2016, identifies top-performing genres and regions, and builds predictive models to forecast sales and regional performance. It culminates in specific, actionable recommendations to grow market share.

business-intelligence gaming juptyernotebook-language matplotlib numpy pandas postgresql python3

Last synced: 09 Apr 2026

https://github.com/mehwishferoz/data-analysis-with-python-zero-to-pandas

This repository contains the Python code and projects I created while learning the Data Analysis with Python: Zero to Pandas course. The course covers essential topics such as data cleaning, analysis, and visualization using powerful Python libraries like Pandas, NumPy, Matplotlib, and Seaborn.

exploratory-data-analysis jovian matplotlib numpy pandas seaborn

Last synced: 10 May 2026

https://github.com/prateekrajsrivastav/financial-transition-classification

This project focuses on classifying financial transactions using machine learning techniques. By leveraging labeled data, the model aims to categorize transactions into predefined categories (e.g., "Food," "Transport," "Shopping," etc.).

matplotlib numpy pandas-python scikitlearn-machine-learning seaborn

Last synced: 07 Sep 2025

https://github.com/murugavl/customer_churn_prediction

Customer churn prediction is a crucial task in business analytics, helping companies identify customers who are likely to leave and take preventive actions. This project utilizes Machine Learning (ML) techniques to analyze customer behavior and predict churn.

jupyter-notebook machine-learning matplotlib numpy pandas pickle python scikit-learn streamlit

Last synced: 13 Apr 2026

https://github.com/shwetapardhi/assignment-1-q9_a--basic-statistics-level-1

Q9) Calculate Skewness, Kurtosis & draw inferences on the following data Cars speed and distance Use Q9_a.csv

kurtosis matplotlib-pyplot numpy pandas python seaborn skewness

Last synced: 05 May 2026

https://github.com/mahmood-anaam/grid-world-value-iteration

An implementation of the Value Iteration algorithm for solving the Grid World problem. This project provides a function to compute the optimal value function for a grid-based environment where a robot navigates to maximize rewards while avoiding penalties.

dynamic-programming grid-world numpy optimization-problem python value-iteration-algorithm

Last synced: 10 May 2026

https://github.com/ljadhav25/data-engineering-poc

This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da

data-analysis etl matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/cezlul/analyse-ventes-immobilier

Solution ML d'analyse immobilière parisienne : classification automatique appartements vs commerces (K-means, 91%) et prédiction prix (régression, R²=0.98) sur 26K transactions. Valorise portefeuille 169M€ avec recommandations stratégiques data-driven.

data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn

Last synced: 13 Apr 2026

https://github.com/rotantos/ml-projects

Includes various ML projects created for educational purposes.

classification-model jupyter-notebook machine-learning numpy pandas python sklearn-library

Last synced: 10 Apr 2026

https://github.com/samestrin/image-manipulation-api-digitalocean

A Python DigitalOcean App Platform based REST image manipulation API using Flask, NumPy, and OpenCV that runs in a Docker container.

api docker flask image image-processing numpy python python3 rest rest-api

Last synced: 13 Apr 2026

https://github.com/ahmedabdalkreem/covid-19

In this project we build complete project conatin tree part first part build deep leanring model using CNN second part build simple mobile app using Flutter and the last part is to connect mobile app with model using Flask API.

cnn dart deep-learning flask-api flutter matplotlib mobile-app numpy pandas python tensorflow

Last synced: 10 Apr 2026

https://github.com/kunalshelke90/wine-quality-testing

This project is about creating a machine learning algorithm that can predict the quality of wine based on the given dataset. Different machine learning algorithms such as logistic regression, decision tree and random forest are used in this project.

eda feature-engineering flask machine-learning numpy pandas python

Last synced: 10 Apr 2026

https://github.com/bbhargavpanchal/brain-tumor-segmentation-unet

U-Net based Convolutional Neural Network for brain tumor segmentation using MRI images. Implemented in Python with TensorFlow, OpenCV, and Scikit-learn.

keras-tensorflow numpy opencv pandas python scikit-learn sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/dhaase-de/dito

Yet another toolbox for the daily work with OpenCV under Python.

numpy opencv python

Last synced: 22 Aug 2025

https://github.com/ksprugevics/magic-square

Some simple algorithm implementations for solving (ordinary) magic squares using NumPy

algorithms mathematics numpy python

Last synced: 21 Apr 2026

https://github.com/burhanali2211/marksanalyzer

This is a simple CLI-based tool that helps analyze student marks by calculating key statistics such as the highest and lowest scores.

analysis analyzer arrays marks numpy

Last synced: 05 May 2026

https://github.com/iamwatchdogs/retail-insights-from-superstore-data

An simple Data Analysis Project for finding insight within the Super store data set

conda-environment data-analytics jupyter-notebook matplotlib numpy pandas python

Last synced: 05 May 2026

https://github.com/chandkund/predicting-diabetes-onset

The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.

deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All codes for the data pipelines processing, statistical modellings, descriptive statistics and plot visualizations from airborne phase of Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/mayankmittal29/pagesage-ai_urls_content_scraper_webapp

PageSage is a modular web scraping and semantic retrieval system using BeautifulSoup for extraction, SentenceTransformers for dense embeddings, and ChromaDB for scalable vector search—enabling efficient data indexing for advanced RAG pipelines.

beautifulsoup chromadb fastapi gpt-turbo numpy openai-api python3 question-answering rag react requests sentence-transformers tailwindcss web-scraping

Last synced: 10 Apr 2026

https://github.com/hilarionengarejr/movie-recommender-app

Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.

docker flask nltk numpy pandas python3 scikit-learn selenium

Last synced: 10 Apr 2026

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 11 Apr 2026

https://github.com/eesunmoon/aml_project

[Project] Analyzing Fitness Patterns & Building Predictive Models

matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/harmanveer-2546/covid-19-chances-detection

The main goal of this project is to build a machine learning model that, given a Covid-19 patient's current symptom, status, and medical history, will predict whether the patient is in high risk or not.

covid-19 covid-data logistic-regression matplotlib mlp-classifier numpy pandas python random-forest seaborn sklearn-library xgboost

Last synced: 10 Apr 2026

https://github.com/avin27/credit-card-fraud-detection

A python script to analyse credit card data to detect fraudulent transactions

data-science numpy pandas sklearn

Last synced: 10 Apr 2026

https://github.com/evan-dg31/data-science

Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering

classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn

Last synced: 13 Apr 2026

https://github.com/tharushiimasha/python-practice

This repository contains Python programs and exercises from practice sessions. It covers Python basics, file handling, and libraries like NumPy and pandas for data manipulation.

api-integration csv-files file-handling json-files jupyter-notebook numpy oops-in-python pandas python web-scraping

Last synced: 13 Apr 2026

https://github.com/darshanpakhale250-gif/customer-churn-prediction-ml

A machine learning project to predict customer churn using regression and classification models including logistic regression, decision tree, and random forest. Performed EDA, visualizations, and model evaluation. The dataset is taken from Kaggle and implemented in Google Colab.

colab-notebook customer-churn-analysis data-science decision-trees kaggle logistic-regression machine ml numpy pandas python random-forest

Last synced: 13 Apr 2026

https://github.com/raghavendranhp/youtube_data_harvesting

The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions

apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api

Last synced: 13 Apr 2026

https://github.com/kizman-23/supervised_models

Classical prediction of future data using models trained by labeled data

numpy pandas scikit-learn supervised-machine-learning

Last synced: 13 Apr 2026

https://github.com/harmanveer-2546/movie-industry

Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.

business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python

Last synced: 10 Apr 2026

https://github.com/vc-nithesh944/pandas_numpy

Practice repository for NumPy & Pandas with scripts, notebooks, and notes for future revision

numpy pandas

Last synced: 05 May 2026

https://github.com/venkat-0706/accenture-hackathon

Developing an e-commerce recommendation system involves utilizing technologies such as Python for programming, Pandas for data manipulation, SQL for database management, FastAPI for building APIs, PostgreSQL for data storage, and Docker for containerization.

alembic api docker fastapi machinelearningalgorithms matplotlib numpy postgresql pydantic python3 scipy seaborn sqlmodel

Last synced: 13 Apr 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/annygutierrez/bank-ann

Basic Artificial Neural Network

keras matplotlib numpy pandas sklearn

Last synced: 13 Apr 2026

https://github.com/anujdutt9/language-model-using-rnn

Text Generation using basic RNN from scratch using Python and Numpy.

language-model numpy python rnn text-generation

Last synced: 08 May 2026

https://github.com/archishmansengupta/dnn

Digit Neural Network is a digit recognition network based on MNIST data set using numpy, pandas and matplotlib

matplotlib mnist neural-network numpy pandas python

Last synced: 13 Apr 2026

https://github.com/jpgiant/nyc_energy_prediction

A comprehensive code for predicting energy usage in NYC using Machine Learning Algorithms.

data-analysis data-science data-visualization folium jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/harmanveer-2546/tweets-cleaning-with-python

Twitter is one of the most used data sources for data analysis. The reason is that it’s open and free to collect unless you subscribe to the paid version one. Besides, it’s pretty simple to collect data from it.

filtering function-calling nlptk numpy os pandas punctuation python re removing-links tokenization tweet-cleaning twitter-data-analysis twitter-sentiment-analysis

Last synced: 10 Apr 2026

https://github.com/ledsouza/medphys-bi

Aplicativo desenvolvido para gestão e análise de dados do Serviço de Física Médica em Medicina Nuclear do HCPA

aws boto3 cloud-computing cloud-run docker dotenv gcp mongodb numpy os pandas plotly pymongo python s3 streamlit terraform vitrinedev yaml

Last synced: 10 Apr 2026

https://github.com/lingumd/school_district_analysis

Updated test score data and school district analysis using Python.

averages binning jupyterlab numpy pandas pandas-loc python

Last synced: 13 Apr 2026

https://github.com/nabilshadman/multiprocessing-time-series-data-simulation

A simulation using multiprocessing to generate 10,000 time dependent samples from an initial dataset of 20 samples

multiprocessing numpy pandas scipy simulation statistics time-series

Last synced: 05 May 2026

https://github.com/thomasa/python-intro

This is a Jupyter notebook slide show for a 2-hour crash intro to Python, NumPy, Matplotlib, and SciPy

matplotlib numpy python scipy tutorial

Last synced: 05 May 2026

https://github.com/charanpool/image-patch-mapping

A Python toolkit for image processing — edge detection, template matching, and patch detection using OpenCV

canny-edge-detector computer-vision edge-detection image-processing numpy opencv python sobel-filter template-matching

Last synced: 18 Jan 2026

https://github.com/vedanty3/supermarket-sales-data-analysis

This project contains data visualization techniques (using pandas and matplotlib) to explore different aspects of supermarket sales data of 3 months.

data-analysis data-science jupyter-notebook matplotlib numpy pandas python

Last synced: 08 May 2026

https://github.com/abideen-olawuwo/nyc-taxi

Prediction the duration of New York Taxi trip

linear-regression matplotlib numpy pandas python seaborn sklearn

Last synced: 11 Apr 2026

https://github.com/nikhilsree5/netflixcasestudy

Global Content Strategy: Leveraging Data Insights to Optimize Netflix's International Growth

eda numpy pandas python visualization

Last synced: 13 Apr 2026

https://github.com/prowilliam/maze_generator

A maze generator is created where you can modify its dimensions and shows a red route where it describes the solution of the maze, and it is exported in png format

numpy pillow python

Last synced: 28 Apr 2026

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/alepm1990/nba-analytics

Análisis y trabajos en Python con BBDD deportivas

matplotlib-pyplot numpy pandas pyhon python3 seaborn

Last synced: 05 May 2026

https://github.com/shlok-nahar/mnist-cnn-classifier

This repository trains and evaluates three CNN models on MNIST, providing performance comparisons and 5 unique visualizations.

confusion-matrix graph heatmap-visualization json machine matplotlib mnist numpy precision-recall python receiver-operating-characteristic seaborn sklearn tensorflow

Last synced: 13 Apr 2026

https://github.com/m1-elmasry/quickmark

Efficient OMR software designed for quick and precise grading of bubble sheet exams.

bubble-sheet image-processing numpy opencv python

Last synced: 10 May 2026

https://github.com/abdullah-masood-05/eye-balll-detection

This repository provides advanced eye gaze tracking using MediaPipe Face Mesh for both video files and live camera feeds.

mediapipe numpy opencv scipy

Last synced: 10 Apr 2026

https://github.com/yuu-eguci/cognitive-services-trial

Try to play with Cognitive Services!! [Cognitive Services] [OpenCV] [Numpy]

cognitive-services dotenv numpy opencv-python pipenv python python3

Last synced: 05 Jul 2025

https://github.com/jessicahora/series_temporais

Análise de Series Temporais do Consumo de Energia da Alemanha.

numpy pandas python time-series-analysis

Last synced: 07 Apr 2026

https://github.com/zafir100100/cancer-stage-prediction

This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.

cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn

Last synced: 05 May 2026

https://github.com/ondiekelijah/shadowcoach-formtracker

A computer vision-based workout form tracking application that uses AI to monitor exercise form, count repetitions, and provide real-time feedback.

mediapipe-pose numpy opencv-python

Last synced: 09 Apr 2025

https://github.com/niteshchawla/loantap-logisticregression

Given a set of attributes for an Individual, determine if a credit line should be extended to them. If so, what should the repayment terms be in business recommendations?

confusion-matrix encoding feature-engineering featureimportance hyperparameter-tuning logistic-regression machine-learning numpy pandas-python precision-recall roc-auc-curve sklearn-metrics smote-sampling

Last synced: 10 May 2026

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/hari7261/playwithdata-python

This is one of the repository where I have put lot of data science and machine learning related questions on their solutions I hope you will find something better than some other platforms. Thank you Happy exploring

data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn

Last synced: 13 Apr 2026

https://github.com/max134/deepplate

A custom-trained YOLOv11-based license plate detection with PaddleOCR for accurate ANPR.

computer-vision cv2 deep-learning image-recognition licesnse-plate-recognition numpy object-detection ocr paddleocr real-time-processing sqlite3 ultralytics yolov11

Last synced: 13 Jul 2025

https://github.com/itshyphen/mass-mailing-script

A simple mass mailing script that sends personalized email to multiple emails importing from csv

numpy pandas python smtplib

Last synced: 13 Apr 2026

https://github.com/shailikadakia/yoga-pose-accuracy-detector

Real-time yoga pose accuracy detector using MediaPipe and OpenCV. Classifies poses from webcam input with KNN and provides feedback on duration and accuracy, built with Python and scikit-learn.

computer-vision fitness health-tech knn machine-learning mediapipe numpy opencv pandas pose-estimation python real-time-detection yoga

Last synced: 09 Apr 2026

https://github.com/bhattbhavesh91/numpy-performance-test

Comparing the performance of Base Python Square root function with a numpy equivalent

list numpy python

Last synced: 20 Apr 2026

https://github.com/chandkund/loan-eligibility-prediction

This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.

data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://github.com/lohiyah/dataset_summariser

A Python tool for generating concise summaries of datasets using Pandas and NumPy for analysis, and Matplotlib and Seaborn for insightful visualizations. Ideal for quick exploratory data analysis (EDA).

matplotlib numpy pandas seaborn

Last synced: 05 May 2026

https://github.com/andersoncrs/aprendizaje_no_supervisado_kmeans_customers

Este repositorio contiene un análisis de datos de clientes de un centro comercial utilizando técnicas de aprendizaje no supervisado, específicamente K Means y clustering jerárquico. El objetivo del proyecto es segmentar a los clientes en grupos homogéneos para entender mejor sus comportamientos y características.

data-analysis kmeans-clustering matplotlib numpy seaborn visualization

Last synced: 10 May 2026