An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/quinnphil/math-for-programmers

Work-along code for the book Math for Programmers by Paul Orland

geometry mathematics matplotlib numpy python

Last synced: 17 Apr 2026

https://github.com/mgitrov/cnn-on-cifar-10

A computer vision project aiming to classify random images uploaded by the user.

artificial-neural-networks computer-vision convolutional-neural-networks deep-learning keras matplotlib numpy pil regularization tkinter

Last synced: 17 Apr 2026

https://github.com/vladimiracunadev-create/python-data-science-program

Python Data Science Program: 197 classes in 9 parts. Advanced syllabus derived from Géron, VanderPlas, Huyen, ISLP, and Barocas/Hardt/Narayanan. A personal resource for learning, teaching, and continuous improvement.

bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 01 Jun 2026

https://github.com/shhiivvaam/regression

Linear Regression Model

linear-regression numpy python

Last synced: 10 May 2026

https://github.com/sarowarahmed/advertising-sales-app

📈 Advertising Sales Predictor: A web app powered by a Machine Learning model, built with NumPy, Pandas, Scikit-learn, and Streamlit, to forecast sales based on TV, Newspaper, and Online Advertising. Deployed on Streamlit Cloud for real-time, easy-to-use predictions.

advertising app machine-learning multiple-linear-regression numpy pandas sales scikit-learn streamlit

Last synced: 07 Feb 2026

https://github.com/mnitin-reddy/content-based-recommendation-system-using-deep-learning

A content-based movie recommendation system using deep learning to predict user ratings by leveraging user and movie features. The system integrates neural networks for feature extraction, utility scripts for data processing, and supports both new and existing user recommendations.

deep-learning keras neural-networks numpy pandas python scikit-learn tensorflow

Last synced: 03 Apr 2026

https://github.com/rosieoh/emergency_dataanalysis

์˜คํ”ˆ๋ฐ์ดํ„ฐ๋ถ„์„-์‘๊ธ‰์˜๋ฃŒ์ฒด๊ณ„ ๋ฐฉ์•ˆ ์ •์ฑ… ์ œ์•ˆ ๋ฐ์ดํ„ฐ ๋ถ„์„

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/neetigyab/pdf_form_parser

Older Test version of designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Apr 2026

https://github.com/gama1903/digital_image_processing

Implementation of Basic Algorithms of Digital Image Processing mainly through Numpy.

digital-image-processing numpy opencv python

Last synced: 08 May 2026

https://github.com/varakanthamsaihruthikreddy975/champo_carpets_case_analysis_python

Analysis of the Champo Carpets case study in Python. It is still under development and will be more comprehensive than the previous analysis in R. The complete R version of the code is available at https://github.com/VarakanthamSaiHruthikReddy975/Champo_Carpets_Case_Study_Analysis

case-analysis jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python3 seaborn

Last synced: 17 Apr 2026

https://github.com/anag0es/projetos-ml

This is a place where I keep various projects for understanding how machine learning works. Each project is a way to practice what I am learning in my studies.

artificial-intelligence crisp-dm data-science deep-learning learning machine-learning neural-network numpy pandas prediction-model python

Last synced: 17 Apr 2026

https://github.com/momo790-m/startup-profit-prediction

Machine learning project to predict profits of new startups based on R&D, Admin, Marketing, and State data

assumptions business colab-notebook linear-regression machine-learning matplotlib numpy pandas preprocessing seaborn skit-learn sklearn

Last synced: 08 May 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/mwoss/mownit

Various tasks from numerical analysis laboratories solved using Python

jupyter-notebook mathematics matplotlib numerical-analysis numpy python scipy

Last synced: 17 Apr 2026

https://github.com/dhruvil-26/python-projects

This repository contains Python projects showcasing data analysis and visualization. 1. IMDB Movie Analysis: Analyzing movie trends, genres, and ratings. 2. Loan Default Analysis EDA: Exploring factors contributing to loan defaults.

eda imdb-dataset loan-default-analysis matplotlib numpy pandas python seaborn visualization

Last synced: 08 May 2026

https://github.com/hasanyahya101/pareto-tutorial-python

This tutorial demonstrates how to create Pareto charts using Python libraries like pandas, NumPy, and Matplotlib. All code for this is given in the Jupyter Notebook.

chart charts code-blocks ipynb jupyter-notebook license markdown matplotlib mit numpy pandas pareto py python

Last synced: 07 Feb 2026

https://github.com/hygull/pyrandocs

A repository (pyrandocs, i.e. Python v2/v3 random documents/files) containing various Python v2/v3 resources (files, documents, etc.) on random topics. A project for trying out different Python features.

base64 beautifulsoup class exception-handling hashlib json jwt-authentication loops matplotlib nltk numpy pandas python2 python27 python3 requests searching time-complexity urllib urllib2

Last synced: 15 Apr 2026

https://github.com/prakashjha1/stock-investment-analysis

The Stock Investment Analysis project can help investors select better-performing stocks.

data-analysis data-science numpy pandas pandas-datareader parallel-programming python

Last synced: 08 May 2026

https://github.com/prcharan592/social-media-sentiment-analysis

Social media sentiment analysis using tweets involves analyzing tweet data to determine public sentiment (positive, negative, or neutral) using natural language processing (NLP) and machine learning techniques.

data-visualization machine-learning matplotlib nlp nltk numpy pandas python3 sentiment-analysis spacy tweets

Last synced: 04 Apr 2026

https://github.com/datamallo/python101

The resources you need to learn or get better at Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/alain-godo/spotify-popularity

A simple analysis and modeling for prediction of the popularity of songs on Spotify

linear-regression machine-learning numpy pandas random-forest sckiit-learn spotify tree-decision

Last synced: 08 May 2026

https://github.com/luizabash/wmap-tt-eda

EDA of the WMAP 5-Year TT Power Spectrum data. The analysis includes summary statistics, data visualization, polynomial regression for model fitting, and hypothesis testing.

astrophysics matplotlib numpy pandas scipy

Last synced: 01 Feb 2026

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_01

GoIT, Python Data Science, Homework number 1.

data-science google-colab jupyter-nootebook numpy numpy-arrays python

Last synced: 18 Apr 2026

https://github.com/akansharajput280799/data-exploration-and-visualization-for-streaming-platform

Data analysis and visualization for a streaming platform, providing insights and recommendations to improve its user base.

data-visualization excel exploratory-data-analysis matplotlib numpy python seaborn

Last synced: 08 May 2026

https://github.com/yassin522/music-genre-classification

This repository is dedicated to the Music Genre Classification project, an endeavor that leverages machine learning techniques to accurately classify music into various genres

catboost matplotlib numpy pandas python sklearn

Last synced: 04 Apr 2026

https://github.com/octavioduarte/datascience

A repository with study notes on Data Science concepts, NumPy, pandas, and an introduction to statistics.

data-science jupyter-notebook numpy pandas

Last synced: 08 May 2026

https://github.com/huzaifa12466/neuralnetscratch

A simple neural network implemented from scratch in Python using NumPy, trained on the Moons dataset

artificial-intelligence artificial-neural-networks backpropagation deep-learning forwardpropagation machine-learning matplotlib neural-network neural-networks numpy

Last synced: 23 Jun 2026

https://github.com/amiegirl/ford_gobike-tripdata

This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area.

jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 08 May 2026

https://github.com/thoughtscript/python_api_2023

Some fun with Flask, Python, and Machine Learning

angular docker docker-compose flask machine-learning mysql numpy

Last synced: 04 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/niteshchawla/nc-walmart-casestudy

The Management team at Walmart Inc. wants to analyze the customer purchase behavior (specifically, purchase amount) against the customer's gender and the various other factors to help the business make better decisions. They want to understand if the spending habits differ between male and female customers.

central-limit-theorem confidence-intervals matplotlib-pyplot numpy pandas python scipy-stats seaborn stats

Last synced: 08 May 2026

https://github.com/audeering/audmath

General math functions

math numpy

Last synced: 07 Feb 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

Analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026

https://github.com/kasraskari/python-for-data-science

Foundational concepts and libraries for data science with Python.

data-science jupyter-notebook matplotlib numpy pandas python scipy

Last synced: 04 Apr 2026

https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation

An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.

clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/abdul-rafay19/california-housing-price-prediction

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization

Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.

abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels

Last synced: 04 Apr 2026

https://github.com/yashsonaar/machine-learning-tasks

This repository contains machine learning tasks, including classification, a recommendation system, and a fraud detection system.

classification jupyter-notebook machine-learning numpy pandas prediction python scikit-learn testing

Last synced: 04 Apr 2026

https://github.com/ccolpasm/pageviewanalyzer

PageViewAnalyzer analyzes and visualizes time series data of forum visits (2016-2019). It uses Pandas, Matplotlib, and Seaborn to create line plots, bar charts, and box plots, highlighting trends, monthly averages, and seasonality in page views. Perfect for data exploration with Python.

data-science matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/sultanazhari/decisive-model-with-high-profit-potential-and-low-risk

The OilyGiant mining company is looking for the best locations for 200 new wells. As data scientists, we are building a model that can choose the best 200 points by profit and risk.

accuracy-score chi2-contingency confusion-matrix logistic-regression matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 18 Apr 2026

https://github.com/jsnevt/regressao_linear

Simple linear regression: what was the speed if it took distance x to stop?

linear-regression matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022

Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.

catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap

Last synced: 18 Apr 2026

https://github.com/armanshadyan/bacteria-detection

Deep learning-powered tool for detecting and analyzing bacteria in microscope images. Designed for microbiology research, it identifies bacteria, counts them, and calculates coverage area with high precision. Supports real-time video processing, single-image analysis, batch processing, and model training.

mediapipe numpy object-detection opencv python pytorch

Last synced: 05 Apr 2026

https://github.com/ricardorobledo/spamemailclassification

Spam email classification using machine learning (Random Forest, SVC, Logistic Regression, etc.) with data balancing techniques (SMOTE, BorderlineSMOTE, ADASYN). Final calibrated Random Forest model achieves ROC-AUC 0.982 and PR-AUC 0.979 on the Spam Email Classification dataset.

imbalanced-data imbalanced-learning numpy pandas python3 sklearn

Last synced: 05 Apr 2026

https://github.com/tsar123/neutral-single-nucleotide-polymorphisms-analysis

analysis of data on neutral single nucleotide polymorphisms

dna-polymorphism matplotlib numpy pandas seaborn snv

Last synced: 01 Feb 2026

https://github.com/murugavl/flower-prediction

Flower Prediction is a machine learning project that uses the Iris dataset to classify iris flowers into three species: Setosa, Versicolor, and Virginica. The project includes data analysis, model training with various algorithms, and deployment via a Flask web application for user-friendly predictions.

flask machine-learning matplotlib numpy pandas python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/dyceszn/i.duruohacos102

First-year deep dive into problem-solving with Python, featuring class practices and projects using NumPy, Pandas, Tkinter, and psycopg2.

numpy psycopg2 python tkinter

Last synced: 18 Apr 2026

https://github.com/iv4n-ga6l/titanic-survival-prediction

A predictive model in Python that answers the following question: "Using data relating to passengers (name, age, gender, socio-economic class, etc.), who has the greatest chance to survive?"

matplotlib numpy pandas python seaborn sklearn

Last synced: 18 Apr 2026

https://github.com/ayushmaan-xd/exploratory-spotify-data-analysis

Exploratory Spotify Data Analysis is a project where I analyzed Spotify's music dataset to uncover trends in audio features and song popularity. Using Python and data visualization tools

jupyter-notebook matplotlib numpy pandas python seaborn spotify-dataset

Last synced: 18 Apr 2026

https://github.com/nksmanya/virtual_canvas

A hand gesture-based drawing app that lets you create art in the air using hand movements

mediapipe numpy opencv opencv-python python

Last synced: 18 Apr 2026

https://github.com/manalisbhavsar/mall-customers-clustering

Applies K-Means clustering to mall customer data, segmenting customers by annual income and spending score to identify patterns and group customers for targeted marketing.

data-analysis data-visualization matplotlib numpy pandas python scikit-learn

Last synced: 18 Apr 2026

https://github.com/vyask21/opencv-drowsy

Drowsiness detection can be done quickly using the shape predictor face landmarks.dat model file, which marks the essential landmarks on the face, together with OpenCV modules.

numpy opencv-python scipy

Last synced: 08 Feb 2026

https://github.com/nellyndj52/breast-cancer-classification-ml

This project aims to build machine learning models to predict whether a tumor is malignant or benign based on the Breast Cancer dataset. Various classification algorithms are tested, including Logistic Regression, Random Forest, and Support Vector Machine (SVM).

breast-cancer cancer dataset flask-api heroku-deployment lime machine-learning mlp-networks numpy perceptron-neural-networks pickle segmentation support-vector-machines svm

Last synced: 03 Apr 2025

https://github.com/hamishivi/denserflow

A basic numpy neural net library. This achieved high marks in a uni assignment!

deep-learning neural-network numpy

Last synced: 18 May 2026

https://github.com/johanntf/direccion_maximo_crecimiento

Direction of maximum growth of a multivariable function, given a starting point

customtkinter matplotlib numpy python

Last synced: 18 May 2026

https://github.com/sanjanahombal/study-on-sentiment-analysis

This project explores the optimal combination of Bag-of-Words and TF-IDF vectorization with Naive Bayes and SVM for sentiment analysis. It evaluates performance using accuracy, precision, recall, and F1-score, addressing ethical concerns like data privacy and bias to improve sentiment classification in real-world applications.

bag-of-words confusionmatrix googlecollab gridsearch-crossvalidation matplotlib-pyplot naive-bayes-classifier numpy pandas seaborn sklearn svm-classifier tfidf-vectorizer

Last synced: 07 Jan 2026

https://github.com/sultanazhari/customer-habit-analysis-model

Megaline company wants to develop a model that can analyze consumer behavior and recommend one of Megaline's two new plans: Smart or Ultra. In this classification task, we need to develop a model that is able to choose the right package

accuracy-score decision-tree-classifier logistic-regression matplotlib-pyplot numpy pandas python3 random-forest-classifier seaborn train-test-using-sklearn

Last synced: 11 Apr 2026

https://github.com/lefteris-souflas/economic-connectedness-analysis

A Jupyter notebook replicating studies on social capital from the journal Nature, analyzing economic connectedness, upward income mobility, and more. Python and relevant datasets are used to recreate figures and analyses.

analytics choropleth-map json jupyter-notebook matplotlib matplotlib-pyplot numpy pandas plotly-express python3 regplot scatterplot seaborn stata visualization

Last synced: 07 Apr 2026

https://github.com/santiagoenriquega/custom_neural_net

This repository focuses on building a neural network from scratch using only base Python and NumPy.

gradient-descent neural-network numpy optimization python

Last synced: 18 May 2026

https://github.com/toon-van-berkel/imagetovideofx

ImageToVideoFX is a simple Python project that takes an image (JPG/PNG) and transforms it into a short video with a smooth zoom effect. You can optionally add background music. This is perfect for content creators or developers who want to automate turning images into engaging video content.

imagetovideo moviepy numpy pillow python

Last synced: 08 May 2025

https://github.com/trups39/nyc-green-taxi-trip-analysis

Analyzes NYC Green Taxi data, predicting fares and optimizing pickup locations with regression machine learning models to uncover travel patterns.

numpy pandas predictive-modeling python3 regression-models skit

Last synced: 17 Feb 2026

https://github.com/nazmul-1117/100-days-of-machine-learning

I'm Nazmul, and I'm so excited to start a new journey: 100 Days of Machine Learning. It's February 8, 2025. I'm so excited; let's see what happens, insha'Allah.

data-science machine-learning numpy pandas-dataframe python3 scikit-learn statistics

Last synced: 11 Aug 2025

https://github.com/dor-sketch/audio-visual-synth

Combining real-time graphics, audio synthesis, and interactive elements to create an engaging, cross-platform music visualizer.

image-processing midi-player music-composition musicsheet notes-tool numpy pygame pygbag pyopengl

Last synced: 18 May 2026

https://github.com/rajesh9943/developing-a-bike-buyer-sales-report-dashboard-using-excel

This dataset has details of 1000 users from different backgrounds and whether or not they buy a bike. This data can be used to build the dashboard in Google Sheets. There are some NA (Null / Empty) values injected in the dataset. Use this dataset for Data Cleaning, Exploration, and Visualization.

excel google-sheets numpy pandas-library python3

Last synced: 17 May 2026

https://github.com/yash22222/analysis-of-amcat-aspiring-minds

Analyzing employment outcomes for engineering graduates based on a provided dataset. Specifically, it aims to verify a claim regarding the salary range for fresh graduates in certain engineering roles and investigate potential relationships between gender and specialization preferences.

amcat aspiring-minds charts computer-engineering data-analytics data-vizualisation dataset graph machine-learning matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/yesenvidnath/facerecognito-pythonfacialrecognitionapplication

A web application that detects faces from image links supplied in an Excel file. The project uses PyTorch and Flask.

facent flask numpy python

Last synced: 10 Apr 2026

https://github.com/dvamishkilapa/xolm2_py

Ported Xolm2 VSU

mathplotlib numpy pyqt5

Last synced: 23 Oct 2025

https://github.com/biocpy/mopsy

Matrix operations

matrix numpy scipy

Last synced: 07 Mar 2026

https://github.com/wondermongering/nonlineardatagen

A flexible Python framework for generating, fitting, and visualizing noisy nonlinear data. Perfect for educational purposes, algorithm testing, and demonstrating statistical concepts. Includes tools for various noise models, custom function fitting, robust error metrics, and publication-quality visualizations

curve-fitting data-simulation data-visualization education educational-tool interpolation mathematical-modelling matplotlib noise-modeling numpy regression-analysis scientific-computing scipy signal-processing statistics teaching-resource time-series-analysis

Last synced: 22 Aug 2025

https://github.com/aminkhavari78/-maternal-health-risk-data-set

In this part I'm working on Maternal Health Risk Prediction

data-science linearsvc machine-learning matplotlib numpy pandas scatter-plot sklearn

Last synced: 05 May 2026

https://github.com/kr1shnasomani/facetronix

A toolkit for face detection, annotation, and mask recognition.

computer-vision deep-learning keras neural-network numpy opencv tensorflow

Last synced: 05 Apr 2026

https://github.com/yalai92/master_academic_projects

This repository is dedicated to showcasing the academic projects completed during my Master in Data Science & AI. The main objective is to show a collection of projects in various data science fields, including: data cleaning & preprocessing, data analysis, data visualization, machine learning, clustering, among others.

automation clustering data-merging google-colab jupyter-notebook machine-learning matplotlib numpy pandas powerbi python sklearn xgboost

Last synced: 13 Jun 2025