An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
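As a minimal sketch of what that description means in practice, the snippet below builds a small 2-D array and applies a few array-level operations. The values are made up for illustration and are not taken from any repository listed here.

```python
import numpy as np

# A 2-D array (matrix) built from nested lists; the numbers are illustrative only.
a = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0]])

print(a.shape)               # two rows, three columns

col_means = a.mean(axis=0)   # mean of each column, computed without a Python loop
scaled = a * 10              # elementwise multiplication by a scalar
product = a @ a.T            # matrix product of a (2x3) with its transpose (3x2)

print(col_means)
print(scaled)
print(product)
```

Each operation acts on the whole array at once, which is the main reason most of the projects below depend on NumPy.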

https://github.com/baschin1103/machine-learning-linear-regression-tsi

The model is a multiple linear regression from scikit-learn. The aim is to forecast the so-called Total-Skill-Index (TSI) of a player from hattrick.org. More information: https://wiki.hattrick.org/wiki/Total_Skill_Index. This repository contains the program and the necessary data.

csv linear-regression machine-learning numpy pandas python statistics

Last synced: 11 May 2026

https://github.com/parthds02/customer-segmentation-with-kmeans-clustering

Analyze customer behavior using Python and KMeans Clustering on transactional data. Features RFM analysis, data cleaning, clustering insights, and actionable visualizations to support business decision-making.

data-analysis data-visualization feature-engineering kmeans-clustering numpy pandas vscode

Last synced: 11 May 2026

https://github.com/mihirh19/uber-analysis

Uber's 2016 dataset analysis offers insightful information about the company's operations and user behaviour. Patterns and trends can be discovered by looking at variables like trip time, distance travelled, and pickup/drop-off locations. Uber may use this information to detect high-demand locations, increase overall efficiency, and optimise driver

matplotlib numpy opendatasets pandas seaborn

Last synced: 11 May 2026

https://github.com/bhuvilol/objdetector_webcam_and_video

Object Detection from Webcam input or video input

cvzone numpy opencv python3 yolov8

Last synced: 11 May 2026

https://github.com/khalidelboray/image_processing

Notes about image processing

image-processing numpy python3

Last synced: 11 May 2026

https://github.com/tfiers/unitlib

Physical units for NumPy arrays. ⏱ Fast • Simple • High voltage

numpy python scientific-computing unitlib units

Last synced: 13 May 2026

https://github.com/mituskillologies/dkte-da-mar25

Programs conducted at DKTE's Engineering Institute, Ichalkaranji in training on Python Data Analytics March 2025.

data-analysis matplotlib numpy pandas python-programming tkinter-python

Last synced: 13 May 2026

https://github.com/asuquoaa/big_4_sports_teams_and_city_population_analysis-2018-

Analysis of sports teams' win/loss ratios vs. metro area populations across NFL, NBA, MLB, and NHL.

data-cleaning-and-preprocessing numpy pandas

Last synced: 13 May 2026

https://github.com/gmcgibbon/udacity-dlnd-project-1

Udacity Deep Learning Nanodegree - Project 1

deep-learning jupyter-notebook nn numpy udacity

Last synced: 13 May 2026

https://github.com/maroshmka/np-mlp-clf

Raw implementation of an MLP classifier in Python using NumPy.

classification example-code learning machine-learning multilayer-perceptron neural-network numpy

Last synced: 14 May 2026

https://github.com/chicoyildiz/python-raytracing-demo

Interactive Python ray tracer (numpy + pygame) with a ceiling camera, dynamic resolution scaling, and a glowing bouncing fireball.

numba numpy python python3 raytracing

Last synced: 14 May 2026

https://github.com/khushi130404/gradient_realm

Gradient_Realm is a Python project exploring regression techniques and optimization methods like Regularization, Batch, Stochastic, and Mini-batch Gradient Descent. It uses scikit-learn and custom implementations for hands-on learning.

batch-gradient-descent mini-batch-gradient-descent numpy stochastic-gradient-descent

Last synced: 14 Jun 2026

https://github.com/pnguyen215/pycelize

Pycelize is a Flask application designed for processing Excel and CSV files. It provides RESTful APIs for common data operations including extraction, normalization, mapping, SQL generation, and file binding.

excel lib numpy openpyxl pandas py-app pycelize python-service python3

Last synced: 17 Jun 2026

https://github.com/dino-kupinic/pocolocosolver

algorithmic solution with 3D visualization

algorithms numpy panda3d puzzle pytest python solver

Last synced: 19 Jun 2026

https://github.com/harshavardhanbommalata/knn_tutorial

K-Nearest Neighbors (KNN) is a simple yet powerful machine learning algorithm. Unlike models that learn parameters during training, KNN uses lazy learning: it stores the dataset and predicts by finding the closest neighbors, making decisions through majority voting.

machine-learning-algorithms mathematical-modelling matplotlib numpy pandas

Last synced: 20 Jun 2026
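The KNN description in the entry above (store the dataset, find the closest neighbors, take a majority vote) can be sketched in a few lines of NumPy. This is an illustrative sketch, not code from the linked repository; the function name `knn_predict`, the Euclidean distance metric, and the toy data are all assumptions.

```python
import numpy as np

def knn_predict(X_train, y_train, x_query, k=3):
    """Predict a label for x_query by majority vote among its k nearest training points.

    Hypothetical helper for illustration; uses Euclidean distance.
    """
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train)
    # "Lazy learning": there is no training step, only distances to the stored points.
    distances = np.linalg.norm(X_train - np.asarray(x_query, dtype=float), axis=1)
    nearest = np.argsort(distances)[:k]          # indices of the k closest points
    labels, counts = np.unique(y_train[nearest], return_counts=True)
    # Majority vote; on a tie, np.argmax picks the smallest label.
    return labels[np.argmax(counts)]

# Toy data: two well-separated clusters labelled 0 and 1.
X = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [5.0, 5.0], [5.0, 6.0], [6.0, 5.0]]
y = [0, 0, 0, 1, 1, 1]

print(knn_predict(X, y, [0.5, 0.5]))  # query near the first cluster
print(knn_predict(X, y, [5.5, 5.5]))  # query near the second cluster
```

Because every prediction scans the full training set, this approach is simple but slows down as the stored dataset grows.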

https://github.com/huzaifa12466/neuralnetscratch

A simple neural network implemented from scratch in Python using NumPy, trained on the Moons dataset

artificial-intelligence artificial-neural-networks backpropagation deep-learning forwardpropagation machine-learning matplotlib neural-network neural-networks numpy

Last synced: 23 Jun 2026

https://github.com/charlenry/python_math_machine_learning

My practical-work notebooks on Python, NumPy, SimPy, SciPy, Matplotlib, Plotly, Seaborn, and mathematics for machine learning

algebra derivatives functions jupyter linear matplotlib matrix numpy plotly probabilities pyplot python pytorch scipy seaborn sklearn statistics sympy tensorflow time

Last synced: 25 Jun 2026

https://github.com/rphadtare/python_by_example

This repository consists of sample programs for learning and understanding core concepts

duckdb numpy pandas python-os python3

Last synced: 27 Jun 2026

https://github.com/khaymanii/credit-card-fraud-detection-model

This model was built using Python and the logistic regression machine learning algorithm

matplotlib numpy pandas python sckit-learn

Last synced: 09 May 2026

https://github.com/shhiivvaam/regression

Linear Regression Model

linear-regression numpy python

Last synced: 10 May 2026

https://github.com/rosieoh/emergency_dataanalysis

Open data analysis: data analysis for policy proposals on improving the emergency medical system

ipython matplotlib numpy pandas python scikit-learn scipy

Last synced: 04 Apr 2026

https://github.com/neetigyab/pdf_form_parser

Older Test version of designated PDFReader project

excel forms matplotlib numpy ocr opencv pandas pdf pdf-converter pdfplumber pytesseract python

Last synced: 17 Apr 2026

https://github.com/noluyorabi/bravais-pearson-interactive

Bravais-Pearson Interactive – A web app for visualizing Pearson correlation. Generate datasets, analyze correlations, and explore regression lines, r², and p-values interactively.

data-visualization interactive-analysis matplotlib numpy pearson-correlation scipy seaborn statistics streamlit

Last synced: 17 Apr 2026

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 04 Apr 2026

https://github.com/mwoss/mownit

Various tasks from numerical analysis laboratories solved using Python

jupyter-notebook mathematics matplotlib numerical-analysis numpy python scipy

Last synced: 17 Apr 2026

https://github.com/datamallo/python101

The resources you need to learn or get better at Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/compcode1/nfl-analysis

The objective of this analysis was to explore potential correlations between player attributes (such as average age and weight across different positions) and key team performance metrics from the 2023 NFL season.

datacleaning datanormalization numpy pandas pearson-correlation-coefficient pvalues

Last synced: 18 Apr 2026

https://github.com/27ahmad/movie-recommendation-system

Welcome to the Movie Recommendation System! This project uses Streamlit to provide personalized movie recommendations based on user preferences and similarity.

movie-recommendation numpy pandas python scikit-learn

Last synced: 04 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql

Solutions to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!

data-analysis data-science deepnote machine-learning numpy pandas python sql

Last synced: 18 Apr 2026

https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation

An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.

clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/abdul-rafay19/california-housing-price-prediction

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/mnitin-reddy/a-b-testing-and-regression-analysis-for-ad-performance-optimization

Analyzed the performance of Facebook and AdWords ads using A/B testing and regression analysis to identify trends, correlations, and cost-effectiveness. Key insights included distribution of clicks and conversions, monthly trends, and cost-per-conversion analysis to optimize ROI.

abtesting data-science hypothesis-testing machine-learning matplotlib numpy pandas scikit-learn scipy seaborn statsmodels

Last synced: 04 Apr 2026

https://github.com/ccolpasm/pageviewanalyzer

PageViewAnalyzer analyzes and visualizes time series data of forum visits (2016-2019). It uses Pandas, Matplotlib, and Seaborn to create line plots, bar charts, and box plots, highlighting trends, monthly averages, and seasonality in page views. Perfect for data exploration with Python.

data-science matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/sultanazhari/decisive-model-with-high-profit-potential-and-low-risk

The OilyGiant mining company is looking for the best locations for 200 new wells. As data scientists, we are creating a model that chooses the best 200 points by profit and risk.

accuracy-score chi2-contingency confusion-matrix logistic-regression matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 18 Apr 2026

https://github.com/ricardorobledo/spamemailclassification

Spam email classification using machine learning (Random Forest, SVC, Logistic Regression, etc.) with data balancing techniques (SMOTE, BorderlineSMOTE, ADASYN). Final calibrated Random Forest model achieves ROC-AUC 0.982 and PR-AUC 0.979 on the Spam Email Classification dataset.

imbalanced-data imbalanced-learning numpy pandas python3 sklearn

Last synced: 05 Apr 2026

https://github.com/dyceszn/i.duruohacos102

First-year deep dive into problem-solving with Python, featuring class practices and projects using NumPy, Pandas, Tkinter, and psycopg2.

numpy psycopg2 python tkinter

Last synced: 18 Apr 2026

https://github.com/ayushmaan-xd/exploratory-spotify-data-analysis

Exploratory Spotify Data Analysis is a project where I analyzed Spotify's music dataset to uncover trends in audio features and song popularity. Using Python and data visualization tools

jupyter-notebook matplotlib numpy pandas python seaborn spotify-dataset

Last synced: 18 Apr 2026

https://github.com/ashithapallath/feature-engineering

This repository contains a range of examples and techniques for feature engineering, aimed at improving dataset quality and boosting model performance. It covers essential methods such as Exploratory Data Analysis (EDA) and Interquartile Range (IQR) analysis for detecting and handling outliers.

exploratory-data-analysis feature-engineering iqr-method matplotlib numpy outlier-detection pandas python seaborn

Last synced: 18 Apr 2026
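The IQR method mentioned in the entry above flags values that fall more than a multiple of the interquartile range outside the first or third quartile. A minimal sketch follows, assuming the conventional multiplier of 1.5; the helper name `iqr_outlier_mask` and the sample data are hypothetical and do not come from the linked repository.

```python
import numpy as np

def iqr_outlier_mask(values, k=1.5):
    """Return a boolean mask that is True where a value is an IQR outlier.

    Hypothetical helper; k=1.5 is the common rule-of-thumb multiplier.
    """
    values = np.asarray(values, dtype=float)
    q1, q3 = np.percentile(values, [25, 75])   # first and third quartiles
    iqr = q3 - q1                              # interquartile range
    lower, upper = q1 - k * iqr, q3 + k * iqr  # fences beyond which values are flagged
    return (values < lower) | (values > upper)

# Made-up sample with one obviously extreme value.
data = np.array([10, 12, 11, 13, 12, 11, 95])
mask = iqr_outlier_mask(data)
cleaned = data[~mask]   # drop the flagged values

print(mask)
print(cleaned)
```

Whether flagged values should be dropped, capped, or kept depends on the dataset; the mask only identifies them.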

https://github.com/alexkhype/04python-carpricepredictor

Machine learning model for predicting the market value of used cars, optimizing accuracy and efficiency.

catboost data-science lightgbm machine-learning numpy pandas python sklearn statistics xgboost

Last synced: 05 Apr 2026

https://github.com/jai0212/ai-virtual-painter

Real-time finger and object painting using advanced computer vision technology with custom hand gestures and customizable colours

artificial-intelligence cmake computer-vision cpp mediapipe numpy opencv python

Last synced: 05 Apr 2026

https://github.com/shdrn2402/incomesresearch

The project aims to study the influence of various characteristics on a person's income. In this project, I practiced data preprocessing and visualization, and also touched on the basics of machine learning.

jupyter-notebook matplotlib numpy pandas python seaborn sklearn visualization

Last synced: 05 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_04

Module 4. Classification and model performance evaluation. Linear regression: overfitting and regularization

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/andres15alvarez/numericalcalculator

Calculator for roots of equations and systems of linear algebraic equations.

numerical-methods numpy python scipy

Last synced: 05 Apr 2026

https://github.com/lorenzorottigni/ml-movies

Machine Learning python bootcamp: Recommender Systems on movies dataset

ipynb machine-learning numpy pandas python recommender-system scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/paulo-santos-ds/visao_computacional

The system uses artificial intelligence to analyze visitors' documents and physical characteristics, ensuring that only people over 18 can access the gaming areas

keras numpy pandas plotly python restnet50 sklearn tensorflow

Last synced: 05 Apr 2026

https://github.com/al-ghaly/prosper-loans-analysis

A statistical analysis project that analyzes a finance company's loan data using Python packages (pandas, NumPy, seaborn, matplotlib)

data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics

Last synced: 18 Apr 2026

https://github.com/ahmaruff/kmeans-img-compress

Simple python image compression library using K-Means clustering

color-quantization image-compression image-processing kmeans kmeans-clustering numpy python

Last synced: 18 Apr 2026

https://github.com/rahuls-1106/dataspark

DataSpark is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-visualization jupyter-notebook matplotlib numpy pandas-dataframe powerbi python seaborn sql

Last synced: 18 Apr 2026

https://github.com/noopur-phadkar/botnavigator

BotNavigator utilizes the A* (A-Star) algorithm for pathfinding, leveraging Python along with libraries such as Matplotlib and NumPy for efficient array operations and visualization. This innovative solution optimizes robotic navigation within warehouse environments, enhancing operational efficiency and scalability.

a-star-path-finding matplotlib numpy pathfinding-algorithms python robotics

Last synced: 18 Apr 2026

https://github.com/md-emranhossen/deep-learning-hub

Welcome to Deep Learning Playground, a personal repository for exploring and experimenting with deep learning concepts, architectures, and techniques. This repo serves as a hands-on space for implementing and understanding deep learning models, from fundamental neural networks to advanced architectures.

cnn-model deep-learning deeplearning gan generative-adversarial-network neaural-network numpy pandas pthon tensorflow

Last synced: 18 Apr 2026

https://github.com/zombietdv/rfl-13_8

Implementation of a genetic algorithm and Q-learning.

genetic-algorithm grid-world machine-learning numpy pygame q-learning

Last synced: 18 Apr 2026

https://github.com/amdjadouxx/my_neural_network

implementation of a neural network library

deep-neural-networks ia machine-learning numpy python3

Last synced: 18 Apr 2026

https://github.com/shortthirdman/distributedlagmodels

Distributed Lag Models in Dynamic Model Time Series

disributed-log-models numpy pandas python statsmodels time-series

Last synced: 19 Apr 2026

https://github.com/codetun/wie-act-challenge-ai

Our AI model, integrated with the Defend Her extension, uses advanced natural language processing (NLP) to predict and identify offensive or harmful language in real time 🧠💬🚫. By continuously analyzing content, it ensures accurate detection of bad words, enhancing the extension's ability to protect users and promote a safer online environment

colab csv jupiter-notebook kaggle nltk numpy pandas

Last synced: 19 Apr 2026

https://github.com/jcardonamde/reto_cobranzas_sector_financiero

This project analyzes data provided by the organization iData, focusing on the collection process of a financial company. The objective is to develop segmentation strategies that enhance portfolio recovery and improve debt management.

data-science kmeans-clustering machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/costopoulos/ntua-dsp

📶 NTUA ECE Digital Signal Processing Course Source Codes and Reports

dsp filters fourier-transform librosa numpy pywt scipy short-time-signal-analysis stft

Last synced: 19 Apr 2026

https://github.com/crazy-dot/zomato-data-analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

data-cleaning-and-preprocessing data-manipulation-with-pandas inferential-statistics kaggle-dataset numpy pandas-python python zomato-data-analysis

Last synced: 19 Apr 2026

https://github.com/emerald-geomodelling/downfile

A safe & version-independent pickle alternative that integrates setuptools entry points for extensions

feather json numpy pandas python serialization

Last synced: 19 Apr 2026

https://github.com/gemmus/tictactoe-ai

Built with minimax algorithm, using pygame.

minimax-algorithm numpy oop pygame python tic-tac-toe

Last synced: 20 Apr 2026

https://github.com/montanaz0r/suicide-rate-analysis

Testing the significance of the correlation between the suicide rate and the number of psychiatrists and psychologists working in the mental health sector

analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate

Last synced: 20 Apr 2026

https://github.com/saurabh9136/data-analysis_using_pandas-numpy

A beginner-friendly repository exploring data analysis using NumPy and Pandas. Covers fundamental operations, data manipulation, and real-world dataset analysis.

data-analysis-python numpy pandas python scipy

Last synced: 20 Apr 2026

https://github.com/jbalooshie/school_district_analysis

Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.

data-analysis data-science dataframes jupyter-notebook numpy pandas python

Last synced: 20 Apr 2026

https://github.com/vtorres09/projetomonitoriaip

📋 Data analysis of a dataset created to select students as teaching assistants (Introduction to Programming).

numpy object-oriented-programming pandas python

Last synced: 20 Apr 2026

https://github.com/mkhekare/bikesharing_ml

Provides insights into the bike sharing trends based on various factors such as time of day, season, and weather. The findings can be used to improve bike sharing services and understand user behavior.

matplotlib numpy numpy-library pandas seaborn

Last synced: 20 Apr 2026

https://github.com/prashhhant213/data_analysis_and_visualization-_for_streaming_platform

Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.

colab-notebook data datavisualization matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/venkat-0706/air-drawing

A real-time hand gesture drawing app using Python, OpenCV, and MediaPipe to draw, erase, and switch colors on a canvas using finger gestures via webcam.

computer-vision hand-gesture-recognition mediapipe mediapipe-hands numpy opencv-python python realtime-object-tracker

Last synced: 20 Apr 2026

https://github.com/souvikmajumder26/neural-network-from-scratch

🚀 Building a simple Neural Network to classify the images of the Fashion-MNIST dataset, using only numpy and no TensorFlow, Keras, or PyTorch.

config fashion-mnist logger neural-network neural-network-from-scratch notebook-jupyter numpy project python

Last synced: 21 Apr 2026

https://github.com/aleklukanen/systemsofsprings

This repository contains a few helpful pieces of Python code that can be used to compute time-dependent positions of springs in 2 and 3 dimensions.

finite-element-methods finite-elements jupyter-notebook numerical numpy python

Last synced: 21 Apr 2026

https://github.com/carmoreno/aviones-aldeas

✈️ Final project of the Complexity and Optimisation course 🏡

algorithms numpy pyqt4 scipy

Last synced: 21 Apr 2026

https://github.com/mohammed-majid/image-classification-nn

Image classification using custom Neural Network (No ML packages)

custom multiclass-classification neural-network numpy

Last synced: 22 Apr 2026