An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/dpgitaccount/project---hospital-readmission-analysis

The goal of this project is to build a predictive model to estimate the likelihood of a hospital readmission based on patient data. By identifying factors that contribute to readmissions, hospitals can optimize care and reduce costs associated with repeated visits.

boxplot confusion-matrix datamodeling exploratory-data-analysis heatmap histplot numpy pandas plotly python random-forest seaborn smote-sampling visualization

Last synced: 16 Apr 2026

https://github.com/codersales/machine-learning-classification

machine learning jupyter notebooks | data-science | priority | relevant | significant | green-light | 1 | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4 | frequent

authorized classification current decision-tree ensemble-techniques jupyter machine-learning more-than-100-commits more-than-300-commits numpy pandas python3 ranked repository-5 seaborn sklearn stacking sub-critical supervised-learning workstation

Last synced: 07 Mar 2026

https://github.com/lorenzorottigni/ml-advertising

Machine Learning python bootcamp: logistic regression on advertising dataset

ipynb logistic-regression machine-learning numpy pandas python scikit-learn seaborn

Last synced: 16 Apr 2026

https://github.com/5hraddha/sda-megaline-the-best-plan

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget.

data-visualization hypothesis-testing matplotlib numpy pandas scipy seaborn statistical-data-analysis

Last synced: 16 Apr 2026

https://github.com/thiti-dev/attendance-management-system

A quick casual project just only for a demonstration in the discrete mathematics course

numpy opencv pandas pillow tkinter

Last synced: 16 Apr 2026

https://github.com/shiqilu77/machinlearning-lol_gaming

a supervised model that predicts the match winning rate in LOL game based on the lineups of both sides. Implemented data processing by multiple python API including sklearn, NumPy, pandas, and by feature encoding, kNN, Synthetic Minority Oversampling Technique, etc

league-of-legends machine-learning numpy supervised-learning

Last synced: 16 Apr 2026

https://github.com/sahiltiwariiii/dssp

Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 27 Mar 2026

https://github.com/laurencewarne/eratosthenes

Random implementations of the sieve of Eratosthenes

algorithm eratosthenes numpy python3

Last synced: 16 Apr 2026

https://github.com/littie28/hazy-frames

Hierarchical coordinate frames - crystal clear transforms

3d coordinate-systems geometry numpy python reference-frames scipy transformations

Last synced: 18 Apr 2026

https://github.com/jay6430/python-for-data-science

This Repository includes all the Python concepts that are necessary for Data science.

csv data-visualization datascience-machinelearning html-css ipynb-jupyter-notebook matplotlib numpy pandas python3 xml

Last synced: 04 Mar 2026

https://github.com/priyanshul28/ml_eda_regression_energyconsumptionforecasting

An EDA and Machine Learning Time-Series Regression Forecasting exercise on the PMJE Energy Consumption dataset demonstrating time-series analysis and the use of Time-Series Split, XGBoost, etc. The model is optimized using hyperparameter tuning through GridSearchCV. A Rob Mulla guided exercise.

forcasting machine-learning numpy pandas scikit-learn time-series-analysis

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npchessimage

Creates a chessboard pattern as an image.

chessboard cv2 image numpy

Last synced: 05 Mar 2026

https://github.com/gautam-cs/data_science

In this project, I was trying to get most of the relevent information of the data using library like numpy, pandas. I have worked on data like aadhar data, Olympics data

aadhar-data csv data-science json numpy olympics-data pandas python35

Last synced: 17 Apr 2026

https://github.com/kheriberto/knn_project

This is a simple project that uses dummie data to practice and demonstrate my knowledge of the KNN algorithm.

data-analysis knn-classifier numpy python scikit-learn seaborn

Last synced: 02 Apr 2026

https://github.com/nathadriele/ifood-data-governance-pipeline

Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/akshitvats026/heart_disease_prediction

An ML-based Heart Disease Prediction System that predicts the likelihood of heart disease based on user health parameters. Built using Python, Pandas, and Scikit-learn, the system performs data preprocessing, trains a predictive model, and provides real-time predictions with high accuracy.

accuracy-score logistic-regression machine-learning matplotlib-pyplot numpy pandas python scikit-learn

Last synced: 02 Apr 2026

https://github.com/amanimagdi/data-flux-fullstack

This repository is for a full stack application provides in-depth analysis and manipulation of various data types (tabular data, RGB images and textual data), Flask on BE and ReactJs on FE.

flask javascript numpy pandas python reactjs tailwindcss

Last synced: 02 Apr 2026

https://github.com/davityak03/basics-of-python-

In this we have done multiple programs in python so as to develop the basic skill sets in python

array dictionary lists matrix numpy pandas programming python sets

Last synced: 02 Apr 2026

https://github.com/niteshchawla/clustering-ml

Analyzing the vast data of learners can uncover patterns in their professional backgrounds and preferences. Allowing Scaler to make tailored content recommendations and provide specialized mentorship.

cluster-analysis clustering hierarchical-clustering k-means-clustering machine-learning numpy pca-analysis visualisation

Last synced: 17 Apr 2026

https://github.com/ledsouza/curso_de_estatistica_parte_4

Projeto de estatística para estudo de correlação e regressão

data-science data-visualization numpy pandas seaborn statsmodels vitrinedev

Last synced: 17 Apr 2026

https://github.com/kiarashyavari/python_interview

Practice intermediate to advance topics of Python here

decorators generator multiprocessing multithreading numpy pandas python python3

Last synced: 17 Apr 2026

https://github.com/rakibhhridoy/predictingsequence-timeseriesanalysis

Time series data easy handling purposes in TensorFlow and python functional programming. Time Series data is handled and pre-processed by different steps,like making window of the data,splitting data for model purpose is also different than other processing methods. All has combined for easy access.

forecasting functional-programming keras lstm lstm-neural-networks moving-average numpy python tensorflow time-series windowed

Last synced: 02 Apr 2026

https://github.com/nikhilgugwad/sentiment-analysis

Sentiment analysis for the Kannada language to classify Kannada sentences into different emotions.

numpy pandas scikit-learn

Last synced: 17 Apr 2026

https://github.com/christiemj/clock-drift-simulation

A Monte Carlo simulation in Python to model vacuum-induced frequency drifts in atomic clocks, with code and visuals (experimental setup, simulated vs. real-world comparison, and drift graph), as described in "Probing Vacuum-Induced Clock Drifts via Quantum Metrology: A Testable Hypothesis" (DOI: 10.5281/zenodo.15163879). Licensed under GPL 3.0.

allan-variance atomic-clocks cesium-ion clock-drift matplotlib monte-carlo-simulation numpy open-science pandas physics precision-metrology python quantum-metrology research-software scipy simulation vacuum-fluctuations vacuum-noise

Last synced: 17 Apr 2026

https://github.com/nikhilsree5/aerofitcasestudy

Customer Profiling and Market Segmentation for AeroFit Treadmills: A Data-Driven Approach

customerprofile eda numpy pandas python visualization

Last synced: 17 Apr 2026

https://github.com/theakashshukla/diabetes-prediction

Diabetes Prediction using futuristic tool

ai jupiter-notebook ml numpy pandas sklearn

Last synced: 17 Apr 2026

https://github.com/hansalemaos/npfastsortcpp

Sorts int/float 4x faster than numpy by using parallel_sort from C++ (Windows only)

cpp fast numpy python sort

Last synced: 17 Apr 2026

https://github.com/kaushik-puttaswamy/exploratory-data-analysis-using-python

This project applies Exploratory Data Analysis (EDA) using visualizations like box plots, scatter plots, and correlation matrices. It helps identify patterns, select ideal functions via least squares error, and map test data while evaluating deviations.

bokeh expolatory-data-analysis matplotlib numpy pandas python seaborn sqlalchemy

Last synced: 17 Apr 2026

https://github.com/smartpottech/smartpot-datagenerator

SmartPot-DataGenerator simulates environmental and plant data (temperature, humidity, luminosity, pH, TDS, soil moisture) for hydroponic garden management. This data is used for testing, analysis, and visualization in the SmartPot web portal, helping optimize plant care without actual hardware sensors.

data-model matplotlib numpy python simulated

Last synced: 17 Apr 2026

https://github.com/jjwizardmp/a-start-search-python

Python project where the 'A Start Search' algorithm and a simulation of the agent's path with animations were implemented.

cv2 ipython iterto jupyter-lab jupyter-notebook mathplotlib numpy pip python python-env python3

Last synced: 04 Apr 2026

https://github.com/datamallo/python101

The ressources you need to learn or get better in Python. Covering the basics of Python3, NumPy and Pandas for

numpy pandas python3

Last synced: 18 Apr 2026

https://github.com/jass024/bitcoin-historical-analysis

Performed comprehensive time-series analysis on 14+ years of Bitcoin historical data using Python (Pandas, NumPy, Plotly). Calculated key technical indicators (SMA, EMA, RSI, volatility) to identify market trends, potential anomalies, and areas of regulatory concern. Visualized results using interactive Plotly charts.

numpy pandas python-3

Last synced: 18 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/kasraskari/python-for-data-science

Foundational concepts and libraries for data science with Python.

data-science jupyter-notebook matplotlib numpy pandas python scipy

Last synced: 04 Apr 2026

https://github.com/abdul-rafay19/california-housing-price-prediction

This project predicts California housing prices using machine learning regression models, including Random Forests and Decision Trees. It covers data preprocessing, exploratory analysis, model training, and hyperparameter tuning to optimize performance.

decision-trees gridsearchcv linear-regression matplotlib numpy pandas python random-forest randomsearch-cv scikit-learn scipy seaborn

Last synced: 04 Apr 2026

https://github.com/yasinefeee/parkspotter_pretest_enviroment

The ParkSpotter project is designed to detect the occupancy status of parking spots in a simulation environment. Using a toy model, a camera system, and a machine learning model, this system identifies whether a parking space is EMPTY or NOT EMPTY in real-time.

ai-systems classifier computer-vision cv2 numpy opencv parking-spot-detection parking-spots python sckit-image sckit-learn simulation-environment svm svm-classifier

Last synced: 18 Apr 2026

https://github.com/sultanazhari/decisive-model-with-high-profit-potential-and-low-risk

OilyGiant mining company finding the best place for 200 new well points, As an Data Scientist we're creating a model who can choose the best 200 point by profit and risk.

accuracy-score chi2-contingency confusion-matrix logistic-regression matplotlib-pyplot numpy pandas python3 random-forest roc-auc-score seaborn train-test-using-sklearn

Last synced: 18 Apr 2026

https://github.com/hariprasath-v/machinehack-analytics-olympiad-2022

Create a machine learning model to help an insurance company understand which claims are worth rejecting and the claims which should be accepted for reimbursement.

catboost-classifier exploratory-data-analysis logloss machinehack numpy optuna pandas python scikit-learn shap

Last synced: 18 Apr 2026

https://github.com/ricardorobledo/spamemailclassification

Spam email classification using machine learning (Random Forest, SVC, Logistic Regression, etc.) with data balancing techniques (SMOTE, BorderlineSMOTE, ADASYN). Final calibrated Random Forest model achieves ROC-AUC 0.982 and PR-AUC 0.979 on the Spam Email Classification dataset.

imbalanced-data imbalanced-learning numpy pandas python3 sklearn

Last synced: 05 Apr 2026

https://github.com/stffns/snapvec

Fast compressed ANN search via randomized Hadamard transform + Lloyd-Max quantization. Pure NumPy.

ann embeddings hadamard numpy quantization rag vector-search

Last synced: 07 Apr 2026

https://github.com/ayushmaan-xd/exploratory-spotify-data-analysis

Exploratory Spotify Data Analysis is a project where I analyzed Spotify’s music dataset to uncover trends in audio features and song popularity. Using Python and data visualization tools

jupyter-notebook matplotlib numpy pandas python seaborn spotify-dataset

Last synced: 18 Apr 2026

https://github.com/ashithapallath/feature-engineering

This repository contains a range of examples and techniques for feature engineering, aimed at improving dataset quality and boosting model performance. It covers essential methods such as Exploratory Data Analysis (EDA) and Interquartile Range (IQR) analysis for detecting and handling outliers.

exploratory-data-analysis feature-engineering iqr-method matplotlib numpy outlier-detection pandas python seaborn

Last synced: 18 Apr 2026

https://github.com/jigyasag18/iit-guhawati-final-capstone-project

Smart Dynamic Parking Price Optimization System that adjusts parking fees in real-time based on demand, traffic, and competition. It employs adaptive pricing models and rerouting logic to enhance parking utilization and reduce congestion. The system is visualized via an interactive Streamlit dashboard, enabling users to simulate dynamic pricing.

bokeh bokeh-server bokehplots capstone-project data dataset deployment machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot mlproject normalisation numpy pandas pathway python streamlit

Last synced: 05 Apr 2026

https://github.com/lexxai/goit_python_ds_hw_04

Модуль 4. Класифікація та оцінка роботи моделі. Лінійна регресія: перенавчання та регуляризація

lasso-regression linear-regression numpy pandas python red regression ridge-regression scikit-learn

Last synced: 05 Apr 2026

https://github.com/lorenzorottigni/ml-movies

Machine Learning python bootcamp: Recommender Systems on movies dataset

ipynb machine-learning numpy pandas python recommender-system scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/ahmaruff/kmeans-img-compress

Simple python image compression library using K-Means clustering

color-quantization image-compression image-processing kmeans kmeans-clustering numpy python

Last synced: 18 Apr 2026

https://github.com/matiassenia/galaxypy

La aplicación carga una imagen en escala de grises y utiliza los puntos de menor luminosidad (más oscuros) para dibujar líneas de colores que crean una estructura parecida a una galaxia.

numpy python

Last synced: 18 Apr 2026

https://github.com/noopur-phadkar/botnavigator

BotNavigator utilizes the A* (A-Star) algorithm for pathfinding, leveraging Python along with libraries such as Matplotlib and Numpy for efficient array operations and visualization. This innovative solution optimizes robotic navigation within warehouse environments, enhancing operational efficiency and scalability.

a-star-path-finding matplotlib numpy pathfinding-algorithms python robotics

Last synced: 18 Apr 2026

https://github.com/autokmg/ai-foundations

Practice exercises related to the Foundations of Artificial Intelligence university course. The code examples are designed to help students gain a deeper understanding of these topics and to prepare for exams and assignments. The repository is open to contributions from other students.

numpy python

Last synced: 18 Apr 2026

https://github.com/shortthirdman/distributedlagmodels

Distributed Lag Models in Dynamic Model Time Series

disributed-log-models numpy pandas python statsmodels time-series

Last synced: 19 Apr 2026

https://github.com/simonusher/multi-layer-perceptron

My solution to an assignment on multi-layer perceptron. Implemented in python and numpy. Contains the forward and backward pass code, as well as some utility functions.

multi-layer-perceptron neural-network numpy python

Last synced: 19 Apr 2026

https://github.com/bharath-tars/modelml

Model for Diabetes Prediction using Machine Learning

docpilot flask heroku matplotlib-pyplot numpy pandas seaborn sklearn smtp

Last synced: 19 Apr 2026

https://github.com/renukadhule/airbnb_listings_and_reviews_python_analysis

Explore Airbnb listings and reviews from over 250,000 properties across 10 major cities. Dive into data on listing details, host information, pricing, and customer reviews for better insights

matplotlib-pyplot numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/vyjayanthipolapragada/car_mileage_prediction

Predicting the mileage of car using the linear regression model with Scikit-learn

kaggle-titanic linear-regression machine-learning numpy pandas predictive-modeling python scikit-learn

Last synced: 20 Apr 2026

https://github.com/vtorres09/projetomonitoriaip

📋 Data analysis of a dataset created to select students to be teacher assistant (Introduction to Programming).

numpy object-oriented-programming pandas python

Last synced: 20 Apr 2026

https://github.com/aminnj/numpy.jl

Simple numpy snippets implemented in julia

joke julia maybenot numpy python

Last synced: 20 Apr 2026

https://github.com/danpoynor/pet-shelter-data-analysis-notebook

Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.

csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data

Last synced: 21 Apr 2026

https://github.com/refraction-ray/numkl

A thin cython/python wrapper on some routines from Intel MKL

cython-wrapper lapack mkl numpy

Last synced: 21 Apr 2026

https://github.com/ahmad-ali-rafique/numpy-mastery-notebook

Inside this repository, you'll discover a comprehensive notebook dedicated to showcasing various NumPy array methods and operations. From basic array manipulation to advanced techniques, I've compiled a collection of examples and explanations to help both beginners and seasoned Python developers deepen their understanding of NumPy.

dataanalytics numpy numpy-arrays numpy-exercises numpy-library numpy-python

Last synced: 24 Apr 2026

https://github.com/haranrk/linear-regression

Implemented Linear Regression and some of it's variants

machine-learning matplotlib numpy python python3

Last synced: 24 Apr 2026

https://github.com/dpb24/datakind-2025

📊 Data Analytics: Identifying Actionable Insights to Improve Financial Inclusion in Kenya

data-analytics data-visualization databricks datakind exploratory-data-analysis financial-data geopandas jupyter-notebook kenya matplotlib numpy python seaborn

Last synced: 24 Apr 2026

https://github.com/takana671/texturegenerator

Procedually generate texture images from noise.

cloud cubemap cv2 cython noise noise-textures numpy python skybox

Last synced: 25 Apr 2026

https://github.com/timothyjan/cartoonify-image-opencv

Application that transforms images into its cartoon using Python, OpenCV(cv2), easyguy, numpy, imageio, and tkinter.

easygui imageio numpy opencv2 tkinter

Last synced: 25 Apr 2026

https://github.com/prathush-kumar/fake_news_detection

Developed an intelligent system to identify and classify fake news articles using Natural Language Processing (NLP) and Machine Learning techniques.

classification machine-learning natural-language-processing numpy pandas sklearn-library

Last synced: 26 Apr 2026

https://github.com/josttim/timelined_array

Manage easily 1 or multidimensionnal samples numpy arrays that are time related. Extends numpy without removing any of it's abilities on such arrays.

numpy python time-series-analysis

Last synced: 27 Apr 2026

https://github.com/nyx1311/timelstm

TimeLSTM: An interactive Streamlit app for multi-step time series forecasting using LSTM networks, featuring data preprocessing, visualization, GPU-accelerated model training, and automated result export.

deep-learning deep-neural-networks lstm-neural-networks numpy pandas ploty python3 scikit-learn-python statsmodels streamlit torch tqdm

Last synced: 27 Apr 2026

https://github.com/crispengari/netflix-statistics

This is the visualisation of two categories of netflix movies which are: * movies * tv shows

data-science datafram jupyter-notebook matplotlib numpy pandas python

Last synced: 27 Apr 2026

https://github.com/saptarshi11/diamlp---diabetes-classification-with-multilayer-perceptron

Diabetes Classification using Multilayer Perceptron (MLP) is a binary classification problem where the goal is to predict whether a patient has diabetes (positive class) or not (negative class) based on a set of biomedical features or risk factors.

artificial-neural-networks binaryclassification machine-learning mlp-classifier numpy seaborn

Last synced: 27 Apr 2026

https://github.com/erikbavenstrand/neural-network-implementation

An Artificial Neural Network Implementation in Numpy to work with MNIST dataset.

ai feedforward-neural-network machine-learning mnist neural-network numpy python

Last synced: 27 Apr 2026

https://github.com/vasukalariya/neural-network-from-scratch

It's a simple Feedforward Neural Network implemented just using Numpy

artificial-neural-networks classification mnist neural-network numpy

Last synced: 28 Apr 2026

https://github.com/binoydipu/ml-libraries

Machine Learning Libraries Repository

machine-learning numpy

Last synced: 28 Apr 2026

https://github.com/hai4320/ml_ai_notebook

All my note about ML, AI and Data Science

ai machine-learning numpy pandas scikit-learn

Last synced: 28 Apr 2026

https://github.com/jovicdev97/financial-loan-datascience-notebook

using numpy and pandas to analyze a synthetic loan dataset with python

data-analysis matlabplot numpy pandas plotting python seaborn

Last synced: 28 Apr 2026

https://github.com/abdeldjalilchafai/us-flight-delay-eda

Structured EDA on 2015 US flight delay data. Clean, reproducible notebook using a 6-step data analysis framework for real-world datasets.

data-analysis data-cleaning eda exploratory-data-analysis flight-delays kaggle matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/iv4n-ga6l/crowd-move-simulation

Crowd move simulation in python

matplotlib numpy python simulation

Last synced: 28 Apr 2026