An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba

First assignment for the course Data Mining @CSE.UOI

data-analysis data-science numpy scipy seaborn statistics

Last synced: 01 May 2026

https://github.com/mattdelaune/saas_cohort_analysis

This project analyzes customer segmentation and behavior using data science and cohort analysis. Key metrics like CRR, NRR, CLR, and CLV are examined through detailed charts, including the cohort layer cake and CLR vs. CLV cost efficiency analysis. Exploratory Data Analysis and systematic data manipulation reveal actionable insights.

anaconda git jupyter-notebook matplotlib numpy pandas python saas-data sales-data sales-data-analysis-project seaborn

Last synced: 03 Jan 2026

https://github.com/khaymanii/big_mart_prediction_model

This model was built using Python and the logistic regression algorithm.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 01 May 2026

https://github.com/macagua/entrenamiento.data_scientist_python

Repository of manuals and resources from the "Data Scientist en Python" training course given by Leonardo J. Caballero G.

data-analytics data-scientist data-visualization numpy pandas-dataframe python37 streamlit

Last synced: 01 May 2026

https://github.com/joshi-jyoti/heart-disease-prediction

This repository contains a Python-based project for predicting the likelihood of heart disease using a Logistic Regression machine learning model. It leverages a dataset of patient medical information to train and evaluate the model, providing insights into potential diagnoses. 🩺

heart-disease-prediction heart-disease-predictor kaggle-dataset machine-learning numpy pandas python scikit-learn

Last synced: 01 May 2026

https://github.com/cchandre/rg

Renormalization for the break-up of invariant tori in Hamiltonian flows

expm hamiltonian hamiltonian-dynamics invariant-tori kolmogorov-arnold-moser numpy python3 renormalization-group scipy

Last synced: 01 May 2026

https://github.com/gauravsakure02/pubg_win_prediction

Predicts PUBG match wins by analyzing in-game features such as player kills, distance traveled, and weapons acquired, building a model that forecasts the likelihood of winning a match based on player performance.

matplotlib numpy pandas sklearn

Last synced: 29 Apr 2026

https://github.com/rvats20/python-programming-basic

Python programming basics for developing robust Python applications.

data-science eda learning learning-by-doing matplotlib-python numpy pandas python

Last synced: 02 May 2026

https://github.com/epsoft/ner

NER

ner numpy pandas

Last synced: 04 May 2026

https://github.com/rakibhhridoy/machinelearning-featureselection

Before training a model, the first priority is the data, not the model. The more the data is preprocessed and engineered, the more the model will learn. Feature selection is one of the methods for processing data before feeding it to the model. Various feature selection techniques are shown here.

extratreesclassifier feature-selection gridsearchcv lasso-regression logistic-regression machine-learning numpy pandas pca rfe rfecv scikit-learn selectkbest

Last synced: 02 May 2026

https://github.com/zazi2002/machine-learning-project

Introduction to Machine Learning project with the goal of improving the classification performance on a dataset by optimizing the number of features and weak learners.

dimentionality-reduction ensemble-learning numpy pca random-forest scikit-learn

Last synced: 02 May 2026

https://github.com/cosama/vasapy

Vectorized Associative Arrays for Python

associative-array associative-map dictionary numpy python vectorized

Last synced: 16 Mar 2026

https://github.com/realtristan/micronet

Neural network library from scratch

neural-networks numpy python pytorch

Last synced: 03 May 2026

https://github.com/surtecha/multi-physics-simulation-engine

A real-world physics simulation application built with Python

jax numpy physics pygame python scipy simulation

Last synced: 14 Apr 2025

https://github.com/thamirisq/data_analysis_culinary_journey

Project to analyze key information from Culinary Journey, a marketplace company, and develop an initial strategy to lead the entire team effectively.

folium-maps groupby-method jupyter-notebook markercluster marketplace-ds matplotlib-pyplot numpy pandas pandas-dataframe streamlit

Last synced: 10 Apr 2026

https://github.com/achalesh27022003/python-assignments-iwt

This repository is intended to hold all the assignments and projects related to IWT (Semester 3).

cosinewave frame grid image imagetk labels matplotlib-pyplot numpy numpy-arrays numpy-library photoimage pil python3 sinewave spyder tkinter tkinter-gui window

Last synced: 10 Apr 2026

https://github.com/garcane/income-prediction-ml

This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.

data data-science machine-learning ml numpy pandas python random-forest scikit-learn

Last synced: 08 Apr 2026

https://github.com/haidermalikk/neural-network-circuit-optimizer

The Neural Network Circuit Optimizer, built on my custom digital circuit simulator, uses AI and machine learning to optimize simple digital circuits. It leverages neural networks, circuit simulation, and optimization algorithms to reduce component count, which improves power usage, execution time, and efficiency. *Under development; see the README.*

ai jupyter-notebook matplotlib ml numpy pandas python pytorch sqlalchemy tensorflow

Last synced: 10 Apr 2026

https://github.com/muralisuvvani/airbnb-exploratory-analysis

Exploratory Data Analysis (EDA) on the Airbnb NYC dataset to uncover trends, patterns, and insights. Data cleaning and visualization of Airbnb listings in New York City. Analyzing the Airbnb NYC 2019 dataset using Python (Pandas, NumPy, Matplotlib, Seaborn).

analytics data-visualization exploratory-data-analysis numpy pandas python3

Last synced: 04 May 2026

https://github.com/musfiquejim/enacp-a-hybrid-machine-learning-framework-for-detecting-anticancer-peptides

EnACP: an ensemble learning model used for detecting anticancer peptides.

adamoptimizer ai cnn-classification data-visualization deep-learning densenet inceptionv3 machine-learning matplotlib-pyplot numpy pandas randomforestclassifier randomforestregressor restnet sklearn u-net-pytorch vgg16 vgg19

Last synced: 04 May 2026

https://github.com/tasninanika/k-means-clustering

An interactive and insightful customer segmentation project using K-Means Clustering.

matplotlib numpy pandas plotly python3 scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/harmanveer-2546/supply-chain

Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.

customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis

Last synced: 10 Apr 2026

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project that aims to develop a classification model for assigning a given customer profile to one of two risk categories (safe or not safe). The final classifier used for this project is the CatBoost classifier. Deployed on AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026

https://github.com/aahad91/idv

Interactive Data Visualization - Master Course

matplotlib numpy pandas plotly python3

Last synced: 08 May 2026

https://github.com/megha060199/academic-erp

Python, SQL, JavaScript

mysql numpy python

Last synced: 04 May 2026

https://github.com/michaelfeil/lstm-dlnn2020

Character-LSTM implementation in NumPy

lstm nlp-machine-learning numpy

Last synced: 14 May 2026

https://github.com/h3ar7b3a7/exploringjupyterlab

This is just an exploration in using Jupyter Lab, Notebook and some useful Python libraries. Some basic commands, markups and extensions setup are documented here too.

bokeh extensions jupyterlab matplotlib notebook-jupyter numpy pandas

Last synced: 04 May 2026

https://github.com/harmanveer-2546/predicting-customer-churn-for-a-telecom-company

Customer churn is a significant issue for large companies. Companies, particularly in telecom, are trying to develop methods for predicting customer churn because it directly affects revenue.

ann autoencoder churn-prediction customer-churn-prediction encoding mlp model-building-and-evaluation numpy os pandas standardization

Last synced: 04 May 2026

https://github.com/prashver/customer-segmentation

This project utilizes unsupervised machine learning to segment bank customers for targeted marketing campaigns. It covers tasks like data exploration, determining optimal clusters, and applying k-means for segmentation. Ideal for marketing departments in banking and retail industries.

customer-segmentation matplotlib numpy pandas seaborn sklearn

Last synced: 07 Apr 2026

https://github.com/avijit-jana/used-car-price-prediction

A Streamlit-based application utilizing machine learning to predict used car prices based on features like make, model, year, fuel type, and transmission, enhancing customer experience and streamlining the pricing process.

carpriceprediction data-science jupyter-notebooks machine-learning numpy pandas pickle pricing-data-science python3 streamlit streamlit-webapp webapplication

Last synced: 11 Apr 2026

https://github.com/zion-dev1/e-commerce-analysis

Short machine learning project which analyses customer behaviour 🔎

analysis clustering ecommerce machine-learning numpy pandas sklearn

Last synced: 05 Jul 2025

https://github.com/tasninanika/k-nearest_neighbors

K-Nearest Neighbors (KNN) is a simple and powerful machine learning algorithm used for classification and regression.

knn matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/petitgrizzlies/metaheuristiques

Practical assignments in metaheuristics. Unige 2016-2017

cffi cpp golang heuristic-algorithm matplotlib numpy python35

Last synced: 13 Apr 2026

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Here I will post visualization work done with both Python libraries.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/guidorice/hab-detection-wyvern-hsi

Harmful Algal Bloom (HAB) Detection from Wyvern hyperspectral imagery

gdal geopandas gis hyperspectral jupyter numpy python rasterio

Last synced: 05 May 2026

https://github.com/tasninanika/mammographic-masses-analysis-dt

This project uses a Decision Tree Classifier to predict whether a detected mammographic mass is benign (0) or malignant (1) based on input features.

decision-tree-classifier numpy pandas pyhton3 scikit-learn

Last synced: 11 Apr 2026

https://github.com/Tritonix711/FractureAI

This tool helps people upload X-rays to find broken bones. It uses a machine to mark where the breaks are and gives users marked pictures to download. A smart computer also helps people understand their broken bones and gives them advice.

css cv2 flask gorq html javascript matplotlib npm numpy pandas pydantic python react scikit-learn torch torchvision ultralytics

Last synced: 19 Aug 2025

https://github.com/kiritoind/neural-network-for-cat-image-classification

This script defines essential functions for constructing a deep neural network from scratch for image classification. It includes activation functions like sigmoid and ReLU, data loading from H5 files, parameter initialization, forward and backward propagation for training, gradient descent for parameter updates, and functions for prediction

classification computer-vision deep-learning deep-neural-networks jupyter-notebook numpy python

Last synced: 05 May 2026

https://github.com/singh-dhruv/traffic-light-detection-and-recognition

Welcome to this repository! Here we explore traffic light detection and recognition to help autonomous vehicles interpret traffic lights and make decisions while driving.

autonomous-driving computer-vision matplotlib-pyplot numpy python r-cnn tensorflow

Last synced: 05 May 2026

https://github.com/williyam-m/movie-recommendation-system

Developed a web app with a cosine similarity machine learning model for personalized recommendations based on user history, likes, bookmarks, and activity. Implemented user auth and CRUD operations for movies.

django machine-learning numpy pandas prediction-model python scikit-learn

Last synced: 10 Apr 2026

https://github.com/sahajrajmalla/co2-calculator-api

A Web Application backend API that enables people to create events, join events, and measure the carbon footprint produced by each participant to get insights about online and onsite carbon emissions.

apis carbon-emissions carbon-footprint fastapi makefile numpy pandas precommit sqlalchemy sqlite3 uvicorn

Last synced: 09 Apr 2026

https://github.com/datarohit/fifa-2020--data-analysis

This dataset is from Kaggle.com and contains data on 18,000+ FIFA players, with more than 100 features about them for analysis. A simple analysis is performed on this dataset.

exploratory-data-analysis exploratory-data-visualizations matplotlib-pyplot numpy pandas seaborn

Last synced: 10 Apr 2026

https://github.com/coqui123/tradegpt

TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FastAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹

analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript

Last synced: 11 Apr 2026

https://github.com/ola-kaznowska/ai-digit-recognition

AI digit recognition. Artificial neural network for digit recognition

ai artificial-intelligence machine-learning neural-network numpy tensorflow

Last synced: 14 Apr 2026

https://github.com/nissanjk/linear-solver

A Python desktop application to solve systems of linear equations using various numerical methods.

anaconda3 fpdf imageio matplotlib numpy python3 tkinter

Last synced: 16 May 2026

https://github.com/muhammad-fiaz/gpt

A simple implementation based on the "Attention is All You Need" paper, using GPT-2 for text generation.

attention-is-all-you-need gpt gpt-2 gpt-3 gpt-implementation gpt-using-pytorch gpt2 numpy open-source paper-implementations python pytorch pytorch-implementation

Last synced: 13 Apr 2026

https://github.com/sarthak-1408/rain-fall-prediction

This repository contains an end-to-end machine learning project (rainfall prediction in Australia).

heroku heroku-deployment machine-learning numpy pandas rain-fall rain-fall-prediction scikit-learn xgboost-algorithm

Last synced: 05 May 2026

https://github.com/hamdaniqhmqd/project-predict-saham-bbri

Repository Project-Predict-Saham-BBRI is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.

data-visualization numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/shanmukhsrisaivedullapalli/smsspamclassification

SMSSpamClassification is a machine learning project aimed at accurately classifying SMS messages as either spam or ham (non-spam). It employs natural language processing techniques to extract relevant features from the text data and utilizes various classification algorithms to build a robust spam detection model.

jupyter-notebook numpy pandas pickle python3 sklearn spam-classification spam-detection

Last synced: 11 Apr 2026

https://github.com/shanmukhsrisaivedullapalli/cifar10

This project uses TensorFlow to classify images from the CIFAR-10 dataset. It compares the performance of an Artificial Neural Network (ANN) and a Convolutional Neural Network (CNN), covering data preprocessing, model training, evaluation, and prediction on new images.

cifar10-classification cnn-classification matplotlib neural-networks numpy pickle pillow python3 tensorflow

Last synced: 03 Feb 2026

https://github.com/dulajkavinda/live-sketching

✍️ Converting a live video stream from a webcam into a sketch using OpenCV and Keras

added image-filtering keras machinelearning matplotlib numpy opencv python

Last synced: 10 Apr 2026

https://github.com/zensam7/easy_my_ai

A library for building AI, written ENTIRELY from scratch

genetic-algorithm library machine-learning neural-networks numpy q-learning

Last synced: 14 Feb 2026

https://github.com/anvesham/machinelearning_models_using_matlab

Stepwise Multiple Linear Regression (With Interactions) and Random Forest Regression on predicting the Productivity of the Garment Factory Workers

bagging-ensemble bayesian-optimization decision-trees jupyter-notebook machine-learning matlab multiple-linear-regression-model numpy python random-forest-regression

Last synced: 11 Apr 2026

https://github.com/nemeslaszlo/loan-data-decision-tree-and-random-forest

This repository covers a "Lending Club" analysis with decision tree and random forest with data visualizations and exploration.

decesion-trees matplotlib numpy pandas random-forest seaborn sklearn

Last synced: 17 Apr 2026

https://github.com/aarryasutar/credit_eda

This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.

binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/34j/numpy-flint-arb

Arbitrary precision floating / ball arithmetic (interval arithmetic) dtype in NumPy / array API

acb arb arbitrary-precision array-api flint interval-arithmetic mpmath numpy

Last synced: 10 Mar 2026

https://github.com/vineet416/chronic-kidney-disease-prediction

This repository contains the code for the Chronic Kidney Disease Prediction project. The goal of this project is to predict chronic kidney disease using parameters such as diabetes mellitus, blood urea, sugar, and hypertension. I used multiple machine learning algorithms with hyperparameter tuning; the highest accuracy score was 97.5.

data-visualization data-wrangling exploratory-data-analysis feature-engineering feature-selection hyperparameter-tuning machine-learning matplotlib numpy pandas plotly pre-processing python seaborn sklearn-library statsmodels

Last synced: 14 Apr 2026

https://github.com/carlthome/ipytoolbox

A collection of IPython magics and other utilities

ipython jupyter matplotlib numpy

Last synced: 18 Jan 2026

https://github.com/gehad-ahmed30/data-analysis

This repository offers a guide to data visualization with Pandas and includes practical examples and two complete projects. Learn to preprocess, analyze, and visualize datasets to solve real-world problems effectively.

analysis matplotlib numpy pandas python seaborn

Last synced: 07 May 2026

https://github.com/duoan/dl-bike-sharing-demand

Practice neural network implemented with just NumPy for Kaggle Bike Sharing Demand

bike-sharing bike-sharing-demand deep-learning kaggle neural-network numpy udacity-deep-learning

Last synced: 18 Jun 2026

https://github.com/dipeshgoyal013/ipl_win_probability

A project that helps you check the win probability of the batting team in the second innings

machine-learning matplotlib numpy pandas python sklearn

Last synced: 05 Feb 2026

https://github.com/l-johnston/microohm

SI unit system in NumPy as a native dtype

numpy python units

Last synced: 24 Feb 2026

https://github.com/akashash01/akash_linear-regression

This is a linear machine learning model used to predict the value of a dependent variable based on other, independent variables.

anova-test autocorrelation data-science heteroscedasticity linear-regression machine-learning mape multicollinearity numpy pandas python supervised-learning

Last synced: 07 May 2026

https://github.com/4thel00z/numpy-serializer

numpy-serializer is a nice and high-level way to serialize numpy.arrays while preserving their shape and type.

bytes deserialization deserialize deserializer msgpack numpy raw serialize serializer serializers

Last synced: 27 Jan 2026

https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model

This project implements a spam detection system for SMS messages using machine learning techniques.

mathplotlib nltk numpy panda python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/faizantkhan/regression-project-bangalore-property-price-prediction

🏠 Bangalore Property Price Prediction is a comprehensive project designed to accurately predict property prices in Bangalore. Leveraging advanced regression techniques and a dataset sourced from Kaggle, the model undergoes meticulous feature engineering, data cleaning, and parameter tuning to ensure high accuracy.

backend-api css data-cleaning data-science data-visualization eda flask html javascript machine-learning-algorithms numpy pandas project project-repository property python regression-models server

Last synced: 14 Apr 2026

https://github.com/harmanveer-2546/heart-failure-prediction

Heart failure is a severe condition in which the heart is unable to pump blood effectively. Early prediction of heart failure can significantly improve patient outcomes. This project aims to build a predictive model using machine learning techniques to identify patients at risk of heart failure.

exploratory-data-analysis feature-engineering jupyter-notebook matplotlib model-training-and-evaluation numpy pandas python sckiit-learn seaborn visualization-of-results

Last synced: 14 Apr 2026

https://github.com/sivatsk26/university-admit-eligibility-predictor

This project uses machine learning and regression methods, a statistical technique for predicting the outcome of an event, to assess users' admission eligibility for the universities they have chosen. The prediction is produced by the implemented algorithms once the user feeds the application the required information.

html-css-javascript ibm-cloud ibm-watson linear-regression machine-learning matplotlib numpy pandas python python-flask random-forest scikit-learn

Last synced: 13 Apr 2026

https://github.com/nirmalyabag20/wine-quality-prediction-machine-learning

This project analyzes the chemical properties of wines to identify key factors influencing quality. By leveraging machine learning techniques, I aim to develop predictive models that accurately classify wine quality, providing valuable insights for producers and enthusiasts alike.

k-neighbors-classifier logistic-regression machine-learning matplotlib numpy pandas python random-forest seaborn svc

Last synced: 09 Apr 2026

https://github.com/nirmalyabag20/loan-status-prediction-using-machine-learning

This project focuses on predicting the loan status (approved or not approved) based on various applicant details. The goal is to develop a machine learning model that accurately classifies whether a loan should be approved, helping financial institutions make informed lending decisions.

matplotlib numpy pandas python scikit-learn seaborn support-vector-machine

Last synced: 19 Jan 2026

https://github.com/rixiiz/using-knn-to-predict-the-obp-of-mlb-players

Using KNN to predict the On Base Percentage (OBP) of Major League Baseball (MLB) players at the end of the season

artificial-intelligence dataset f1-score jupyter-notebook knn-regression machine-learning matplotlib mse numpy pandas python scikit-learn supervised-learning

Last synced: 05 Apr 2026

https://github.com/bursasha/deeplearningai-machine-learning-tensorflow-course

Projects and certificates from a machine learning course focused on TensorFlow and Keras, covering neural networks, CNNs, NLP, and time series analysis 🧠

convolutional-neural-networks deeplearning-ai keras machine-learning mlcourse natural-language-processing neural-networks numpy pandas python-machine-learning recurrent-neural-networks tensorflow

Last synced: 10 Apr 2026

https://github.com/harmanveer2546/credit-card-fraud-detection

The Credit Card Fraud Detection Problem includes modeling past credit card transactions with the knowledge of the ones that turned out to be a fraud. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.

ann catboost eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xgboost

Last synced: 11 Apr 2026

https://github.com/takkii/pylean

Data analysis ( 🐍 💎 📈 )

analayze matplotlib numpy pandas python scikit-learn

Last synced: 09 Sep 2025

https://github.com/fatma-moanes/deep-learning-labs

My implementation for the labs of the Neural Networks and Deep Learning course that I studied at my university, Zewail City.

classification cnn data-science deep-learning gan keras matplotlib neural-network-architectures neural-networks numpy pandas preprocessing python regression-models regularization seaborn tensorflow time-series-analysis

Last synced: 14 Apr 2026

https://github.com/Krisseck/hki-traffic-predict

Keras neural network to predict traffic in Helsinki

avoindata helsinki keras numpy open-data python traffic

Last synced: 13 Jul 2025