An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/FredStober/cpp2numpy

cpp2numpy is a C++ library to write numpy files

c-plus-plus numpy

Last synced: 13 Jul 2025

https://github.com/gauravsakure02/pubg_win_prediction

Predicting PUBG game wins using a dataset involves analyzing various in-game features such as player kills, distance traveled, and weapons acquired to build a model that forecasts the likelihood of winning a match based on player performance, helping in winning game

matplotlib numpy pandas sklearn

Last synced: 29 Apr 2026

https://github.com/raduldev/gromacs

Molecular Dynamics Simulation using GROMACS in Jupyter notebook

bash gromacs matplotlib molecular-dynamics-simulation numpy pandas python3 shell

Last synced: 09 Apr 2026

https://github.com/dimatura/imgutils

Miscellaneous Numpy and PIL utilities for images.

image numpy pil pillow python

Last synced: 19 Jan 2026

https://github.com/achuth-0908/hemoguard-anemia-predictor

A Web App incorporated with a Gradient Boosting Classifier Model, to predict Anemia with given data.

css flask html matplotlib numpy pandas python scikit-learn

Last synced: 11 Apr 2026

https://github.com/qua77i20/ai2

A simple single-layer neural network (perceptron) built in Python. Demonstrates backpropagation training from scratch for beginners.

ai backpropagation data-science deep-learning machine-learning neural-network numpy perceptron python

Last synced: 15 May 2026

https://github.com/briankopp/fewerbytes

compress numpy arrays. binary and numeric techniques for numpy compression

compress compression numpy python python3 storage

Last synced: 18 May 2026

https://github.com/bestmahdi2/uni__pythonsupportvectormachinesbinaryclassification

A university project in which the binary classification of support vector machines is implemented with Python language

binary-classification classification matplotlib numpy python scikit-image scikit-learn seaborn support-vector-machine svm

Last synced: 07 Apr 2026

https://github.com/ryannapp12/quant_trading_engine

A modular, and scalable quantitative trading engine built in Python. This project demonstrates efficient data caching with SQLite, concurrent backtesting, and advanced risk analytics, showcasing best practices in clean code architecture and performance optimization.

algorithmic-trading backtesting dash data-analysis data-visualization fintech lstm machine-learning numpy pandas plotly python quantitative-finance real-time risk-management sqlite technical-analysis tensorflow time-series-analysis trading-strategies

Last synced: 11 Apr 2026

https://github.com/urme-b/multimodal-multisensor

Longitudinal neurophysiological study of adult psychometric testing.

keras matplotlib numpy pandas python pytorch scikit-learn seaborn tensorflow

Last synced: 13 Apr 2026

https://github.com/peterfication/num-peek

A CLI tool to peek into *.npy files

cli numpy python rust

Last synced: 20 Jan 2026

https://github.com/ashar97/financial-engineering

Financial Engineering Practicum Assignments

financial-engineering matlab numpy pandas python r yahoo-finance

Last synced: 11 Apr 2026

https://github.com/akashshnkr/multi-disease-prediction

Developed and integrated three machine learning models for predicting diabetes, Parkinson's, and heart disease into a Streamlit-based web application. The interface allows users to input data and receive accurate health predictions, enhancing early detection and healthcare outcomes.

logistic-regression machine-learning-algorithms numpy pandas python scikit-learn streamlit-webapp svm

Last synced: 02 Jan 2026

https://github.com/jabhij/eda_experiments

In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.

ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis

Last synced: 14 Apr 2026

https://github.com/ksatrajit0/heart-disease-prediction-ml

Predicts the risk of heart attack in a patient using their medical record

heart-disease-prediction machine-learning matplotlib numpy pandas scikit-learn seaborn

Last synced: 19 Apr 2026

https://github.com/camilajaviera91/apache-beam-pipeline-first-approach

This code demonstrates how to integrate Apache Beam with scikit-learn datasets and perform simple data transformations. It loads the Linnerud dataset from scikit-learn, converts it into a Pandas DataFrame for easier manipulation.

apache-beam dataframes glob kmeans-clustering matplotlib-pyplot mean-absolute-error mean-square-error numpy os pandas pipelines scipy-stats seaborn silhouette-score sklearn sklearn-datasets standardscaler

Last synced: 28 Apr 2026

https://github.com/mmsaki/whale_portfolio

Determine which Whale portfolio is performing the best across multiple areas: volatility, returns, risk and Sharpe ratios.

datetime finance jupyter-notebook matplotlib numpy pandas pathlib python quantitative-finance seaborn stock-market trading-strategies

Last synced: 06 May 2026

https://github.com/nishkarsh25/end_to_end_real_estate_price_prediction_project

This data science project series walks through step by step process of how to build a real estate price prediction website. We will first build a model using sklearn and linear regression using banglore home prices dataset from kaggle.com. Second step would be to write a python flask server that uses the saved model to serve http requests....

css html javascript jupyter-notebook matplotlib numpy pandas python python-flask-server sklearn

Last synced: 07 Apr 2026

https://github.com/mathpfreitas/top-hits-spotify-2000-to-2019-

# 🎧 Top Hits Spotify (2000 - 2019)Explore music trends from 2000 to 2019 with this dataset of songs, artists, and genres. Use the insights to understand what makes a hit in today's music landscape. 🐙💻

analysis analytics chart data-analysis data-visualization exploratory-data-analysis hits interactivedashboards jupyter-notebook matplotlib music musical numpy pandas plotly python timeseries track-hits

Last synced: 01 Jul 2025

https://github.com/savernish/forgenn

forgeNN is a in-development purpose‑built neural network framework combining a transparent NumPy autograd engine with a Keras‑like API and performance oriented primitives. Developed by a college student with an ambitious feature pipeline.

artificial-intelligence deep-learning keras machine-learning mlp-networks neural-network numpy pytorch tensorflow tensors

Last synced: 08 Oct 2025

https://github.com/nirmalyabag20/diabetes-prediction-using-machine-learning

This project focuses on predicting diabetes using machine learning algorithms based on health metrics like glucose levels, blood pressure, and BMI. By comparing different models, the goal is to identify the most accurate approach for early diabetes detection, showcasing the potential of machine learning in healthcare.

decision-tree-classifier jupyter-notebook kneighborsclassifier logistic-regression matplotlib numpy pandas python random-forest-classifier scikit-learn seaborn svc

Last synced: 18 Jan 2026

https://github.com/amritakaur-08/smart_attendance_system

The Smart Attendance System is designed to automate the attendance process using facial recognition. Traditional methods of taking attendance can be time-consuming and prone to errors. This project aims to streamline the process by leveraging computer vision technology, allowing for efficient, contactless, and accurate attendance tracking.

asgiref django mysql numpy opencv pillow-library python sqlparse

Last synced: 08 Apr 2026

https://github.com/jaypanchal9/spotless-data

Spotless Data: A Python-based workflow using Jupyter Notebooks for efficient data cleaning, preprocessing, handling missing values, correcting outliers, and integrating external datasets ideal for quick, reliable, and clean data preparation.

data-cleaning data-preprocessing data-wrangling matplotlib numpy pandas python3

Last synced: 13 Apr 2026

https://github.com/arjunan-k/netflix_analysis

Exploratory Data Analysis on Netflix Movies & TV Shows.

matplotlib numpy pandas plotly seaborn

Last synced: 09 May 2026

https://github.com/steventete/integral-app

IntegralApp is a Python-based GUI application designed to simplify the calculation of definite integrals using numerical methods.

desktop-app numerical-methods numpy pyqt5 python

Last synced: 18 May 2026

https://github.com/iv4n-ga6l/Realtime-Car-Traffic-Tracking

Realtime car traffic tracking using yolov8 model

cartracking numpy opencv python ultralytics yolov8 yolov8n

Last synced: 28 Apr 2025

https://github.com/kaleharshavardhan07/spam_mail-_detector_ai_model

This project implements a spam detection system for SMS messages using machine learning techniques.

mathplotlib nltk numpy panda python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/michael-insights/portfolio

This repository showcases my projects and skills in Data Analytics, Data Science, and Machine Learning. It includes hands-on work in data analysis, predictive modeling, and machine learning algorithms, aimed at solving real-world problems.

data-analytics data-science data-visualization datapreprocessing jupyter-notebooks machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn sql

Last synced: 11 Apr 2026

https://github.com/santoshn86/image-classification

This is an end-to-end project wehre I have used my friends images dataset for Image classification using logistic regreesion. It's applications may be used in Security and Surveillance, Criminal's classification & detection .

data-science flask-server html-css-javascript jupyter-notebook machine-learning matplotlib-pyplot numpy opencv pandas python sklearn-metrics

Last synced: 11 Apr 2026

https://github.com/vidhi1290/hr_employee_prediction

"Welcome to the HR Employee Promotion Prediction project! This repository contains the code and resources for a machine learning project that focuses on predicting employee promotions. By analyzing various employee attributes, this project aims to provide valuable insights for HR decision-making and talent recognition within organizations.

data-exploration data-science data-visualization docker hr-employee-prediction hyperparameter-tuning machine-learning matplot model-building numpy pandas scikit-learn seaborn streamlit streamlit-webapp

Last synced: 13 Apr 2026

https://github.com/soumya6tiwari/customer-segmentation-using-rfm-analysis

This project focuses on customer segmentation using RFM (Recency, Frequency, Monetary) analysis and K-Means clustering. It enables businesses to identify high-value customers, optimize marketing strategies, and improve customer retention through data-driven insights.

backend clustering flask frontend kmeans-clustering matplotlib numpy pandas python rfm-analysis scikit-learn unsupervised-learning

Last synced: 16 Feb 2026

https://github.com/abhishek010397/logisticregression

Using Logistic Regression Classifier to Predict Target Using Three Features

csv iloc logistic-regression machine-learning numpy pandas-dataframe python3 sigmoid-function sklearn

Last synced: 06 May 2026

https://github.com/sergio11/breast_cancer_diagnosis_mlp

🩺🔬 MLP-based Breast Cancer Diagnosis: Predicts tumor malignancy from image features, aiding in early detection. 📊🤖

deep-learning deep-neural-networks machine-learning mlp-classifier mlp-networks numpy sklearn sklearn-classify

Last synced: 21 Feb 2026

https://github.com/datarohit/date-fruit-classification

This is Date Fruit Data taken from Kaggle. This data severs a classification problem to solved. Using various features of the fruit classify the fruit to its type.

classification logisticregression numpy pandas python3 standard-scaler

Last synced: 13 Apr 2026

https://github.com/hossein-cod/deep-learning-for-training

Analysi hear disease data set with deep learning

accuracy chart jupyter keras matplotlib numpy pandas py python sklearn tensorflow

Last synced: 09 Mar 2026

https://github.com/moataz-elmesmary/analog-clock-opencv

Computer vision project implemented with OpenCV

computer-vision cv2 numpy opencv

Last synced: 07 May 2026

https://github.com/amir78729/cv2-video-art

Creating an image from an input video's frames

cv2 numpy pil python video

Last synced: 11 May 2026

https://github.com/camilajaviera91/clustering-first-approach

This code will help to understand, in a simple way, how the clustering model works. To achieve this, we will input certain variables (strings and integers), then plot them to visualize and analyze how the values are positioned.

array cluster clustering dictionaries matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/neerajcodes888/diwali-sales-analysis

An open-source repository for sales data analysis. Dive into insightful trends, metrics, and visualizations to empower data-driven decision-making. Ideal for data analysts, business professionals, and enthusiasts seeking comprehensive sales insights. Clone, customize, and contribute to enhance your sales analytics journey.

data-science-projects data-visualization numpy pandas-dataframe python3 sales-analysis seaborn-plots

Last synced: 26 Mar 2025

https://github.com/aarryasutar/credit_eda

This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.

binning boxplot correlation-matrix data-cleaning data-splitting dataframe feature-engineering heatmap jupyter-notebook matplotlib numpy pandas python scikit-learn seaborn

Last synced: 13 Apr 2026

https://github.com/ayaanjawaid/brain_stroke_prediction

project aims to predict the likelihood of a stroke based on various health parameters using machine learning models. The dataset is preprocessed, analyzed, and multiple models are trained to achieve the best prediction accuracy.

decision-trees exploratory-data-analysis matplotlib numpy pandas python regression xgboost

Last synced: 07 May 2026

https://github.com/mihirh19/analyzing-crimes-in-la

This GitHub repository contains a comprehensive analysis of crime data in Los Angeles. Leveraging a variety of data sources, tools, and techniques, we have explored patterns, trends, and insights related to criminal activities within the city. Our analysis includes data visualization crime dynamics in different neighborhoods and over time.

dataset machine-learning matplotlib numpy pandas seaborn

Last synced: 07 May 2026

https://github.com/otuemre/simplenet

A simple neural network framework built from scratch using Python and NumPy. Designed as a learning tool, SimpleNET includes essential layers, activation functions, and loss functions, enabling users to build and train basic neural networks from the ground up.

backpropagation deep-learning from-scratch machine-learning neural-network numpy python

Last synced: 13 Apr 2026

https://github.com/xaxtric7/learn-numpy

🧮 Welcome to my NumPy Practice repository! This repo contains my hands-on learning, notes, and code experiments as I dive into the fundamentals and advanced features of NumPy – a powerful library for numerical computing with Python.

numpy pyrhon3

Last synced: 15 May 2026

https://github.com/ananyachibber21/python-numpypandas-exercises

Exercises - Python for Data Science - NumPy + Pandas

numpy pandas python udemy

Last synced: 07 May 2026

https://github.com/mansurpro/digitrecognizer

DigitRecognizer is a simple, handcrafted neural network implemented with NumPy to classify handwritten digits from the MNIST dataset. This project demonstrates the foundational principles of neural networks, including forward propagation, backpropagation, and gradient descent, all built from scratch without relying on deep learning frameworks.

backpropagation custom-dataset data-visualization deep-learning from-scratch gradient-descent handwritten-digit-recognition machine-learning minimal-dependencies mnist neural-network numpy python

Last synced: 16 May 2026

https://github.com/johndev19/data-visualization-tool

A Python-based application for generating various types of charts and graphs from user-input data.

charts matplotlib numpy python visualization

Last synced: 17 May 2026

https://github.com/sirinemaaroufi/arabic-sentiment-analysis

This project focuses on developing a sentiment analysis model for Arabic text, leveraging hybrid transformer-based models (AraBERT) and LSTM approaches.

arabert arabic-nlp bert-model feature-extraction lstm machine-learning nlp numpy pandas pretrained-models python pytorch sentiment-analysis transformers

Last synced: 18 Apr 2026

https://github.com/zion-dev1/e-commerce-analysis

Short machine learning project which analyses customer behaviour 🔎

analysis clustering ecommerce machine-learning numpy pandas sklearn

Last synced: 05 Jul 2025

https://github.com/zacheddy/imageclassifier

A convolutional neural network from scratch to classify images (demo available)

image-classifier machine-learning neural-network numpy

Last synced: 19 May 2026

https://github.com/buathiertom/pictures-stacking

SAE de développement d'une interface d'empilement d'image

astropy mathplotlib numpy pyplot pyqt6 python-pyqt6

Last synced: 29 Mar 2025

https://github.com/yash-rewalia/stock-closing-price-prediction-using-regression

The ultimate business objective is to leverage the regression model to provide accurate predictions of the closing price of AMRN stock, enabling stakeholders to make well-informed investment decisions, manage risks effectively, optimize portfolios, Early warning systems to alert any fraud cases and align investment strategies with financial goals.

datapreprocessing eda hypothesis-testing machine-learning numpy pandas python random-forest regression regression-analysis statistics

Last synced: 12 Apr 2026

https://github.com/tasninanika/k-nearest_neighbors

K-Nearest Neighbors (KNN) is a simple and powerful machine learning algorithm used for classification and regression.

knn matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/pvlbzn/kot-or-not

Logistic regression (shallow neural network) which predicts whether input image is a cat or not.

classification-algorithm image-classification logistic-regression machine-learning neural-network numpy

Last synced: 01 Mar 2025

https://github.com/henriqueotogami/curso-de-python3-udemy

Documentos do curso de Python 3, com arquivos das aulas, e Jupyter Notebook. Conceitos de Machine Learning.

iris-dataset jupyter-notebook linear-regression numpy pandas-python python3 udemy

Last synced: 08 May 2026

https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken

Bei der Vermessung eines physischen Raumes ist das Ergebnis eine Punktwolke. Diese Punktwolke beschreibt dann ausgewählte Punkte im Raum, zum Beispiel auf den Wänden und der Decke. Wenn diese Punkte in zwei seperaten Messungen gemessen werden, vielleicht sogar von unterschiedlichen Geräten, soll hinterher herausgefunden werden wie genau diese Punktwolken übereinstimmen. Dafür gibt es zwei grundsätzlich verschiedene Methoden. Diese sollen hier verglichen werden.

3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics

Last synced: 11 Apr 2026

https://github.com/harmanveer2546/movie-industry

Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.

business business-analytics dataanalysis datetime matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/codewitheshayoutube/corvit_python_data_analytics_course

I teach Python & Data Analytics at Corvit Systems. This repository contains all course materials, including code, assignments, projects, and resources. ⭐ Stay updated!

matplotlib numpy pandas python python3 scikitlearn-machine-learning seaborn

Last synced: 13 Apr 2026

https://github.com/mohamedawnallah/covid19-dataanalysis-python

how to preprocess and merge datasets to calculate needed measures and prepare them for an Analysis. In this project, we are going to work with the COVID19 dataset, published by John Hopkins University, which consists of the data related to the cumulative number of confirmed cases, per day, in each Country. Also, we have another dataset consist of various life factors, scored by the people living in each country around the globe. We are going to merge these two datasets to see if there is any relationship between the spread of the virus in a country and how happy people are, living in that country.

covid19 dataanalysis datavisualization happiness-report jupyternotebook numpy pandas python

Last synced: 08 May 2026

https://github.com/leabrodyheine/water-pump-status-prediction

This project implements machine learning models to predict the status of water pumps in Tanzania using data from DrivenData's competition. The project includes preprocessing steps, model evaluation using cross-validation, and hyperparameter optimization with Optuna.

argparse cross-validation gradient-boosting-classifier logistic-regression machine-learning multilayer-perceptron numpy optuna pandas random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/21aja/house_price_prediction_revised_project

House price prediction using ML with Decision Tree Regressor achieving 100% accuracy. Preprocessing, EDA, and model evaluation applied using Python, Pandas, NumPy, Scikit-learn, and XGBoost for data-driven insights.

decision-tree-classifier linear-regression numpy pandas xgboost

Last synced: 13 Apr 2026

https://github.com/programmer-rd-ai/dimensionality-reduction

DimRed is a comprehensive Python toolkit for advanced dimensionality reduction, integrating with major machine learning libraries and featuring real-time performance monitoring to enhance data analysis and model efficiency.

analytics data-analysis data-science lightgbm machine-learning matplotlib numpy pandas programming python python3 sklearn university xgboost

Last synced: 01 Mar 2025

https://github.com/Krisseck/hki-traffic-predict

Keras neural network to predict traffic in Helsinki

avoindata helsinki keras numpy open-data python traffic

Last synced: 13 Jul 2025

https://github.com/hamdaniqhmqd/project-predict-saham-bbri

Repository Project-Predict-Saham-BBRI is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.

data-visualization numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/aminkhavari78/geoai-challenge-estimating-soil-parameters-from-hyperspectral-images-

Can you predict soil parameters from hyperspectral earth observation data?

dee keras matplotlib neural-network numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/imswappy/brain-tumor-detection

🧠 Deep learning project for brain tumor classification using MRI images. Built with transfer learning (VGG16 + fine-tuning), TensorFlow/Keras, and deployed via Streamlit. Dataset & model loaded dynamically from KaggleHub. Includes training notebook, evaluation, and interactive web app.

kagglehub keras numpy pandas scikit-learn streamlit tensorflow vgg16-model

Last synced: 13 Apr 2026

https://github.com/prasadlokhande-880/imagetransformationtool

This script demonstrates image transformations using OpenCV, including shifting, resizing, rotating with scaling, and skewing for various image processing tasks.

ai datamanipulation ml numpy opencv python

Last synced: 08 May 2026

https://github.com/solrikk/optichrom

This repository provides a collection of Python tools and applications for comparing images using various algorithms such as ORB (Oriented FAST and Rotated BRIEF) and SSIM (Structural Similarity Index). The tools include both command-line scripts and graphical user interface (GUI) applications for ease of use

computer-vision cv2 feature-matching image-comparison image-comparison-tool machine-learning numpy object-detection opencv orb python scikit-image ssim tkinter visualization yolo yolov5

Last synced: 18 Jan 2026

https://github.com/ishutak/disease_prediction

An AI-powered disease prediction system that uses machine learning to predict diseases based on symptoms. The system employs an ensemble of models including Random Forest and Neural Networks to provide accurate predictions with confidence levels.

css3 htlm5 javascript jquery numpy pandas pytorch scikit-learn select2

Last synced: 11 Apr 2026

https://github.com/shreeyas-48/creditcardfrauddetection

Project for detecting credit card frauds using neural networks and logistic regression

autoencoder keras logistic-regression matplotlib neural-networks numpy pandas python scikit-learn

Last synced: 12 Apr 2026

https://github.com/florianwoelki/mnist_fashion_cnn

This is a classification problem based on the mnist fashion dataset. I've used a CNN to classify the images.

classifier cnn fashion keras mnist numpy pandas tensorflow

Last synced: 08 Apr 2026

https://github.com/miroslav-reiter/kurz_ako_sa_stat_datovym_analytikom_python

🐍 Ako sa stať Dátovým ANALYTIKom v Pythone - Materiály, zdroje, skripty k večernému online školeniu Ako sa stať Dátovým Analytikom v Pythone pre Robime.it (Mentor Partners)

analysts analytics data-science math matplotlib numpy pandas python reiter scipy vizualisation

Last synced: 04 Apr 2026

https://github.com/yuanx749/complex_systems

A Python package for studying complex systems.

jupyter matplotlib networkx numpy pep8 project scipy streamlit

Last synced: 05 Apr 2026