An open API service indexing awesome lists of open source software.

NumPy

NumPy is an open source library for the Python programming language that adds support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.

https://github.com/bhattbhavesh91/numpy-performance-test

Comparing the performance of base Python's square root function with its NumPy equivalent

list numpy python

Last synced: 20 Apr 2026

https://github.com/shibbir24/a-data-driven-approach-to-food-security-and-supermarket-accessibility

A Data-Driven Approach to Food Security and Supermarket Accessibility

data-analysis matplotlib numpy pandas python3 seaborn

Last synced: 13 Apr 2026

https://github.com/harmanveer2546/recognizing-digits

I used the MNIST dataset to implement a handwritten digit recognition app, using a type of deep neural network called a convolutional neural network (CNN). I also built a graphical user interface (GUI) where you can draw a digit and have it recognized straight away.

dense flatten gui keras maxpooling mnist numpy pandas pillow python sequential

Last synced: 13 Apr 2026

https://github.com/iv4n-ga6l/Simple-Image-Segmentation

Simple implementation of image segmentation using the Watershed algorithm

imagesegmentation matplotlib numpy opencv python watershed-algorithm

Last synced: 28 Apr 2025

https://github.com/iv4n-ga6l/Crowd-move-simulation

Crowd movement simulation in Python

matplotlib numpy python simulation

Last synced: 28 Apr 2025

https://github.com/hajaarh/malaria_hematie_cnn

Convolutional neural network algorithm

numpy pandas python resnet-50 sklearn-library tensorflow vgg16

Last synced: 13 Apr 2026

https://github.com/dummumounika/ecommerce-sales-categorization

This repository contains Python code for text classification and analysis of e-commerce sales data. The script processes textual descriptions of products and categorizes them into predefined categories using a Naive Bayes classifier. It also includes various analysis and visualization methods to explore the dataset.

machine-learning matplotlib-pyplot ntlk numpy pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/sorabh-kapoor/face-recognition-attendance-system

The Facial Recognition System is an AI-powered application built with Flask and designed to detect and recognize faces with high accuracy. This system can be integrated into various applications, including security systems, attendance management, and identity verification.

flask flask-application knn ml numpy opencv pandas python scikit-learn

Last synced: 13 Apr 2026

https://github.com/lmtreser/intro-data-science

Training path in Data Science with Python and Pandas

data-science matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/yesenvidnath/facerecognitionapplight

This is a face recognition application which is capable of recognizing a person based on their face and taking attendance from it

face-recognition facent flask numpy python

Last synced: 13 Apr 2026

https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2

Prepare a prediction model for the profit of the 50_startups data. Apply transformations to get better predictions of profit, and make a table containing the R^2 value for each prepared model. R&D Spend: research and development spend in the past few years; Administration: spend on administration in the past few years; Marketing Spend: spend on Marketing in t

collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif

Last synced: 09 May 2026

https://github.com/aritrakar/ai-projects

🤖 A centralized collection of links to my AI projects roughly divided by topic.

numpy pandas python pytorch tensorflow

Last synced: 13 Apr 2026

https://github.com/nirmaldeepponnada/codeclauseinternshipproject2

Python, NLTK, Scikit-Learn, Pandas, NumPy, Pickle, SciPy, and JSON are used for text preprocessing, feature engineering, multi-label classification, and model persistence.

nltk numpy pandas pickle python scikit-learn scipy

Last synced: 07 Apr 2026

https://github.com/jfjlaros/online-array

Online array, pass a function as a NumPy array.

array function numpy online unbounded

Last synced: 13 May 2026

https://github.com/fabriziomusacchio/ann_with_numpy

A how-to for building an artificial neural network from scratch using NumPy

ann artificial-neural-networks neural-network numpy

Last synced: 08 May 2026

https://github.com/nyx1311/toxicity-detector-using-bilstm

🚀 What we built: An AI-powered Women’s Safety & Well-Being Detector, a web app that flags multiple forms of online abuse in real time and offers tools for emotional recovery. 📊 Under the hood: BiLSTM + Word2Vec embeddings for deep, context-aware detection. Trained on 21K+ labeled comments across 7 toxicity categories. Built with Python, Tensor

epoch genism gpu keras-tensorflow matplotlib model nlp nlp-machine-learning nlpaug-textual numpy pandas pandas-library python3 streamlit tensorflow word2vec

Last synced: 14 Apr 2026

https://github.com/ahmedabdalkreem/healty-care

A complete project that uses a neural network to predict whether a patient has had a stroke.

matplotlib neural-network numpy pandas python3 sklearn-library

Last synced: 13 Apr 2026

https://github.com/railab/nxscli-np

Nxscli extension with NumPy operations

numpy nxscli

Last synced: 21 Apr 2026

https://github.com/takana671/partiallytransparentimage

A simple tool to make an image partially transparent

geomipterrain numpy opencv pillow python3 transparent-image

Last synced: 07 May 2026

https://github.com/vedikasnehil/daily-leetcode-

Daily LeetCode Using Python is a project dedicated to solving coding challenges consistently using Python. It focuses on improving problem-solving skills, mastering Python techniques, and building a collection of clean, well-documented solutions. Perfect for interview preparation, learning algorithms, or daily coding practice!

matplotlib numpy pandas python python3 seaborn

Last synced: 13 Apr 2026

https://github.com/danny-1k/nnpy

Neural network library written from scratch in numpy

deep-learning neural-networks-from-scratch numpy python

Last synced: 17 May 2026

https://github.com/tkc/node-neural-network

A neural network Iris sample built with pure TypeScript only.

javascript machine-learning neural-network node numpy typescript

Last synced: 09 Apr 2026

https://github.com/parthshah02/customer_churn_dashboard

This repository features a comprehensive project showcasing data analysis and an interactive dashboard using Python

data-analysis matplotlib numpy pandas python

Last synced: 13 Apr 2026

https://github.com/suap-dev/python-essentials

A repository intended for people who want to learn some Python basics

lambda-functions numba numpy oop pandas teaching

Last synced: 17 Mar 2025

https://github.com/hansalemaos/a_pandas_ex_split_on_common_char

Splits the elements in a Series using a common separator character and returns a DataFrame.

numexpr numpy pandas separator split string

Last synced: 14 Jan 2026

https://github.com/anikdascodes/fantasy-cricket-project

🏏 Python simulation for generating 20K+ unique fantasy cricket teams with probability-based player selection and constraint optimization. Advanced algorithms, data analysis & visualizations included.

algorithm constraint-satisfaction cricket data-science fantasy-sports jupyter machine-learning numpy optimization pandas plotly probability python simulation sports-analytics

Last synced: 28 Apr 2026

https://github.com/nihanthbhargav/time-series-stock-market

This project combines computer vision and NLP by segmenting pet images with a U-Net model and generating captions using CNN-RNN/LSTM. Using the Oxford-IIIT Pets dataset, it demonstrates a unified pipeline that integrates pixel-level segmentation with automatic caption generation for meaningful image understanding.

matplotlib numpy pandas plotly python scikit-learn seaborn

Last synced: 11 Apr 2026

https://github.com/mehediehteshum/minesweeper

Building classic 'Minesweeper' game from scratch using Pygame with VS Code.

numpy pygame python python3

Last synced: 06 Oct 2025

https://github.com/farhad-here/height-distribution-analysis

Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.

coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance

Last synced: 13 Apr 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/ejw-data/python-bitwise-operators

Difference between bitwise and logical operators when using pandas

numpy pandas

Last synced: 08 May 2026

https://github.com/christabelsakyi/tsec

This project implements a time series forecasting solution for predicting electricity consumption based on historical data. It provides multiple model options, a Flask API for deployment, and visualization tools.

facebook numpy pandas python sklearn time-series-analysis

Last synced: 07 Oct 2025

https://github.com/nimaabediforud/heart-disease-prediction

The project highlights the end-to-end workflow of data preprocessing, model building, evaluation, and comparison for a healthcare prediction task.

biomedical-applications biomedical-data-science biomedical-engineering classification deep-learning machine-learning neural-networks numpy pandas prediction python seaborn sklearn tensorflow

Last synced: 15 Apr 2026

https://github.com/prarthana-singh/bangalore-house-price-predictor

🏡 Bangalore House Price Prediction: A Machine Learning model to predict house prices in Bangalore using real estate data. Built with Linear Regression, Python, Pandas, NumPy, and Scikit-Learn.

data-analysis eda house-price-prediction linear-regression machine-learning numpy pandas python real-estate regression scikit-learn

Last synced: 19 Apr 2026

https://github.com/alex-s2222/mem_chat_bot

Telegram bot for creating GIFs from text entered by the user

cv2 google-api moviepy numpy python python-telegram-bot requests

Last synced: 08 Oct 2025

https://github.com/harshavardh8n/machine-learning-projects

Machine Learning Projects: A collection of machine learning projects developed as part of the curriculum, utilizing libraries such as NumPy, Pandas, Seaborn, and Matplotlib. These projects demonstrate various algorithms, including Apriori and Decision Trees, showcasing practical applications and data analysis techniques.

apriori-algorithm decision-trees jupyter-notebook machine-learning numpy pandas python

Last synced: 18 Jan 2026

https://github.com/mkdirer/disaster-tweet-nlp-adaboost-transformers

A machine learning project that classifies tweets as disaster-related or non-disaster-related using transformers and the AdaBoost technique. Achieved an accuracy of approximately 80% on the validation set

geopy keras matplotlib numpy pandas python sklearn tensorflow

Last synced: 12 Apr 2026

https://github.com/isk-daniar/signal-filter-plt

Signal filter PLT

numpy pillow python

Last synced: 18 Apr 2026

https://github.com/rachakondaganesh/superstore-sales-data-analysis-project

Analyzed retail sales data to uncover key business insights using Python and Power BI. Explored patterns in profit, sales, and customer segments across regions. Built interactive dashboards to visualize trends, identify top-performing categories, and highlight areas for improvement in shipping and discount strategies.

dashboard exploratory-data-analysis matplotlib-pyplot numpy pandas powerbi seaborn

Last synced: 02 May 2026

https://github.com/hmotez/medclaimml

An AI-powered machine learning application designed to process healthcare reimbursement claims. It analyzes medical documents, client information, insurance policies, and legal regulations to predict accurate reimbursement amounts, ensuring efficiency, compliance, and fraud detection.

healthcare jest-tests mern-stack mongodb nodejs nosql numpy pytorch react scikit-learn tailwindcss

Last synced: 12 Apr 2026

https://github.com/samuelpuerta/modelosiasaberprocolombia

Kaggle project: build a model to predict student performance on the Saber Pro tests

ai ia jupyter-notebook machine-learning numpy pandas prediction-model predictive-modeling python

Last synced: 06 May 2026

https://github.com/faisal-khann/ipl-analysis

The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.

data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn

Last synced: 08 May 2026

https://github.com/yeisson8a/tallerdataanalysis

Data analysis workshop using Python and the Pandas, NumPy, Seaborn, and Matplotlib libraries to manipulate data from a file, as well as the Statsmodels library for linear regressions.

analisis-de-datos linear-regression matplotlib numpy pandas python3 seaborn statsmodels

Last synced: 14 Apr 2026

https://github.com/netbr3ak/planetary-system-simulation

Planetary System Simulation enables users to simulate the trajectories of two planets in a gravitational system. Through a simple web interface, users can predict planetary movements using technologies such as Flask, NumPy, and SciPy.

astronomy flask gravitational-dynamics numerical-analysis numpy physics planetary-systems scipy simulation web-application

Last synced: 14 Apr 2026

https://github.com/bhavnanahar/coral_project

This project analyzes and predicts the health of coral reefs in the Florida Keys by examining coral cover and species richness trends over time.

colab-notebook matplotlib numpy pandas plotly python random-forest scikit-learn seabo

Last synced: 14 Apr 2026

https://github.com/gojoel/brockton-data-analysis

Analyzes police logs from the city of Brockton, MA and attempts predictive analysis.

data-science exploratory-data-analysis geopy machine-learning matplotlib numpy pandas seaborn

Last synced: 09 May 2026

https://github.com/arsh-pixel-cmd/ai-image-classifier

AI Image Classifier is a web app that uses AI and deep learning to classify images in real-time. Built with Streamlit, TensorFlow (MobileNetV2), OpenCV, and NumPy, it allows users to upload single or multiple images and get the top-3 predictions with confidence scores instantly in the browser.

docker numpy opencv python3 streamlit tensorflow2 uv

Last synced: 10 Apr 2026

https://github.com/naveen88112/vehicle-routing-problem

Vehicle Routing Problem (VRP) Solver. This project applies a Genetic Algorithm (GA) to optimize the Vehicle Routing Problem (VRP). It handles more than one vehicle to find delivery routes that reduce travel distance and increase efficiency.

deap genetic-algorithm matplotlib numpy python vehicle-routing-problem

Last synced: 13 Apr 2026

https://github.com/sohitbennett/roadsafe

A Deep learning computer vision system for real-time traffic safety monitoring.

computer-vision esrgan keras numpy pandas python scikit-learn tensorflow tesseract-ocr yolov5 yolov8

Last synced: 08 Apr 2026

https://github.com/rakesh3596/apple-phone-sales-analysis

Apple phone sales analysis of an Apple CSV file using Python tools such as Pandas, Matplotlib, NumPy, and Plotly

csv-files csv-reader matplotlib numpy pandas plotly python

Last synced: 14 Apr 2026

https://github.com/cinnaavox/loan-prediction

Machine Learning project predicting loan approvals using Decision Tree Classification. Includes data cleaning, feature engineering, model evaluation and key business insights.

decision-trees matplotlib numpy pandas python scikit-learn seaborn

Last synced: 14 Apr 2026

https://github.com/rscr1/summer_practice_mai_2

Solutions to all summer practice assignments

matplotlib numpy pandas python scipy sympy

Last synced: 14 Apr 2026

https://github.com/amiridk/statistics-and-algebra-in-python

Simple and important statistics operations, famous statistical problems, and algebra operations

algebra itertools matplotlib numpy pandas python scipy seaborn static

Last synced: 14 Apr 2026

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/psgebeline/b2_plotter

A simple Python package for use by Belle II analysts to make various plots out of pandas dataframes and numpy arrays.

jupyter-notebook matplotlib matplotlib-pyplot numpy pandas pandas-dataframe python uproot visualization-tools

Last synced: 19 Jan 2026

https://github.com/akshay-kamath/anomaly_detection

This project addresses the challenge of identifying anomalies in sales data, aiming to optimize sales strategies by uncovering deviations from expected profit patterns.

anomaly anomaly-detection cblof flask isolation-forest knn matplotlib numpy pandas pyod python sklearn

Last synced: 19 Jan 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/alisonmitchell/titanic

Exploration of a subset of the Titanic passenger manifest to create a predictive classification model to determine which passengers were more likely to survive.

deep-learning keras machine-learning matplotlib numpy pandas python scikit-learn scipy seaborn tensorflow

Last synced: 14 Apr 2026

https://github.com/massimilianovisintainer/rock-vs-mine-prediction

Rock vs. Mine classification using Logistic Regression

machine-learning model numpy pandas sklearn

Last synced: 21 Jan 2026

https://github.com/ramchaik/cinebrain

CineBrain uses ML & NLP to analyze movies and recommend similar ones based on user preferences through cosine similarity.

cosine-similarity flask htmx kaggle machine-learning movie-recomendation-system nlp nltk numpy pandas sklearn tailwind tmdb tmdb-api unsupervised-learning vectorization

Last synced: 19 Jan 2026

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/ricardorobledo/paymentcardfrauddetection2025

Comparative analysis of probabilistic classification models for credit card fraud detection, focusing on model calibration and threshold optimization in highly imbalanced datasets.

imbalanced-learn matplotlib numpy pandas python3 scikit-learn search

Last synced: 14 Apr 2026

https://github.com/ashutosh65000/invisible-cloak

Have you watched the Harry Potter movies? The Invisible Cloak scene pushed me to write this small program.

image-processing numpy opencv-python

Last synced: 19 Apr 2026

https://github.com/dev-dur/student_exam_analysis

This is a beginner-friendly data analysis mini-project using NumPy, Pandas, and Matplotlib. The project generates random student exam scores, analyzes them, and visualizes the results.

matplotlib-pyplot numpy pandas python

Last synced: 16 Apr 2026

https://github.com/nisch-mhrzn/house_prediction

This project predicts house prices using data exploration, feature engineering, and machine learning models like Linear Regression and Random Forest. It demonstrates how to optimize models and evaluate their performance to accurately forecast house prices.

matplotlib numpy pandas python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/khushi130404/regulexa

Regulexa is a Python project that showcases and compares Ridge, Lasso, and Elastic-Net regularization techniques in machine learning. It includes visualizations and performance insights to help prevent overfitting and improve model generalization.

elastic-net-regression lasso-regression numpy ridge-regression

Last synced: 17 May 2026

https://github.com/yahiazakaria445/ensemble-learning-voting-classifier

Ensemble Learning Using KNN, Naive Bayes, Decision Tree on Biomechanical Data

matplotlib numpy pandas scikit-learn seaborn

Last synced: 30 Apr 2026

https://github.com/hansalemaos/cirdhighspeedcoverter

High speed conversion of IP addresses represented in CIDR notation into their corresponding start and end IPs, along with their respective subnet masks.

cidr data-manipulation ip-addresses ip-conversion network network-management network-security numpy pandas subnet-mask

Last synced: 27 Jan 2026

https://github.com/avik-pal/kaggle-titanic

Predicting whether a given set of people survive on the Titanic

machine-learning numpy pandas scikit-learn scikitlearn-machine-learning

Last synced: 14 Apr 2026

https://github.com/grand-27-master/fingers-tap

A real-time multi-finger tap detection system

mediapipe numpy opencv python3

Last synced: 14 Apr 2026

https://github.com/ananupam/eda-titanic

An Exploratory Data Analysis project done on Titanic Dataset.

numpy pandas python seaborn

Last synced: 14 Apr 2026

https://github.com/pedrosfaria2/analisetitulosnetflix

Study of the popularity of Netflix films on IMDb.

analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python

Last synced: 14 Apr 2026

https://github.com/gregoritsch3/ml_eda_classification_diabetes

An EDA and Machine Learning Classification exercise on the Diabetes dataset demonstrating the use of SQLAlchemy data import from an SQL database (PostgreSQL), pre-processing pipelines, ANOVA, 9 scikit-learn ML models, hyperparameter tuning for the best-performing one, and feature importance.

anova machine-learning matplotlib numpy pandas pipelines scikit-learn seaborn sql sqlalchemy statistics

Last synced: 14 Apr 2026

https://github.com/ljubogdan/solar-cycle-lstm

This project predicts sunspot activity using an LSTM model for time series data. Built with TensorFlow and Keras, it uses Huber loss for outlier handling and MAE for performance evaluation. The dataset, sourced from Kaggle or SIDC, spans over 270 years of monthly sunspot data.

conv1d huber-loss-regression kaggle keras lstm machine matplotlib numpy pandas scikit-learn seaborn solar sunspots tensorflow time-ser

Last synced: 13 Apr 2026

https://github.com/anish37860/python_party

Python summer party by Interview Master & in partnership with DataCamp

jupyter numpy pandas python

Last synced: 06 May 2026

https://github.com/karthik9273/revolutionizing-gold-rate-forecasting-for-small-businesses-with-machine-learning

The "Gold Price Prediction" project focuses on predicting the prices of gold using machine learning techniques. It uses popular Python libraries such as NumPy, Pandas, Scikit-learn (sklearn), Matplotlib, and Seaborn, together with a Random Forest Regressor model, to estimate prices.

data-science google-colab-notebook jupyter-notebook machine-learning matplotlib numpy pandas-dataframe python seaborn sklearn

Last synced: 06 May 2026

https://github.com/navindafernando/lane-detection

Lane Detection with Image Feature Extraction Techniques 🚡

image-feature-extraction kernel-filters machine-learning neighborhood numpy opencv sobel-filter

Last synced: 13 Apr 2026

https://github.com/icepanorama/internship-visualizations-and-demonstrations

A collection of some of the programs that I've written over the course of my internship.

artificial-intelligence machine-learning matplotlib numpy pandas python3 pytorch scikit-learn

Last synced: 14 Apr 2026

https://github.com/juzershakir/predicting_boston_housing_prices

Built a model to predict the value of a given house in the Boston real estate market using various statistical analysis tools. Identified the best price at which a client can sell their house using machine learning.

bias-variance boston-housing-price-prediction data-exploration decision-tree-regression gridsearchcv k-fold machine-learning matplotlib mlfnd model-evaluation model-validation numpy pandas python3 r2-score sklearn supervised-learning udacity-nanodegree

Last synced: 22 Oct 2025

https://github.com/chirindaopensource/search_benford_law_compatibility

End-to-End Python scalable forensic accounting toolkit implementing Benford's Law analysis for FTSE financial data. Delivers automated anomaly detection with Chi-Squared/MAD testing, comprehensive validation pipelines, and risk-based prioritization of investigative resources. Replicates Ausloos et al.'s (2025) methodology with full reproducibility.

academic-research anomaly-detection benfords-law chi-squared-test data-validation econometrics financial-analysis financial-data forensic-accounting fraud-detection ftse goodness-of-fit jupyter-notebook numpy pandas python reproducible-research risk-management scipy statistical-testing

Last synced: 12 Apr 2026