An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/alex1990/numstat

Output the measures of a list of numbers, such as sum, minimum, maximum and so on.

statistics

Last synced: 16 May 2025

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/mmyros/bayes-window

Pretty, easy, flexible Bayesian estimation with data overlay

altair bayesian-inference inferential-statistics numpyro statistics

Last synced: 16 Jan 2026

https://github.com/g4brielvs/etudes

:robot: My collection of études

data-science etudes mathematics pytudes statistics teaching

Last synced: 23 Mar 2025

https://gitlab.com/prebens-phd-adventures/universal-edit-distance

A small Python library containing some generic metrics implemented in Rust

automatic speech recognition (ASR) metrics python rust statistics

Last synced: 07 May 2026

https://github.com/praisetompane/statistics

An attempt at unpacking statistics and its foundations in layman's terms.

statistics

Last synced: 16 Apr 2025

https://github.com/samsmithnz/azuredevopsstatistics

A PowerShell script to scan organizations for projects, repos, prs, artifacts, and work items

azure-devops azuredevops powershell statistics

Last synced: 31 Mar 2025

https://github.com/gyf9712/stat-theory-skills

A 4-skill pipeline for Claude Code: verify mathematical proofs → repair with literature support → sharpen the theory → write corrected proofs. Integrates Codex MCP for adversarial cross-review. Venue-audited reference library across statistics/econometrics/ML theory.

claude-code claude-skills econometrics latex machine-learning-theory mathematical-proof proof-verification research-tools statistics theoretical-statistics

Last synced: 27 May 2026

https://github.com/rakibhhridoy/differentprojects

Some of my learning projects that I practice to launch in data science. Not all, but some of few that was stored in my local repository. It can be useful for beginner data science enthusiast. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 11 Apr 2026

https://github.com/thechibo/estim

Distribution Parameter Estimation - DEPRECATED - please use https://github.com/thechibo/joker

estimation maximum-likelihood-estimation moment-estimation probability-distribution r r-package statistics

Last synced: 27 May 2026

https://github.com/bozenne/lavasearch2

Tools for model specification in the latent variable framework (add-on for the lava package)

inference latent-variable-models r statistics

Last synced: 05 Jan 2026

https://github.com/dcs-training/pca-2023

PCA workshop. In this repo, you are going to find the code and files we are going to use for the practical part of the workshop, together with the ppt associated with this training

data-analysis data-visualisation data-wrangling r statistics

Last synced: 20 Jun 2026

https://github.com/bessarodrigo/hypothesis_test_healthy_program

Teste de Hipóteses da média de uma população para avaliar o nível de colesterol dos colaboradores de uma empresa.

hypothesis-test hypothesis-testing hypothesis-tests python statistics

Last synced: 02 May 2026

https://github.com/praju-1/deep_learning

This repository include Deep_learning concept which is subset of machine learning which is based on Neural Networking.

keras nltk pandas python sklearn spacy statistics tensorflow

Last synced: 11 Apr 2026

https://github.com/gianfranco7/pf_ci0130_b70866_b65580

Repositorio para el proyecto final del curso CI-0130 Métodos de Modelado y Optimización. II Semestre 2021

r statistics

Last synced: 20 May 2026

https://github.com/beliavsky/dataframe

DataFrame for Fortran for floating point data

dataframe fortran modern-fortran statistics

Last synced: 05 Jan 2026

https://github.com/filip-kustura/statistics-olympics-analysis

A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.

correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project

Last synced: 05 Jan 2026

https://github.com/hhentschke/simulate-standardized-mean-differences

Computation and visualization of standardized mean differences from simulated data

bias-correction confidence-intervals effect-size hedges-g matlab repeated-measurements statistics

Last synced: 28 May 2026

https://github.com/kamilpytlak/hcv-research-analysis

🦠 Results of statistical analysis of HCV testing at the diagnostic point in Wrocław (part of a scientific publication)

hcv research statistics

Last synced: 16 Jan 2026

https://github.com/celiannourry/monty-hall-simulation

Monty hall is a statistical puzzle which fools most of people. Here is a small simulation to prove that switching door is indeed the best choice.

monty-hall-problem monty-hall-simulator python statistics

Last synced: 18 Jun 2025

https://github.com/pmatos/octo-racknews

Statistics gathering for Racket News

github racket racket-news statistics

Last synced: 13 May 2026

https://github.com/gfyoung/statwrappers

Useful wrapper classes around Python stat library functionality

python statistics wrapper

Last synced: 28 Aug 2025

https://github.com/carlosefr/bpstat-api-scripts

(Unofficial) scripts to fetch data from the Bank of Portugal's statistics API

economy government-data opendata portugal statistics

Last synced: 15 Mar 2025

https://github.com/stdlib-js/stats-strided-dmeanpn

Calculate the arithmetic mean of a double-precision floating-point strided array using a two-pass error correction algorithm.

arithmetic-mean array average avg central-tendency double float64 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed

Last synced: 26 Apr 2026

https://github.com/auraoneai/iaa-kit

Modern inter-annotator agreement metrics with bootstrap intervals, ordinal support, and missing-data handling.

ai-evaluation evals inter-annotator-agreement statistics

Last synced: 28 May 2026

https://github.com/james-montgomery/romus

A fun package for exploring conjugate models and gibbs samplers.

bayesian bayesian-inference gibbs gibbs-sampler gibbs-sampling machine-learning modeling statistics

Last synced: 10 Jun 2025

https://github.com/efrod/bca-survival-analyzer

A tool that helps you performing survival analysis on body composition data.

python statistics survival-analysis

Last synced: 13 Jan 2026

https://github.com/szymon-budziak/probability_theory_and_statistics_course_agh

Solutions from Probability theory and Statistics course at AGH UST. If you use it and it helped you please leave a star ⭐

probability-theory r statistics

Last synced: 19 Mar 2025

https://github.com/mncube/simboil

A collection of simulation helper functions

iteration r sample simulation statistics

Last synced: 28 May 2026

https://github.com/gbrsales/kll

Streaming Quantile Approximation for R

r statistics stream-processing

Last synced: 31 Mar 2025

https://github.com/marberts/pysps

Sequential Poisson sampling in Python

python sampling statistics

Last synced: 24 Mar 2025

https://github.com/m-dadej/pd_estimation

Estimation of probability of default on novel data from Orbis

data-science predictive-modeling statistics

Last synced: 02 Jul 2025

https://github.com/alexp11223/d3flightsdatavisualization

Some visualizations for Kaggle 2015 Flight Delays and Cancellations dataset using d3.js and Leaflet

d3 d3js dataset flight-map leaflet sql statistics visualization

Last synced: 31 Mar 2025

https://github.com/stla/gfilmm

Generalized fiducial inference for linear mixed models

fiducial linear-mixed-models r statistics

Last synced: 10 Mar 2026

https://github.com/ritam2006/pmr-backend

A Java Spring Boot backend service for my Portfolio Management Risk project.

api financial-analysis java postgresql springboot sql statistics

Last synced: 02 Jul 2025

https://github.com/odessaz/portfolio-projects

This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science

applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics

Last synced: 12 Apr 2026

https://github.com/messente/messente-api-java

Messente API library: https://bintray.com/messente/messente-api/messente-api

number-lookup omnichannel phonebook statistics

Last synced: 24 Mar 2025

https://github.com/guusdias/programming-language-stats

Programming languages statistics in the last three years.

csv python statistics

Last synced: 02 Jul 2025

https://github.com/deve-sh/normal-distributor

A JavaScript Project to form a Normal Probability Distribution out of a provided Sample.

javascript normal-distribution statistics

Last synced: 03 Sep 2025

https://github.com/muonray/cern-root-financial-mechanics-and-market-analysis-codes

CERN ROOT codes used to develop the images and graphs in the article on my blog: http://muonray.blogspot.com/2014/09/particle-physics-software-and-financial.html

article cern cern-root drift efficient-frontier finance financial-analysis geometric-brownian-motion graphs market-analysis-codes monte-carlo monte-carlo-methods optimization-methods quant simulation statistics stock-market

Last synced: 24 Mar 2025

https://github.com/stdlib-js/stats-strided-dnanvariancetk

Calculate the variance of a double-precision floating-point strided array ignoring NaN values and using a one-pass textbook algorithm.

array deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 28 Apr 2026

https://github.com/stdlib-js/stats-strided-dnanvarianceyc

Calculate the variance of a double-precision floating-point strided array ignoring NaN values and using a one-pass algorithm proposed by Youngs and Cramer.

array deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 30 Apr 2026

https://github.com/stdlib-js/stats-strided-dnanstdevtk

Calculate the standard deviation of a double-precision floating-point strided array ignoring NaN values and using a one-pass textbook algorithm.

array deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 29 Apr 2026

https://github.com/cybcon/docker.dockerhubstats2mqtt

Collect repository statistics from Dockerhub and publish them to a MQTT topic.

docker-image dockerhub mosquitto-pub statistics

Last synced: 14 Apr 2026

https://github.com/swharden/cleananalytics

Privacy-focused website and application telemetry

analytics google-analytics logging php statistics

Last synced: 17 Jul 2025

https://github.com/makcymal/silvera

My researches on ML and statistics, optimization methods, CS algoritms and numerical methods

algorithms data data-structures machine-learning numerical-methods statistics

Last synced: 01 Apr 2025

https://github.com/minesja/poker-service

A monte-carlo simulation app for poker. Run thousands of rounds holding certain variables constant to calculate probabilities of the game.

monte-carlo poker statistics

Last synced: 20 Mar 2025

https://github.com/mfansler/hmmcov

Migration of https://code.google.com/archive/p/hmmcov/

bioinformatics hmm statistics

Last synced: 08 Apr 2025

https://github.com/justdvnsh/algorithms

A list of all major algorithms, their tests and explanations ranging all the way from basic to advanced.

advanced algorithms basic data-structures intermediate javascript machine-learning-algorithms statistics

Last synced: 12 Jul 2025

https://github.com/srvanderplas/statistical_atlas

Framed Charts and the Statistical Atlas of 1870

census data ggplot2 graphics r statistics visualization

Last synced: 29 May 2026

https://github.com/sodascience/schools_synth_test

Test code for disaggregated synthetic control with simulated data

causal-inference simulation statistics synthetic-control

Last synced: 20 Mar 2025

https://github.com/beliavsky/starpac

Standards Time Series and Regression Package, a library of Fortran subroutines for statistical data analysis developed by the Statistical Engineering Division of the National Institute of Standards and Technology

arima autocorrelation fft forecasting gamma-function nonlinear-least-squares periodogram polynomial-regression rng statistics time-series-analysis

Last synced: 06 Jan 2026

https://github.com/elb4rto/craplog-gui

Parse Apache2 logs to create statistics

apache2 apache2-logs gui java logs statistics

Last synced: 23 Apr 2025

https://github.com/cleoold/linearly_varying_binomial_distribution_calcs_python

a "binomial" distribution with linearly increasing chance.

probability python-c-extension statistics

Last synced: 16 Mar 2025

https://github.com/govind-prakash/r

A collection of R scripts designed for basic bioinformatics and biostatistics projects. This repository includes scripts for data analysis, visualization, and statistical modeling, catering to researchers and students in life sciences

data-science data-visualization r r-base rstudio statistics

Last synced: 05 Sep 2025

https://github.com/bozenne/lavareduce

Latent variable models with linear predictors

latent-variable-models lava-r-package r statistics

Last synced: 15 Jan 2026

https://github.com/lazernata/transport-problem

Bachelor's Thesis Work: Shiny app to solve the transport problem. Available in Spanish and English

operational-research rstudio shinyapps statistics

Last synced: 31 Mar 2025

https://github.com/elkronos/feature_selection

This repo contains a series of functions aimed at helping to select features and reduce dimensionality.

data-science dataanalytics feature-selection r statistics

Last synced: 09 Apr 2025

https://github.com/mrousavy/piechart

A rich PieChart control for WPF which supports easy MVVM bindings and data access

analytics chart control library pie-chart statistics wpf xaml

Last synced: 02 May 2026

https://github.com/rohankalbag/probability-random-processes

Probability and Random Processes - Autumn Semester 2021 - Indian Institute of Technology Bombay

probability python random-processes statistics

Last synced: 16 May 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All codes for the data pipelines processing, statistical modellings, descriptive statistics and plot visualizations from airborne phase of Mahdavi et al. (2021) (Environmental Pollution) Project Miestone: 2018 - 2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/samorso/ibpaper

This repository contains the code to replicate the numerical studies presented in the paper "A Flexible Bias Correction Method based on Inconsistent Estimators".

bias-reduction finite-sample-bias indirect-inference regression resampling statistics

Last synced: 02 Apr 2025

https://github.com/pabsan-0/vfs2

Vectorial Mutual-Information based feature selection

feature-selection mutual-information repos-ml statistics

Last synced: 17 Mar 2025

https://github.com/gearkite/yapperfinder

Finds the yappers in your Matrix chats and displays some statistics

matrix statistics

Last synced: 16 Mar 2025

https://github.com/schw4b/titanic

A reanalysis of the Titanic data set in R and Quarto.

quarto r statistics

Last synced: 25 Mar 2025

https://github.com/olafhaag/c3d-statistics

Analyze conditional values of C3D data generated by Phasespace Impulse X2 motion capture system

c3d motion-capture phasespace statistics

Last synced: 14 Jun 2025

https://github.com/mightymetrika/mmibain

Bayesian Informative Hypotheses Evaluation Web Applications

bayes-factor bayesian hypothesis informative r statistics

Last synced: 06 Jul 2025

https://github.com/psygo/monte-carlo-ts

Monte Carlo (Gaussian) with TypeScript and SolidJS

monte-carlo simulation statistics

Last synced: 07 Dec 2025

https://github.com/cesar312/python-data-science-toolbox

A collection of useful data science tools and techniques

data-science jupyter-notebook pandas python scikit-learn statistics visualization

Last synced: 13 Apr 2026

https://github.com/dcs-training/introtostatistics

This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file

data-analysis r rmarkdown statistics

Last synced: 26 Mar 2025

https://github.com/windi-wulandari/sentiment-analysis-imdb

This project analyzes sentiment from 50,000 IMDb movie reviews, aiming to enhance time and cost efficiency. The Naive Bayes model with TF-IDF yielded the best results, achieving up to 99% savings in time and cost, surpassing the initial goals.

imdb-dataset machine-learning sentiment-analysis sentiment-classification statistics supervised-learning

Last synced: 17 Mar 2025

https://github.com/kddubey/microarray-kaggle

Analyze a dataset with 72 observations and 7,129 features

machine-learning statistics

Last synced: 08 Apr 2025

https://github.com/wilsoon7721/whatsapp-chatnalyzer

A tool designed to help users extract various statistics based on their WhatsApp chat exports, providing insights into messaging patterns, participant contributions, and more in a user-friendly interface.

chat statistics trends whatsapp

Last synced: 13 Apr 2026

https://github.com/aayushwankhade/z

z is a versatile programming language known for its simplicity and ease of use in developing web applications. With a strong focus on clean, readable code and efficient performance, z is ideal for both beginner and experienced developers looking to create high-quality software.

apache chatgpt data-engineering data-science data-visualization fish free game immutability machine-learning pattern-matching python statistics zwave

Last synced: 07 Sep 2025

https://github.com/noopur-zambare/nc_pruning

Dense network pruning algorithm inspired from Neural Collapse geometry

deep-learning robust-machine-learning statistics trustworthy-ai

Last synced: 08 May 2026

https://github.com/ik5/tracepath

For those who trespass against us

golang graph plot plotting statistics

Last synced: 05 Oct 2025

https://github.com/agbarnett/medianwatch

My blog "Median Watch"

blog metascience statistics

Last synced: 23 Feb 2026

https://github.com/gabboraron/biostatisztika_es_alkalmazasai

"A statisztika a matematika azon ága, melynek feladata, hogy eszközt adjon a politikusok kezébe, mellyel tetszőleges állítás és annak ellentéte is tudományos alapon igazolható"

biostatistics data-analysis data-visualization r statistics statistics-course

Last synced: 24 Oct 2025

https://github.com/sdttttt/huck

💾 Data collection server software. Configuration flexible, no external dependencies.

go http-server simple statistics

Last synced: 14 Jan 2026

https://github.com/mgckaled/rs_data-analytics

Repositório agregador do conteúdo da formação Data Analytics desenvolvido pelo Rocketseat.

data-analytics data-visualization python sql statistics

Last synced: 09 Oct 2025

https://github.com/lleonardus/analise-de-dados

Anotações da matéria de Análise de Dados do curso de Ciência da Computação da Estácio

jupyter-notebook statistics

Last synced: 09 Oct 2025

https://github.com/quanticpony/clothespin-probability-distribution

A small problem of a probability distribution of clothes pins along a string.

challenges-solved python simulation statistics

Last synced: 10 Oct 2025