Statistics
Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.
- GitHub: https://github.com/topics/statistics
- Wikipedia: https://en.wikipedia.org/wiki/Statistics
- Related Topics: data-science, machine-learning, deep-learning, neural-network
- Last updated: 2026-07-03 00:26:24 UTC
https://github.com/johnkou97/numericalrecipes
bash differentiation fft fitting integration latex matrix-decompositions minimization neural-network numerical-analysis numerical-methods ode polynomial-regression python random-generation scipy shell sorting-algorithms statistics tex
Last synced: 11 Apr 2026
https://github.com/alex1990/numstat
Outputs summary measures of a list of numbers, such as the sum, minimum, and maximum.
Last synced: 16 May 2025
https://github.com/beliavsky/vector-error-correction
Simulate from and fit Vector Error Correction Models (VECM) for cointegrated time series using the Johansen method
cointegration econometrics error-correction-model fortran multivariate-time-series multivariate-time-series-analysis simulation statistics time-series-analysis vecm vector-error-correction-model
Last synced: 27 May 2026
https://github.com/beliavsky/multivariate-changepoints
Find changepoints in correlation and covariance
correlation covariance finance fortran modern-fortran quantitative-finance regime-shifts statistics time-series-analysis volatility
Last synced: 27 May 2026
https://github.com/dmarks84/coursework_project_text-mining-spam-analysis
Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.
classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization
Last synced: 11 Apr 2026
https://github.com/mmyros/bayes-window
Pretty, easy, flexible Bayesian estimation with data overlay
altair bayesian-inference inferential-statistics numpyro statistics
Last synced: 16 Jan 2026
https://github.com/g4brielvs/etudes
:robot: My collection of études
data-science etudes mathematics pytudes statistics teaching
Last synced: 23 Mar 2025
https://gitlab.com/prebens-phd-adventures/universal-edit-distance
A small Python library containing some generic metrics implemented in Rust
automatic speech recognition (ASR) metrics python rust statistics
Last synced: 07 May 2026
https://github.com/praisetompane/statistics
An attempt at unpacking statistics and its foundations in layman's terms.
Last synced: 16 Apr 2025
https://github.com/samsmithnz/azuredevopsstatistics
A PowerShell script to scan organizations for projects, repos, prs, artifacts, and work items
azure-devops azuredevops powershell statistics
Last synced: 31 Mar 2025
https://github.com/suvasish114/ml-models
Machine Learning models
jupyter-notebook linear-algebra machine-learning-algorithms mathematics probability-distribution statistics
Last synced: 27 Jul 2025
https://github.com/gyf9712/stat-theory-skills
A 4-skill pipeline for Claude Code: verify mathematical proofs → repair with literature support → sharpen the theory → write corrected proofs. Integrates Codex MCP for adversarial cross-review. Venue-audited reference library across statistics/econometrics/ML theory.
claude-code claude-skills econometrics latex machine-learning-theory mathematical-proof proof-verification research-tools statistics theoretical-statistics
Last synced: 27 May 2026
https://github.com/rakibhhridoy/differentprojects
Some of the learning projects I practiced on while getting started in data science. Not all of them, only a few that were stored in my local repository. They can be useful for beginner data science enthusiasts. Explore and learn!
data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics
Last synced: 11 Apr 2026
https://github.com/thechibo/estim
Distribution Parameter Estimation - DEPRECATED - please use https://github.com/thechibo/joker
estimation maximum-likelihood-estimation moment-estimation probability-distribution r r-package statistics
Last synced: 27 May 2026
https://github.com/bozenne/lavasearch2
Tools for model specification in the latent variable framework (add-on for the lava package)
inference latent-variable-models r statistics
Last synced: 05 Jan 2026
https://github.com/dcs-training/pca-2023
PCA workshop. In this repo, you are going to find the code and files we are going to use for the practical part of the workshop, together with the ppt associated with this training
data-analysis data-visualisation data-wrangling r statistics
Last synced: 20 Jun 2026
https://github.com/bessarodrigo/hypothesis_test_healthy_program
Hypothesis test of a population mean to assess the cholesterol levels of a company's employees.
hypothesis-test hypothesis-testing hypothesis-tests python statistics
Last synced: 02 May 2026
https://github.com/praju-1/deep_learning
This repository covers deep learning concepts. Deep learning is a subset of machine learning based on neural networks.
keras nltk pandas python sklearn spacy statistics tensorflow
Last synced: 11 Apr 2026
https://github.com/gianfranco7/pf_ci0130_b70866_b65580
Repository for the final project of the course CI-0130 Métodos de Modelado y Optimización (Modeling and Optimization Methods), second semester of 2021.
Last synced: 20 May 2026
https://github.com/beliavsky/dataframe
DataFrame for Fortran for floating point data
dataframe fortran modern-fortran statistics
Last synced: 05 Jan 2026
https://github.com/filip-kustura/statistics-olympics-analysis
A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.
correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project
Last synced: 05 Jan 2026
https://github.com/hhentschke/simulate-standardized-mean-differences
Computation and visualization of standardized mean differences from simulated data
bias-correction confidence-intervals effect-size hedges-g matlab repeated-measurements statistics
Last synced: 28 May 2026
https://github.com/kamilpytlak/hcv-research-analysis
🦠 Results of statistical analysis of HCV testing at the diagnostic point in Wrocław (part of a scientific publication)
Last synced: 16 Jan 2026
https://github.com/celiannourry/monty-hall-simulation
The Monty Hall problem is a statistical puzzle that fools most people. Here is a small simulation to show that switching doors is indeed the best choice.
monty-hall-problem monty-hall-simulator python statistics
Last synced: 18 Jun 2025
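The kind of simulation this entry describes can be sketched in a few lines of Python. This is a minimal illustration, not the repository's code: the function name `simulate_monty_hall`, the trial count, and the fixed seed are all assumptions made for the example.

```python
import random

def simulate_monty_hall(trials, switch, seed=0):
    """Estimate the win rate for a player who always switches or always stays."""
    rng = random.Random(seed)  # fixed seed (an assumption) so runs are repeatable
    wins = 0
    for _ in range(trials):
        car = rng.randrange(3)   # door hiding the car
        pick = rng.randrange(3)  # player's first choice
        # The host opens a door that is neither the player's pick nor the car.
        opened = rng.choice([d for d in range(3) if d != pick and d != car])
        if switch:
            # Move to the only door that is neither picked nor opened.
            pick = next(d for d in range(3) if d != pick and d != opened)
        wins += (pick == car)
    return wins / trials

stay_rate = simulate_monty_hall(100_000, switch=False)
switch_rate = simulate_monty_hall(100_000, switch=True)
print(f"stay: {stay_rate:.3f}, switch: {switch_rate:.3f}")
```

With enough trials the staying strategy should win about one third of the time and the switching strategy about two thirds.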
https://github.com/ritwika-das-gupta/flight_price_prediction
Flight price prediction using a decision tree model and machine learning concepts
cross-validation data-science data-visualization desiciontree flask gridsearchcv html-css-javascript hyperparameter-tuning json machine-learning matplotlib numpy pandas pickle python scikit-learn sklearn statistics supervised-learning
Last synced: 11 Apr 2026
https://github.com/pmatos/octo-racknews
Statistics gathering for Racket News
github racket racket-news statistics
Last synced: 13 May 2026
https://github.com/kamicollo/blog-posts
Code behind aurimas.eu blog posts
ab-testing analytics bayesian blog data-science data-visualization statistics
Last synced: 05 Jan 2026
https://github.com/gfyoung/statwrappers
Useful wrapper classes around Python stat library functionality
Last synced: 28 Aug 2025
https://github.com/carlosefr/bpstat-api-scripts
(Unofficial) scripts to fetch data from the Bank of Portugal's statistics API
economy government-data opendata portugal statistics
Last synced: 15 Mar 2025
https://github.com/stdlib-js/stats-strided-dmeanpn
Calculate the arithmetic mean of a double-precision floating-point strided array using a two-pass error correction algorithm.
arithmetic-mean array average avg central-tendency double float64 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed
Last synced: 26 Apr 2026
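A two-pass mean with error correction can be sketched as follows. This is a simplified Python illustration under stated assumptions, not the stdlib-js implementation: it takes a plain list rather than a strided array, and the name `mean_two_pass` is hypothetical.

```python
import math

def mean_two_pass(values):
    """Arithmetic mean using a second pass to correct rounding error."""
    n = len(values)
    if n == 0:
        return math.nan
    # Pass 1: ordinary mean, which may carry accumulated rounding error.
    estimate = sum(values) / n
    # Pass 2: the residuals should sum to zero in exact arithmetic;
    # whatever remains is used as a correction term.
    correction = sum(x - estimate for x in values) / n
    return estimate + correction

print(mean_two_pass([1.0, 2.0, 3.0, 4.0]))
```

The second pass costs one extra sweep over the data, which is the trade-off for the improved accuracy.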
https://github.com/auraoneai/iaa-kit
Modern inter-annotator agreement metrics with bootstrap intervals, ordinal support, and missing-data handling.
ai-evaluation evals inter-annotator-agreement statistics
Last synced: 28 May 2026
https://github.com/james-montgomery/romus
A fun package for exploring conjugate models and Gibbs samplers.
bayesian bayesian-inference gibbs gibbs-sampler gibbs-sampling machine-learning modeling statistics
Last synced: 10 Jun 2025
https://github.com/efrod/bca-survival-analyzer
A tool that helps you perform survival analysis on body composition data.
python statistics survival-analysis
Last synced: 13 Jan 2026
https://github.com/szymon-budziak/probability_theory_and_statistics_course_agh
Solutions from the Probability Theory and Statistics course at AGH UST. If you use it and it helped you, please leave a star ⭐
probability-theory r statistics
Last synced: 19 Mar 2025
https://github.com/mncube/simboil
A collection of simulation helper functions
iteration r sample simulation statistics
Last synced: 28 May 2026
https://github.com/aliciagilmatute/simulacion-estadistica
Under construction...
data-analysis data-science distribution-simulation distributions r rstats rstatses rstudio simulation simulation-studies statistics statistics-simulation
Last synced: 24 Mar 2025
https://github.com/gbrsales/kll
Streaming Quantile Approximation for R
r statistics stream-processing
Last synced: 31 Mar 2025
https://github.com/m-dadej/pd_estimation
Estimation of probability of default on novel data from Orbis
data-science predictive-modeling statistics
Last synced: 02 Jul 2025
https://github.com/alexp11223/d3flightsdatavisualization
Some visualizations for Kaggle 2015 Flight Delays and Cancellations dataset using d3.js and Leaflet
d3 d3js dataset flight-map leaflet sql statistics visualization
Last synced: 31 Mar 2025
https://github.com/stla/gfilmm
Generalized fiducial inference for linear mixed models
fiducial linear-mixed-models r statistics
Last synced: 10 Mar 2026
https://github.com/ritam2006/pmr-backend
A Java Spring Boot backend service for my Portfolio Management Risk project.
api financial-analysis java postgresql springboot sql statistics
Last synced: 02 Jul 2025
https://github.com/odessaz/portfolio-projects
This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science
applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics
Last synced: 12 Apr 2026
https://github.com/messente/messente-api-java
Messente API library: https://bintray.com/messente/messente-api/messente-api
number-lookup omnichannel phonebook statistics
Last synced: 24 Mar 2025
https://github.com/guusdias/programming-language-stats
Programming language statistics for the last three years.
Last synced: 02 Jul 2025
https://github.com/deve-sh/normal-distributor
A JavaScript Project to form a Normal Probability Distribution out of a provided Sample.
javascript normal-distribution statistics
Last synced: 03 Sep 2025
https://github.com/muonray/cern-root-financial-mechanics-and-market-analysis-codes
CERN ROOT codes used to develop the images and graphs in the article on my blog: http://muonray.blogspot.com/2014/09/particle-physics-software-and-financial.html
article cern cern-root drift efficient-frontier finance financial-analysis geometric-brownian-motion graphs market-analysis-codes monte-carlo monte-carlo-methods optimization-methods quant simulation statistics stock-market
Last synced: 24 Mar 2025
https://github.com/stdlib-js/stats-strided-dnanvariancetk
Calculate the variance of a double-precision floating-point strided array ignoring NaN values and using a one-pass textbook algorithm.
array deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance
Last synced: 28 Apr 2026
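The one-pass "textbook" variance formula accumulates the sum and the sum of squares in a single sweep. The sketch below is a plain-Python illustration under stated assumptions, not the stdlib-js code: it takes a list instead of a strided array, and the name `nan_variance_textbook` and the `correction` parameter (degrees-of-freedom adjustment, 1 for the sample variance) are hypothetical.

```python
import math

def nan_variance_textbook(values, correction=1.0):
    """One-pass textbook variance that skips NaN values."""
    n = 0
    total = 0.0     # running sum of x
    total_sq = 0.0  # running sum of x*x
    for x in values:
        if math.isnan(x):
            continue  # ignore missing values
        n += 1
        total += x
        total_sq += x * x
    if n == 0 or n - correction <= 0:
        return math.nan
    # Textbook identity: sum((x - mean)^2) = sum(x^2) - (sum(x))^2 / n
    return (total_sq - total * total / n) / (n - correction)

print(nan_variance_textbook([1.0, 2.0, math.nan, 3.0]))
```

This formula subtracts two potentially large, nearly equal quantities, so it can lose precision badly when the mean is large relative to the spread of the data.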
https://github.com/stdlib-js/stats-strided-dnanvarianceyc
Calculate the variance of a double-precision floating-point strided array ignoring NaN values and using a one-pass algorithm proposed by Youngs and Cramer.
array deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance
Last synced: 30 Apr 2026
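The Youngs and Cramer update keeps a running sum and a running sum of squared deviations, so it needs only one pass over the data. The sketch below is a plain-Python illustration under stated assumptions, not the stdlib-js code: it takes a list instead of a strided array, and the name `nan_variance_youngs_cramer` and the `correction` parameter are hypothetical.

```python
import math

def nan_variance_youngs_cramer(values, correction=1.0):
    """One-pass variance (Youngs and Cramer update) that skips NaN values."""
    n = 0
    total = 0.0  # running sum T_j
    ss = 0.0     # running sum of squared deviations S_j
    for x in values:
        if math.isnan(x):
            continue  # ignore missing values
        n += 1
        total += x
        if n > 1:
            # S_j = S_{j-1} + (j * x_j - T_j)^2 / (j * (j - 1))
            d = n * x - total
            ss += d * d / (n * (n - 1))
    if n == 0 or n - correction <= 0:
        return math.nan
    return ss / (n - correction)

print(nan_variance_youngs_cramer([1e9 + 1, math.nan, 1e9 + 2, 1e9 + 3]))
```

Because each update works with a difference rather than raw squares, the result stays accurate in the example even though the values sit far from zero.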
https://github.com/stdlib-js/stats-strided-dnanstdevtk
Calculate the standard deviation of a double-precision floating-point strided array ignoring NaN values and using a one-pass textbook algorithm.
array deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance
Last synced: 29 Apr 2026
https://github.com/cybcon/docker.dockerhubstats2mqtt
Collect repository statistics from Dockerhub and publish them to an MQTT topic.
docker-image dockerhub mosquitto-pub statistics
Last synced: 14 Apr 2026
https://github.com/omatthewy/java-explore-with-me
Template repository for ExploreWithMe project.
api-gateway backend docker docker-compose event-discover event-management java maven microservices postgresql postman rest-api spring-boot sql statistics user-authentication
Last synced: 12 Apr 2026
https://github.com/swharden/cleananalytics
Privacy-focused website and application telemetry
analytics google-analytics logging php statistics
Last synced: 17 Jul 2025
https://github.com/makcymal/silvera
My research on ML and statistics, optimization methods, CS algorithms, and numerical methods
algorithms data data-structures machine-learning numerical-methods statistics
Last synced: 01 Apr 2025
https://github.com/minesja/poker-service
A Monte Carlo simulation app for poker. Run thousands of rounds while holding certain variables constant to estimate the probabilities of the game.
Last synced: 20 Mar 2025
https://github.com/mfansler/hmmcov
Migration of https://code.google.com/archive/p/hmmcov/
Last synced: 08 Apr 2025
https://github.com/justdvnsh/algorithms
A list of all major algorithms, their tests and explanations ranging all the way from basic to advanced.
advanced algorithms basic data-structures intermediate javascript machine-learning-algorithms statistics
Last synced: 12 Jul 2025
https://github.com/srvanderplas/statistical_atlas
Framed Charts and the Statistical Atlas of 1870
census data ggplot2 graphics r statistics visualization
Last synced: 29 May 2026
https://github.com/sodascience/schools_synth_test
Test code for disaggregated synthetic control with simulated data
causal-inference simulation statistics synthetic-control
Last synced: 20 Mar 2025
https://github.com/beliavsky/starpac
Standards Time Series and Regression Package, a library of Fortran subroutines for statistical data analysis developed by the Statistical Engineering Division of the National Institute of Standards and Technology
arima autocorrelation fft forecasting gamma-function nonlinear-least-squares periodogram polynomial-regression rng statistics time-series-analysis
Last synced: 06 Jan 2026
https://github.com/elb4rto/craplog-gui
Parse Apache2 logs to create statistics
apache2 apache2-logs gui java logs statistics
Last synced: 23 Apr 2025
https://github.com/danpoynor/python-phrase-hunter-game
console-application demo dunder-main game oop python statistics unit-testing
Last synced: 29 May 2026
https://github.com/cleoold/linearly_varying_binomial_distribution_calcs_python
a "binomial" distribution with linearly increasing chance.
probability python-c-extension statistics
Last synced: 16 Mar 2025
https://github.com/lukeskywalkerii/website
data-analysis data-visualization powerbi python r sql statistics
Last synced: 12 Apr 2026
https://github.com/govind-prakash/r
A collection of R scripts designed for basic bioinformatics and biostatistics projects. This repository includes scripts for data analysis, visualization, and statistical modeling, catering to researchers and students in life sciences
data-science data-visualization r r-base rstudio statistics
Last synced: 05 Sep 2025
https://github.com/bozenne/lavareduce
Latent variable models with linear predictors
latent-variable-models lava-r-package r statistics
Last synced: 15 Jan 2026
https://github.com/lazernata/transport-problem
Bachelor's Thesis Work: Shiny app to solve the transport problem. Available in Spanish and English
operational-research rstudio shinyapps statistics
Last synced: 31 Mar 2025
https://github.com/elkronos/feature_selection
This repo contains a series of functions aimed at helping to select features and reduce dimensionality.
data-science dataanalytics feature-selection r statistics
Last synced: 09 Apr 2025
https://github.com/mrousavy/piechart
A rich PieChart control for WPF which supports easy MVVM bindings and data access
analytics chart control library pie-chart statistics wpf xaml
Last synced: 02 May 2026
https://github.com/rohankalbag/probability-random-processes
Probability and Random Processes - Autumn Semester 2021 - Indian Institute of Technology Bombay
probability python random-processes statistics
Last synced: 16 May 2026
https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling
All code for the data pipeline processing, statistical modelling, descriptive statistics, and plot visualizations from the airborne phase of Mahdavi et al. (2021) (Environmental Pollution). Project Milestone: 2018 - 2021
data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics
Last synced: 13 Apr 2026
https://github.com/dmarks84/coursework_project_text-mining-topic-modeling
Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.
data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining
Last synced: 12 Apr 2026
https://github.com/victoorv/analyse_biostatistique
Detailed thesis on multiple testing in biostatistics.
mathematics multiple-testing multiple-testing-correction probability r research research-project statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/samorso/ibpaper
This repository contains the code to replicate the numerical studies presented in the paper "A Flexible Bias Correction Method based on Inconsistent Estimators".
bias-reduction finite-sample-bias indirect-inference regression resampling statistics
Last synced: 02 Apr 2025
https://github.com/pabsan-0/vfs2
Vectorial Mutual-Information based feature selection
feature-selection mutual-information repos-ml statistics
Last synced: 17 Mar 2025
https://github.com/hafizhhasyhari/statistika-teori-dan-praktikum-s1-collage
Statistics with Excel and the R language
r-statistics statistical-methods statistics statistika
Last synced: 02 Apr 2025
https://github.com/gearkite/yapperfinder
Finds the yappers in your Matrix chats and displays some statistics
Last synced: 16 Mar 2025
https://github.com/schw4b/titanic
A reanalysis of the Titanic data set in R and Quarto.
Last synced: 25 Mar 2025
https://github.com/olafhaag/c3d-statistics
Analyze conditional values of C3D data generated by Phasespace Impulse X2 motion capture system
c3d motion-capture phasespace statistics
Last synced: 14 Jun 2025
https://github.com/mightymetrika/mmibain
Bayesian Informative Hypotheses Evaluation Web Applications
bayes-factor bayesian hypothesis informative r statistics
Last synced: 06 Jul 2025
https://github.com/grctest/boinc_scripts
automation boinc data-gathering gridcoin scripts server-status statistics team
Last synced: 26 Mar 2025
https://github.com/psygo/monte-carlo-ts
Monte Carlo (Gaussian) with TypeScript and SolidJS
monte-carlo simulation statistics
Last synced: 07 Dec 2025
https://github.com/cesar312/python-data-science-toolbox
A collection of useful data science tools and techniques
data-science jupyter-notebook pandas python scikit-learn statistics visualization
Last synced: 13 Apr 2026
https://github.com/dcs-training/introtostatistics
This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 26 Mar 2025
https://github.com/windi-wulandari/sentiment-analysis-imdb
This project analyzes sentiment from 50,000 IMDb movie reviews, aiming to enhance time and cost efficiency. The Naive Bayes model with TF-IDF yielded the best results, achieving up to 99% savings in time and cost, surpassing the initial goals.
imdb-dataset machine-learning sentiment-analysis sentiment-classification statistics supervised-learning
Last synced: 17 Mar 2025
https://github.com/kddubey/microarray-kaggle
Analyze a dataset with 72 observations and 7,129 features
Last synced: 08 Apr 2025
https://github.com/wilsoon7721/whatsapp-chatnalyzer
A tool designed to help users extract various statistics based on their WhatsApp chat exports, providing insights into messaging patterns, participant contributions, and more in a user-friendly interface.
chat statistics trends whatsapp
Last synced: 13 Apr 2026
https://github.com/aayushwankhade/z
z is a versatile programming language known for its simplicity and ease of use in developing web applications. With a strong focus on clean, readable code and efficient performance, z is ideal for both beginner and experienced developers looking to create high-quality software.
apache chatgpt data-engineering data-science data-visualization fish free game immutability machine-learning pattern-matching python statistics zwave
Last synced: 07 Sep 2025
https://github.com/noopur-zambare/nc_pruning
Dense network pruning algorithm inspired by Neural Collapse geometry
deep-learning robust-machine-learning statistics trustworthy-ai
Last synced: 08 May 2026
https://github.com/stdlib-js/stats-strided-dmean
Calculate the arithmetic mean of a double-precision floating-point strided array.
arithmetic-mean array average avg central-tendency double float64 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed
Last synced: 13 Apr 2026
https://github.com/ik5/tracepath
For those who trespass against us
golang graph plot plotting statistics
Last synced: 05 Oct 2025
https://github.com/gabboraron/biostatisztika_es_alkalmazasai
"Statistics is the branch of mathematics whose task is to give politicians a tool with which any claim, as well as its opposite, can be justified on scientific grounds"
biostatistics data-analysis data-visualization r statistics statistics-course
Last synced: 24 Oct 2025
https://github.com/sdttttt/huck
💾 Data collection server software. Configuration flexible, no external dependencies.
go http-server simple statistics
Last synced: 14 Jan 2026
https://github.com/mgckaled/rs_data-analytics
Aggregator repository for the content of the Data Analytics training program developed by Rocketseat.
data-analytics data-visualization python sql statistics
Last synced: 09 Oct 2025
https://github.com/lleonardus/analise-de-dados
Notes for the Data Analysis (Análise de Dados) class in the Computer Science program at Estácio
Last synced: 09 Oct 2025
https://github.com/quanticpony/clothespin-probability-distribution
A small problem of a probability distribution of clothes pins along a string.
challenges-solved python simulation statistics
Last synced: 10 Oct 2025