Statistics
Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.
- GitHub: https://github.com/topics/statistics
- Wikipedia: https://en.wikipedia.org/wiki/Statistics
- Related Topics: data-science, machine-learning, deep-learning, neural-network
- Last updated: 2026-06-25 00:26:05 UTC
https://github.com/pgxcentre/genetest
Python package for efficient genetic association analyses
bioinformatics genetics genomics gwas phewas regression-models statistics
Last synced: 28 Jan 2026
https://github.com/moussaak/github-statistics
Exploring github API v4 using GraphQL
github-api-v4 graphql java-8 json maven statistics
Last synced: 14 Apr 2026
https://github.com/mkearney/tidycor
🎓 Tidy correlation tools for academics
correlation quantitative-methods rstats statistics tidyversity
Last synced: 11 May 2026
https://github.com/viglino/insee-map
Display 2.3 million squares provided by the French statistical institute (INSEE) sub-dividing the country into regular grids.
grid insee map openlayers statistics
Last synced: 07 Feb 2026
https://github.com/attipaci/jnum
Java numerical classes for astronomy and beyond
astronomy complex-numbers configuration-parser fft fourier-transform java math-library multidimensional-arrays numerical-methods special-functions spherical-coordinates statistics units utilities wcs
Last synced: 06 Sep 2025
https://github.com/lin-123/vue-sensors
directive sensors-data statistics vue vue-sensors
Last synced: 11 Jul 2025
https://github.com/sambhav228/data_structure_algorithm
A collection of data structures and algorithms implemented in various programming languages.
Last synced: 04 May 2026
https://github.com/psychbruce/dpi
🛸 The Directed Prediction Index (DPI): Quantifying Relative Endogeneity of Outcome Versus Predictor Variables.
causal-inference causality causality-analysis directed-acyclic-graph influence linear-models linear-regression prediction simulation statistics
Last synced: 22 Oct 2025
https://github.com/kmedian/jackknife
Jackknife resampling, parameter estimation and stability test.
cross-validation estimation jackknife jackknife-resampling pypi stability-test statistics
Last synced: 10 Oct 2025
https://github.com/gastonstat/stat154
STAT 154: Modern Statistical Prediction and Machine Learning
classification-model clustering data-science machine-learning multivariate-analysis predictive-modeling principal-component-analysis statistical-learning statistics syllabus
Last synced: 23 Feb 2026
https://github.com/bitcoin-data/bitcoin-stats-archive
Archive of Bitcoin stats from public sources
Last synced: 17 Jan 2026
https://github.com/marberts/rsmatrix
An R package for making repeat-sales matrices
cran economics housing r r-package rstats statistics
Last synced: 10 Oct 2025
https://github.com/koodiklinikka/palkkakysely
data hacktoberfest koodiklinikka statistics
Last synced: 26 Jan 2026
https://github.com/alsami/covid-19-statistics
Web application showing the data available from the Covid19Api.
angular covid-19 monorepo ngrx statistics
Last synced: 20 Jan 2026
https://github.com/jmsv/hypothesis-testing-calculator
A2 Computing project: Hypothesis Testing Calculator for the binomial distribution, written in WinForms C# and Xamarin C# for Android
android computing csharp hypothesis-testing mathematics statistics winforms xamarin
Last synced: 14 Apr 2026
https://github.com/snelsi/statistics-in-psychology
🌭 A collection of some data analytics methods
calculator dashboard math nextjs psychology react statistics typescript
Last synced: 29 Apr 2026
https://github.com/llnl/smallmoleval
Using machine learning to score potential drug candidates may offer an advantage over traditional imprecise scoring functions because the parameters and model structure can be learned from the data. However, models may lack interpretability, are often overfit to the data, and are not generalizable to drug targets and chemotypes not in the training data. Benchmark datasets are prone to artificial enrichment and analogue bias due to the overrepresentation of certain scaffolds in experimentally determined active sets. Datasets can be evaluated using spatial statistics to quantify the dataset topology and better understand potential biases. Dataset clumping comprises a combination of self-similarity of actives and separation from decoys in chemical space and is associated with overoptimistic virtual screening results. This code explores methods of quantifying potential biases and examines some common benchmark datasets.
machine-learning python statistics
Last synced: 26 May 2026
https://github.com/tecnickcom/rpistat
Web-Service to collect system usage statistics
raspberry-pi statistics system usage
Last synced: 06 Jul 2025
https://github.com/carpentries-incubator/statistical-inference-for-biology
Statistical Inference for Biology
alpha biology carpentries-incubator english lesson r statistics
Last synced: 02 Sep 2025
https://github.com/mccarthy-m-g/psyc-615-lab
PSYC 615: Analysis of Variance
Last synced: 08 Apr 2025
https://github.com/tomlav/snippets
Snippets from my research in geo-sciences (satellite, etc...), often in python.
cartopy notebook python satellite science science-research scientific-visualization snippets statistics
Last synced: 18 Oct 2025
https://github.com/0todd0000/rft1d
One-Dimensional Random Field Theory in Python
functional-data-analysis hypothesis-testing probability statistical-methods statistics time-series
Last synced: 14 Jan 2026
https://github.com/zenmate/stats
📊 Request statistics middleware
expressjs middleware node nodejs statistics stats
Last synced: 16 Oct 2025
https://github.com/amazon-science/ssepy
Python package for stratifying, sampling, and estimating model performance with fewer annotations.
estimation sampling statistical-inference statistics stratified-sampling
Last synced: 28 Feb 2026
https://github.com/louis-heraut/explore2_toolbox
💧 R toolbox to provide a simple way of interacting with the code necessary to carry out diagnostic of the hydrological models used in Explore2
climate climate-change climate-data climate-model climate-science diagnostic environment explore2 hydrological-model hydrology inrae model r statistics
Last synced: 15 Mar 2026
https://github.com/mine-cetinkaya-rundel/bootcamper-nui-galway
Materials for the #rstats bootcamp for NUI Galway
data-science rstats statistics
Last synced: 18 Mar 2026
https://github.com/willie-conway/meta-data-analyst-portfolio
A comprehensive 📚 portfolio showcasing projects and skills developed during the Meta Data Analyst Professional Certificate 🎓 course, featuring 📈 data analysis, 📊 visualization, and 👨🏿‍💻 management using various ⚙️ tools.
big-data business-intelligence data-analysis data-cleaning data-driven-decisions data-management data-mining data-visualization exploratory-data-analysis jupyter-notebook machine-learning pandas porfolio predictive-modeling python spreadsheet-analysis sql statistics tableau visualization-tools
Last synced: 11 Apr 2026
https://github.com/tobiasschoch/wbacon
Weighted BACON algorithms
outlier outlier-detection r-package robust-regression statistics
Last synced: 23 Feb 2026
https://github.com/hifly81/bikedump
Bike Dump is a Java GUI that can be used to manage and extract stats from GPX 1.0, GPX 1.1 and TCX 2 activities from your cycling/mountain biking workouts. It also offers graphs and history stats.
biking-applications bing cycling extract-stats gpx java map mountain-bike openstreetmap routes statistics workouts
Last synced: 15 Mar 2026
https://github.com/paritoshtripathi935/machine_learning_projects
Various machine learning projects
data-science deep-learning machine machine-learning machine-learning-algorithms python scikit-learn statistics
Last synced: 06 May 2026
https://github.com/alfurka/synloc
A Python Package to Create Synthetic Tabular Data
clustering constrained-clustering copulas data-augmentation distributions k-means knn local-sampling machine-learning multivariate-distributions nonparametric-distribution oversampling python resampling sampling semi-parametric-modeling statistics synthetic synthetic-data synthetic-dataset-generation
Last synced: 14 Jan 2026
https://github.com/shawnlaffan/statistics-descriptive-pdl
A close to drop-in replacement for Statistics::Descriptive, but using PDL as the back end.
Last synced: 24 Feb 2026
https://github.com/zen-juen/neuropsynorms
An R package for computing neuropsychological normative data
assessment-tool neuropsychology r statistics
Last synced: 18 Jan 2026
https://github.com/flyingworkshop/desmos-creations
Links to cool graphs I made on Desmos! Almost all of the graphs are interactive, so play around with them as much as you like!
calculus desmos desmos-art geometry graph-theory linear-algebra phase-portrait statistics
Last synced: 18 Mar 2026
https://github.com/tbates/openmx.tmbundle
Textmate support for Structural equation modeling in OpenMx.
behavior-genetics cran machine-learning matrix-algebra optimization psychology r sem statistics structural-equation-modeling textmate textmate-bundle
Last synced: 19 Oct 2025
https://github.com/pharo-ai/tf-idf
Implementation of TF-IDF in Pharo
pharo statistics term-frequency tf-idf
Last synced: 18 Mar 2025
https://github.com/pulipulichen/html-lag-sequential-analysis
A calculator for lag sequential analysis.
jquery lag-sequential-analysis puli-portfolio statistics
Last synced: 05 Oct 2025
https://github.com/dmetivie/robustmeans.jl
Implement some Robust Mean Estimators
confidence-intervals julia julia-language robust robust-statistics statistics
Last synced: 05 Jan 2026
https://github.com/tomerfi/github-viewer-stats
Small NPM package for collecting your own GitHub statistics
github npm-package npm-script statistics
Last synced: 14 Feb 2026
https://github.com/rubenv/ergast-mrd
Mirror of the Ergast.com database images
Last synced: 27 Feb 2026
https://github.com/qcversity/econometrics-versity
A series of lectures on econometrics with Python, covering theory and practical examples.
analysis econometrics economics finance modeling python statistics time-series
Last synced: 07 Oct 2025
https://github.com/rikurauhala/insights
Visualize your coding journey
cypress data data-visualization github github-api javascript material-ui octokit react statistics typescript vite
Last synced: 11 Feb 2026
https://github.com/gurugray/bot-metrica
Simple Metrica for Telegram Bots
bots metrics metrika statistics telegram yandex yandex-metrika
Last synced: 11 Feb 2026
https://github.com/fnaufel/probestr
Probability and Statistics with R
bookdown probability r statistics
Last synced: 02 Jan 2026
https://github.com/mvuorre/heterogeneity-uncertainty
Manuscript: "Communicating causal effect heterogeneity"
heterogeneity psychology r statistics
Last synced: 06 Oct 2025
https://github.com/br3ndonland/r-guide
A quick reference guide and sample code for statistical programming in R
r rmarkdown rstudio science statistics
Last synced: 02 Apr 2026
https://github.com/dmazarei/ollg
Computes some Measures of OLL-G Family of Distributions
distribution oll-g package r rpackage rpackages statistics
Last synced: 23 Feb 2026
https://github.com/levminer/netflix-statistics
If you want to know how much time you spent in your life watching Netflix!
netflix netflix-statistics statistics
Last synced: 27 Feb 2026
https://github.com/mukundan314/arch-pkgstats-plotter
📦 Plot Arch Linux Package Statistics 📈
Last synced: 15 Feb 2026
https://github.com/lukem512/mann-whitney-utest
An NPM module for computing the Mann-Whitney U test (a nonparametric statistical test)
analysis mann-whitney statistics
Last synced: 08 Jul 2025
https://github.com/olgaele/playing-with-julia
Playing with data!
data data-analysis data-science julia statistics
Last synced: 19 Apr 2026
https://github.com/git-quick-stats/git-keyword-stats
Gather statistics on keywording in a repo.
descendents git git-swear-stats gitlog keyword statistics swear
Last synced: 06 Feb 2026
https://github.com/wlandau/rmedicine2021-slides
Slides for a possible talk at R/Medicine 2021 (submitted and under review, accepted talks not yet determined)
bayesian bayesian-statistics cmdstanr high-performance-computing markdown pipeline r r-markdown reproducibility stan stantargets statistics targets
Last synced: 01 May 2026
https://github.com/johnthecoolingfan/motorchik
Motorchik, discord bot written in python
discord-bot factorio mod python statistics
Last synced: 09 Jun 2026
https://github.com/Nelson-Gon/shinymde
A shiny interface to mde, the missing data explorer R package. Deployed at https://nelson-gon.shinyapps.io/shinymde
dashboard eda exploratory-data-analysis gui-application missing-data missing-values missingness open-source r r-package recoding-variables rstats scientific-visualization shiny-apps shinydashboard statistical-analysis statistics visualization
Last synced: 29 Jul 2025
https://github.com/jramkiss/jramkiss.github.io
Personal blog about statistics and machine learning
Last synced: 30 Apr 2025
https://github.com/stdlib-js/blas-ext-base-scusum
Calculate the cumulative sum of single-precision floating-point strided array elements.
accumulate array blas cumulative extended javascript math mathematics node node-js nodejs statistics stats stdlib strided strided-array sum summation total typed
Last synced: 28 Feb 2026
https://github.com/davidnabergoj/bootplot
Bootplot is a package for black-box uncertainty visualization.
animation black-box bootstrap bootstrap-sampling images plotting python scientific-visualization statistics uncertainty visualization
Last synced: 02 Mar 2026
https://github.com/stdlib-js/stats-base-meanwd
Calculate the arithmetic mean of a strided array using Welford's algorithm.
arithmetic-mean array average avg central-tendency javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array welford
Last synced: 13 Mar 2025
https://github.com/heyfoz/cpp-baseball-bat-stats-simulator
The Baseball Batting Stats Simulator Program (C++) uses the rand function to create a statistical model, which is used to simulate 1000 at-bats based on a player's career averages. After the simulation values are calculated and displayed, a loop asks the user if they would like to run another simulation.
baseball batting-statistics cpp simulator statistics
Last synced: 25 Mar 2025
https://github.com/stoverc/slots
A collection of slots-related code (initially in Python3, but perhaps more later)
data-analysis data-science monte-carlo-simulation probabilistic-programming probability probability-theory python3 slot-machine slots statistical-analysis statistics
Last synced: 06 Jun 2026
https://github.com/lordofsunshine/domain-analysis
Domain Analyzer is a lightning-fast, flexible Vue.js application for comprehensive domain analysis. It provides instant insights into DNS, IP details, geolocation, and SSL status.
Last synced: 31 Jan 2026
https://github.com/sipemu/anofox-statistics-rs
Statistical tests in Rust
r rust rust-lang statistical-testing statistical-tests statistics
Last synced: 13 Jan 2026
https://github.com/marianoakamery/smoking-cord
SmokingCord is a Discord bot that helps you track the number of cigarettes you smoke, providing daily and total stats, and offering helpful resources and tips to quit smoking.
addiction cigarette-tracker discord-bot health logging motivation python pythonbot quit-smoking resources smoking-cessation statistics wellness
Last synced: 18 Oct 2025
https://github.com/dmitriitimoshenko/nmrih
NMRiH Server by Krich
analytics dashboard metrics nmrih statistics
Last synced: 13 Mar 2026
https://github.com/GeiserX/secciones-nacionalidades
Foreign Insight - WebApp providing insights about nationalities in Spain (Source: Instituto Nacional de Estadística)
census-data dashboard data-analysis data-visualization demographics geospatial government-data immigration ine nationalities open-data population r self-hosted shiny shinydashboard spain spanish statistics webapp
Last synced: 08 Apr 2026
https://github.com/alexdawn/battle-cogitator
A WIP project providing a REST API, built with Flask, that calculates stats from simulated 40k combat.
40k battlescribe simulation statistics
Last synced: 26 Apr 2025
https://github.com/abrouter/symfony-abtest
ABRouter Symfony AB test is a simple package for the base library to run ab-tests and feature flags via ABRouter with Laravel. Prepare and run your ab-test in 10 minutes. Easy to manage and run. Can also be used as a feature flag or feature toggle. A free cloud version is available. Completely open-source.
a-b-testing ab-testing-php experiments feature-flags php php-ab-testing statistics symfony tracking-funnel
Last synced: 05 Feb 2026
https://github.com/stdlib-js/stats-incr-mstdev
Compute a moving corrected sample standard deviation incrementally.
accumulator corrected deviation dispersion incremental javascript math mathematics moving-variance node node-js nodejs sample sample-variance sliding-window standard statistics stats stdlib variance
Last synced: 13 Oct 2025
https://github.com/cfmtech/optimal_cleaning_for_singular_values_of_cross-covariance_matrices
Python scripts from the paper "Optimal cleaning for singular values of cross-covariance matrices" by Florent Benaych-Georges, Jean-Philippe Bouchaud, and Marc Potters (see https://arxiv.org/abs/1901.05543)
cross-correlation cross-correlation-processing denoising high-dimensional-data high-dimensional-probability high-dimensional-statistics probability probability-statistics probability-theory random-matrices random-matrix random-matrix-theory rotationally-invariant-estimator statistics
Last synced: 12 Apr 2025
https://github.com/stdlib-js/stats-incr-min
Compute a minimum value incrementally.
accumulator domain extent extremes incremental javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib
Last synced: 26 Apr 2025
https://github.com/iautom8things/go-hmm
golang hidden hidden-markov-model hmm machine-learning markov-chain ml statistics
Last synced: 04 May 2026
https://github.com/stdlib-js/stats-base-dists-normal-mode
Normal distribution mode.
center continuous dist distribution gaussian javascript location mode node node-js nodejs normal statistics stats stdlib univariate
Last synced: 15 Oct 2025
https://github.com/stdlib-js/stats-base-dists-studentized-range
Studentized range distribution.
continuous dist distribution javascript lib library node node-js nodejs probability range standard statistics stats stdlib studentized tukey univariate
Last synced: 12 Feb 2026
https://github.com/abmantz/rgw
A lightweight R-language implementation of the affine-invariant sampling method of Goodman & Weare (2010)
markov-chain-monte-carlo statistics
Last synced: 22 Oct 2025
https://github.com/willemsleegers/how-to-science
A Quarto book with my conclusions regarding various methodological and statistical issues.
Last synced: 23 Feb 2026
https://github.com/dk96-os/mathtools
Mathematical Software Components. This library is actively maintained, and aims to stay updated. New feature proposals are welcome, but may not be included.
data-science data-structures prime-numbers statistics
Last synced: 12 Oct 2025
https://github.com/stdlib-js/random-streams-discrete-uniform
Create a readable stream for generating pseudorandom numbers drawn from a discrete uniform distribution.
discrete javascript math mathematics node node-js nodejs prng pseudorandom rand random readable rng seed seedable statistics stats stdlib stream uniform
Last synced: 12 Oct 2025
https://github.com/clok/sig
Statistics in Go - CLI tool for quick statistical analysis of data streams
cli go golang simple-statistics statistics stats
Last synced: 23 Mar 2025
https://github.com/connor-mccarthy/nlp-visualization-of-statistical-learning-book
📙 End-to-end NLP and data visualization pipeline of the text from a machine learning textbook.
clustering glove glove-embeddings hdbscan linear-algebra nlp pca principal-component-analysis statistical-learning statistics word-vectors
Last synced: 18 Mar 2026
https://github.com/tathithienthanh/finaltest_database-sql-data-collection-for-ds
The final test of the "Database SQL and Data Collection for Data Science" course from The Ho Chi Minh City University of Science (19/09/2023)
chrome data-collection data-processing database final-test ipynb-jupyter-notebook mysql pymysql query scraping-websites selenium sql statistics visualization
Last synced: 17 Feb 2026
https://github.com/md-mafujul-hasan/hackerrank_10-days-of-statistics_python
Solutions to HackerRank's 10 Days of Statistics challenges, written in Python 😏
10-days-of-python 10-days-of-statistics 100-days-of-code hackerrank hackerrank-10-days-of-statistics hackerrank-30dayschallange hackerrank-challenges hackerrank-python hackerrank-solutions hackerrank-statistic hackerrank-statistics python python-hackerrank statistics
Last synced: 25 Apr 2026
https://github.com/stdlib-js/blas-ext-base-dnannsumkbn
Calculate the sum of double-precision floating-point strided array elements, ignoring NaN values and using an improved Kahan–Babuška algorithm.
array blas compensated extended float64 javascript math mathematics node node-js nodejs statistics stats stdlib strided strided-array sum summation total typed
Last synced: 09 Apr 2025
https://github.com/briatte/asr
Applied Stats with R and RStudio (first-year social-science tutorials)
course data-analysis data-science data-visualization r statistics
Last synced: 14 Apr 2026
https://github.com/stdlib-js/random-base-gamma
Gamma distributed pseudorandom numbers.
factory gamma generator javascript math mathematics node node-js nodejs prng pseudorandom rand random rgamma rng seed seedable statistics stats stdlib
Last synced: 03 Jul 2025
https://github.com/stdlib-js/stats-base-dmax
Calculate the maximum value of a double-precision floating-point strided array.
array domain extent extremes float64 javascript math mathematics max maximum node node-js nodejs range statistics stats stdlib strided strided-array typed
Last synced: 18 Apr 2025
https://github.com/stdlib-js/blas-ext-base-dsnansum
Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values, using extended accumulation, and returning an extended precision result.
array blas extended float32 javascript math mathematics node node-js nodejs single statistics stats stdlib strided strided-array sum summation total typed
Last synced: 14 Apr 2025
https://github.com/ldeluigi/higher-lower-pwned
A game website where you guess the most popular password to win! You can play solo, 1v1 or in a battle royale!
angular anti-cheating docker game nginx nodemon passwords statistics
Last synced: 07 Feb 2026
https://github.com/stdlib-js/stats-base-dists-f-entropy
F distribution differential entropy.
continuous dist distribution entropy f fisher information javascript nats node node-js nodejs prob probability shannon statistics stats stdlib univariate
Last synced: 03 Jul 2025
https://github.com/stdlib-js/stats-base-dists-negative-binomial-logpmf
Natural logarithm of the probability mass function (PMF) for a negative binomial distribution.
binomial density dist distribution javascript ln log logarithm logpmf mass natural negative node node-js nodejs pmf probability statistics stats stdlib
Last synced: 19 Oct 2025
https://github.com/python-opendata-analysis/opendata-casebook
Case studies and sample code for analyzing and using open data and official statistics.
data-analysis opendata python statistics
Last synced: 04 Apr 2026
https://github.com/shoorick/tram-age
Compute age of public transportation in Russian cities
grabbing-content python3 statistics
Last synced: 19 Oct 2025
https://github.com/stdlib-js/random-streams-weibull
Create a readable stream for generating pseudorandom numbers drawn from a Weibull distribution.
continuous javascript math mathematics node node-js nodejs prng pseudorandom rand random readable rng seed seedable statistics stats stdlib stream weibull
Last synced: 26 Jan 2026
https://github.com/xzghua/gitbook-plugin-statistics
baidu + cnzz + google analytics for gitbook plugin
baidu-analytics cnzz gitbook gitbook-plugin google-analytics statistics
Last synced: 12 Jul 2025
https://github.com/tpapp/indirectlikelihood.jl
Framework for indirect likelihood estimation.
bayesian bayesian-inference econometrics julia-language statistics
Last synced: 25 Jul 2025
https://github.com/stdlib-js/stats-incr-ewstdev
Compute an exponentially weighted standard deviation incrementally.
deviation dispersion emsd emv ewmsd ewmv exponential javascript math mathematics node node-js nodejs standard statistics stats stdlib var variance weighted
Last synced: 04 Feb 2026
https://github.com/stdlib-js/stats-base-snanmeanors
Calculate the arithmetic mean of a single-precision floating-point strided array, ignoring NaN values and using ordinary recursive summation.
arithmetic-mean array average avg central-tendency float32 javascript math mathematics mean node node-js nodejs ors statistics stats stdlib strided strided-array typed
Last synced: 19 Apr 2025
https://github.com/slub/solr-fstats
A Python3 program that extracts some statistics regarding field coverage from a Solr index.
cli command-line-tool python solr statistics
Last synced: 08 May 2025