An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/marberts/rsmatrix

An R package for making repeat-sales matrices

cran economics housing r r-package rstats statistics

Last synced: 10 Oct 2025

https://github.com/mine-cetinkaya-rundel/teach-data-public-good

Materials for the JSM 2020 session "Teaching with Data for the Public Good"

data-science education jsm2020 public-good statistics

Last synced: 21 Jan 2026

https://github.com/mkearney/tidycor

🎓 Tidy correlation tools for academics

correlation quantitative-methods rstats statistics tidyversity

Last synced: 11 May 2026

https://github.com/qcversity/econometrics-versity

This space is about econometrics using Python. It is a series of lectures about Econometrics, the theory, practical example using Python.

analysis econometrics economics finance modeling python statistics time-series

Last synced: 07 Oct 2025

https://github.com/andrey123815/andrey123815

:zap: Special repository for README.md public profile preview

achievements preview readme statistical-analysis statistics

Last synced: 06 Feb 2026

https://github.com/civo/civostatsd

This utility is a small Go application that runs in the background on Civo instances, collecting CPU, disk and memory usage and reporting them back to the Civo API so that they can be reported on in the control panel.

civo civo-api statistics

Last synced: 27 Oct 2025

https://github.com/flyingworkshop/desmos-creations

Links to cool graphs I made on Desmos! Almost all of the graphs are interactive, so play around with them as much as you like!

calculus desmos desmos-art geometry graph-theory linear-algebra phase-portrait statistics

Last synced: 18 Mar 2026

https://github.com/pharo-ai/tf-idf

Implementation of TF-IDF in Pharo

pharo statistics term-frequency tf-idf

Last synced: 18 Mar 2025

https://github.com/fnaufel/probestr

Probabilidade e Estatística com R

bookdown probability r statistics

Last synced: 02 Jan 2026

https://github.com/jmsv/hypothesis-testing-calculator

A2 Computing project: Hypothesis Testing Calculator for the binomial distribution, written in WinForms C# and Xamarin C# for Android

android computing csharp hypothesis-testing mathematics statistics winforms xamarin

Last synced: 14 Apr 2026

https://github.com/amazon-science/ssepy

Python package for stratifying, sampling, and estimating model performance with fewer annotations.

estimation sampling statistical-inference statistics stratified-sampling

Last synced: 28 Feb 2026

https://github.com/yoshoku/numo-random

Numo::Random provides random number generation with several distributions for Numo::NArray.

gem random ruby statistics

Last synced: 25 Apr 2025

https://github.com/mvuorre/heterogeneity-uncertainty

Manuscript: "Communicating causal effect heterogeneity"

heterogeneity psychology r statistics

Last synced: 06 Oct 2025

https://github.com/tomlav/snippets

Snippets from my research in geo-sciences (satellite, etc...), often in python.

cartopy notebook python satellite science science-research scientific-visualization snippets statistics

Last synced: 18 Oct 2025

https://github.com/dmazarei/ollg

Computes some Measures of OLL-G Family of Distributions

distribution oll-g package r rpackage rpackages statistics

Last synced: 23 Feb 2026

https://github.com/rubenv/ergast-mrd

Mirror of the Ergast.com database images

f1 formula1 statistics

Last synced: 27 Feb 2026

https://github.com/gurugray/bot-metrica

Simple Metrica for Telegram Bots

bots metrics metrika statistics telegram yandex yandex-metrika

Last synced: 11 Feb 2026

https://github.com/pgxcentre/genetest

Python package for efficient genetic association analyses

bioinformatics genetics genomics gwas phewas regression-models statistics

Last synced: 28 Jan 2026

https://github.com/tecnickcom/rpistat

Web-Service to collect system usage statistics

raspberry-pi statistics system usage

Last synced: 06 Jul 2025

https://github.com/git-quick-stats/git-keyword-stats

Gather statistics on keywording in a repo.

descendents git git-swear-stats gitlog keyword statistics swear

Last synced: 06 Feb 2026

https://github.com/snelsi/statistics-in-psychology

🌭 A collection of some data analytics methods

calculator dashboard math nextjs psychology react statistics typescript

Last synced: 29 Apr 2026

https://github.com/levminer/netflix-statistics

If you want to know how much time you spent in your life watching Netflix!

netflix netflix-statistics statistics

Last synced: 27 Feb 2026

https://github.com/moussaak/github-statistics

Exploring github API v4 using GraphQL

github-api-v4 graphql java-8 json maven statistics

Last synced: 14 Apr 2026

https://github.com/llnl/smallmoleval

Using machine learning to score potential drug candidates may offer an advantage over traditional imprecise scoring functions because the parameters and model structure can be learned from the data. However, models may lack interpretability, are often overfit to the data, and are not generalizable to drug targets and chemotypes not in the training data. Benchmark datasets are prone to artificial enrichment and analogue bias due to the overrepresentation of certain scaffolds in experimentally determined active sets. Datasets can be evaluated using spatial statistics to quantify the dataset topology and better understand potential biases. Dataset clumping comprises a combination of self-similarity of actives and separation from decoys in chemical space and is associated with overoptimistic virtual screening results. This code explores methods of quantifying potential biases and examines some common benchmark datasets.

machine-learning python statistics

Last synced: 26 May 2026

https://github.com/wlandau/rmedicine2021-slides

Slides for a possible talk at R/Medicine 2021 (submitted and under review, accepted talks not yet determined)

bayesian bayesian-statistics cmdstanr high-performance-computing markdown pipeline r r-markdown reproducibility stan stantargets statistics targets

Last synced: 01 May 2026

https://github.com/tmiddlet2666/ghstats

A command line tool to display download and repository metrics for GitHub repositories.

cli github statistics stats

Last synced: 19 Apr 2026

https://github.com/coatless-rpkg/msos

msos: Data Sets and Functions Used in Multivariate Statistics: Old School by John Marden

multivariate r statistics

Last synced: 11 Mar 2026

https://github.com/nnamanx/estat

The comprehensive statistical tool, tailored for diverse student levels including university, middle, and high school, redefines data understanding. From informative visualizations to interactive tools, it offers a dynamic learning experience, enabling students to unravel the complexities of statistics with confidence and precision.

animation bootstrap css html javascript statistical-learning statistics

Last synced: 05 May 2026

https://github.com/simphotonics/sample_statistics

Sample statistics, histograms, probability distributions, and random sample generators for Dart.

error-function probability-distribution random random-number-generators sample statistics

Last synced: 02 Apr 2026

https://github.com/johnthecoolingfan/motorchik

Motorchik, discord bot written in python

discord-bot factorio mod python statistics

Last synced: 09 Jun 2026

https://github.com/slub/statsdelta

A commandline command (Python3 program) that compares two (CSV) statistics with each other and generates delta values from the (old and the new) values

cli command-line-tool csv delta python statistics

Last synced: 11 Apr 2025

https://github.com/dcs-training/summerschool2024-stream2

Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file

data-analysis data-visualisation data-wrangling geographical-data r sentiment-analysis statistics text-analysis web-scraping

Last synced: 25 Apr 2025

https://github.com/matkoniecz/streetcomplete_usage_changeset_analysis

Process metadata of all changesets ever made to make StreetComplete usage analysis

openstreetmap statistics streetcomplete

Last synced: 27 Oct 2025

https://github.com/zenmate/stats

:bar_chart: Request statistics middleware

expressjs middleware node nodejs statistics stats

Last synced: 16 Oct 2025

https://github.com/stephane-martin/mailstats

Parse incoming emails for statistics

email golang milter parsing smtp statistics

Last synced: 24 Mar 2025

https://github.com/ruivieira/scala-gsl

GNU Scientific Library (GSL) bindings for Scala native

bindings gsl numerical scala scala-native statistics

Last synced: 13 Oct 2025

https://github.com/louis-heraut/explore2_toolbox

💧 R toolbox to provide a simple way of interacting with the code necessary to carry out diagnostic of the hydrological models used in Explore2

climate climate-change climate-data climate-model climate-science diagnostic environment explore2 hydrological-model hydrology inrae model r statistics

Last synced: 15 Mar 2026

https://github.com/jramkiss/jramkiss.github.io

Personal blog about statistics and machine learning

blog github-pages statistics

Last synced: 30 Apr 2025

https://github.com/dirkschumacher/tfjs-glm

Generalized linear models in tensorflow.js (WIP)

generalized-linear-models statistics tensorflow tensorflow-js

Last synced: 25 Apr 2026

https://github.com/matackett/modernize-regression

Supplemental materials for the article "Three principles for modernizing an undergraduate regression analysis course"

data-science education r regression statistics

Last synced: 28 Jan 2026

https://github.com/chaoticsomeone/ooen_mining

Statistische Auswertung der Online-Ausgabe der OÖN

austria newspaper statistics

Last synced: 04 May 2025

https://github.com/willie-conway/meta-data-analyst-portfolio

A comprehensive 📚portfolio showcasing projects and skills developed during the Meta Data Analyst Professional Certificate 🎓course, featuring 📈data analysis, 📊visualization, and 👨🏿‍💻management using various ⚙️tools.

big-data business-intelligence data-analysis data-cleaning data-driven-decisions data-management data-mining data-visualization exploratory-data-analysis jupyter-notebook machine-learning pandas porfolio predictive-modeling python spreadsheet-analysis sql statistics tableau visualization-tools

Last synced: 11 Apr 2026

https://github.com/thibaudcolas/statistical-inference

A little exploration of R's power for statistical inference

psychology statistical-inference statistics

Last synced: 15 Mar 2025

https://github.com/stdlib-js/stats-base-dmax

Calculate the maximum value of a double-precision floating-point strided array.

array domain extent extremes float64 javascript math mathematics max maximum node node-js nodejs range statistics stats stdlib strided strided-array typed

Last synced: 18 Apr 2025

https://github.com/stdlib-js/blas-ext-base-dsnansum

Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values, using extended accumulation, and returning an extended precision result.

array blas extended float32 javascript math mathematics node node-js nodejs single statistics stats stdlib strided strided-array sum summation total typed

Last synced: 14 Apr 2025

https://github.com/stdlib-js/random-streams-weibull

Create a readable stream for generating pseudorandom numbers drawn from a Weibull distribution.

continuous javascript math mathematics node node-js nodejs prng pseudorandom rand random readable rng seed seedable statistics stats stdlib stream weibull

Last synced: 26 Jan 2026

https://github.com/briatte/asr

Applied Stats with R and RStudio (first-year social-science tutorials)

course data-analysis data-science data-visualization r statistics

Last synced: 14 Apr 2026

https://github.com/tpapp/indirectlikelihood.jl

Framework for indirect likelihood estimation.

bayesian bayesian-inference econometrics julia-language statistics

Last synced: 25 Jul 2025

https://github.com/abmantz/rgw

A lightweight R-language implementation of the affine-invariant sampling method of Goodman & Weare (2010)

markov-chain-monte-carlo statistics

Last synced: 22 Oct 2025

https://github.com/puripant/thai-lottery-stat

สถิติเลขท้ายสองตัวของลอตเตอรี่ไทยย้อนหลัง 20 ปี (480 งวด)

lottery mosaic-plots statistics thailand visualization

Last synced: 15 May 2026

https://github.com/ldeluigi/higher-lower-pwned

A game website where you guess the most popular password to win! You can play solo, 1v1 or in a battle royale!

angular anti-cheating docker game nginx nodemon passwords statistics

Last synced: 07 Feb 2026

https://github.com/stdlib-js/stats-base-dists-erlang-logpdf

Natural logarithm of the probability density function (PDF) for an Erlang distribution.

continuous dist distribution erlang javascript ln log logarithm logpdf natural node node-js nodejs pdf probability statistics stats stdlib univariate

Last synced: 26 Oct 2025

https://github.com/xzghua/gitbook-plugin-statistics

baidu + cnzz + google analytics for gitbook plugin

baidu-analytics cnzz gitbook gitbook-plugin google-analytics statistics

Last synced: 12 Jul 2025

https://github.com/diraven/rmc14-stats

Last 30 days of RMC14

rmc14 statistics stats

Last synced: 11 Oct 2025

https://github.com/code-jl/nfl-point-kicker-data-scraper

A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.

automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping

Last synced: 06 Sep 2025

https://github.com/ganeshsankaran/matlab-ap-statistics

M-files for distribution functions, confidence intervals, and hypothesis tests

matlab statistics

Last synced: 12 Mar 2025

https://github.com/stdlib-js/stats-base-nanrange-by

Calculate the range of a strided array via a callback function, ignoring NaN values.

dispersion domain extent extremes javascript math mathematics max maximum min minimum node node-js nodejs range statistics stats stdlib strided strided-array

Last synced: 18 Aug 2025

https://github.com/stdlib-js/random-streams-kumaraswamy

Create a readable stream for generating pseudorandom numbers drawn from a Kumaraswamy's double bounded distribution.

continuous javascript kumaraswamy math mathematics node node-js nodejs prng pseudorandom rand random readable rng seed seedable statistics stats stdlib stream

Last synced: 25 Oct 2025

https://github.com/johnsesana/statistics-for-ds

General guide on topics to study for Data Science and Machine Learning

data-science machine-learning python statistics

Last synced: 22 Jan 2026

https://github.com/willemsleegers/how-to-science

A Quarto book with my conclusions regarding various methodological and statistical issues.

methodology statistics

Last synced: 23 Feb 2026

https://github.com/stdlib-js/random-iter-poisson

Create an iterator for generating pseudorandom numbers drawn from a Poisson distribution.

counts discrete generator javascript math mathematics node node-js nodejs poisson prng pseudorandom rand random rng seed seedable statistics stats stdlib

Last synced: 14 Apr 2026

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/trevorbollinger/statbot

Discord Server Statistics tool using discord.py, Django, and React

analytics discord django postgres react statistics

Last synced: 07 Feb 2026

https://github.com/mcanouil/insane

An Interactive Web Application for Quality Control and Analysis of Insulin Secretion from Pancreatic Beta Cells

beta-cells endoc-betah1 insulin-secretion pancreas r r-package rstats shiny statistics stats

Last synced: 15 Jul 2025

https://github.com/xxdavid/hipsta-czech-names

Discover the weirdest forenames used in the Czech Republic.

czech-republic names statistics

Last synced: 01 Mar 2025

https://github.com/mukhammadali/stopkorona

🦠 Coronavirus Statistics PWA built with Gatsby.js

coronavirus covid-19 covid19 gatsby graphql koronavirus kovid-19 react statistics

Last synced: 19 Jan 2026

https://github.com/kelvinleandro/manim-animations

Animations exploring various concepts in computer science, with a special focus on machine learning and statistics

classification-algorithm clustering-algorithm computer-science machine-learning machine-learning-algorithms manim manim-3b1b manim-animations python python3 regression-algorithms statistics

Last synced: 15 Feb 2026

https://github.com/m-damien/statslator.js

🔢 Conversion between statistical reporting styles

conversion estimation nhst p-value statistics

Last synced: 19 Mar 2026

https://github.com/adrn/princetonstatsseminar

Supplementary material for my lecture on 6 Feb.

adrn-teaching bayesian-inference probability python statistics

Last synced: 15 May 2026

https://github.com/stdlib-js/stats-base-smeanpn

Calculate the arithmetic mean of a single-precision floating-point strided array using a two-pass error correction algorithm.

arithmetic-mean array average avg central-tendency float float32 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed

Last synced: 10 Oct 2025

https://github.com/stdlib-js/datasets-suthaharan-multi-hop-sensor-network

Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.

data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib

Last synced: 10 Oct 2025

https://github.com/stdlib-js/random-streams-mt19937

Create a readable stream for a 32-bit Mersenne Twister pseudorandom number generator.

discrete generator integer javascript math mathematics node node-js nodejs prng pseudorandom rand randint random randu rng statistics stats stdlib uniform

Last synced: 03 Sep 2025