An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/ohspc89/snu_cogsci_2017w_stats

2017 Winter SNU Cogsci Stat Study Group

introduction-to-r statistics

Last synced: 22 Jan 2026

https://github.com/izalu99/past-project-reports

Statistical Projects with R and knitr

predictive-modeling statistics

Last synced: 27 Jan 2026

https://github.com/hugomvale/odrpack.jl

Julia bindings for the modernized version of odrpack95.

mathematics regression statistics

Last synced: 20 Feb 2026

https://github.com/xiaoruizhu/spsp

A novel approach for feature selection based on the entire solution paths rather than the choice of a single tuning parameter, which significantly improves the accuracy of the selection.

feature-selection r-package statistics variable-selection

Last synced: 22 Oct 2025

https://github.com/stephensrmmartin/mires

R package for Bayesian measurement invariance assessment using mixed effects and shrinkage.

bayesian measurement-invariance mixed-effects psychometrics r stan statistics

Last synced: 22 Oct 2025

https://github.com/devmotion/calibrationanalysis.jl

Multi-language suite for analyzing calibration of probabilistic predictive models.

calibration julia machine machine-learning python r reliability statistics

Last synced: 27 Jan 2026

https://github.com/willysr/slackbuilds-stats

Statistic for SlackBuilds repository

sbo slackbuilds statistics

Last synced: 20 Feb 2026

https://github.com/podaac/gibs-imagestat

Calculate statistics on GIBS hosted imagery

development statistics tva

Last synced: 01 Jun 2026

https://github.com/jolars/slope.jl

Julia package for Sorted L-One Penalized Estimation (SLOPE)

lasso optimization regression slope statistics

Last synced: 28 Jan 2026

https://github.com/livrasand/ethicalmetrics

A privacy-focused, open-source web analytics platform designed as a powerful alternative to Google Analytics. Powered by an active community dedicated to ethical data tracking.

alternative analytics charts cloud-native ethical-analytics ethical-metrics gdpr go golang google-analytics google-analytics-alternative marketing metrics open-source product-analytics real-time self-hosted selfhosted statistics web-analytics

Last synced: 06 Feb 2026

https://github.com/alexanderthclark/intro-stats-2023-fall

Statistics 1101 Columbia University

statistics statistics-course stats

Last synced: 16 Mar 2026

https://github.com/stdlib-js/stats-base-dnanstdevwd

Calculate the standard deviation of a double-precision floating-point strided array ignoring NaN values and using Welford's algorithm.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance welford

Last synced: 29 Jan 2026

https://github.com/stdlib-js/random-array-pareto-type1

Create an array containing pseudorandom numbers drawn from a Pareto (Type I) distribution.

continuous extreme-value generator javascript math mathematics node node-js nodejs pareto power-law prng pseudorandom rand random rng statistics stats stdlib type1

Last synced: 30 Jan 2026

https://github.com/jorenham/lmo-web

Visual probability distribution & L-moment playground, using PyScript

l-moments lmo plotly probability pyodide pyscript python scipy scipy-stats statistics tl-moments visualization

Last synced: 15 Apr 2026

https://github.com/pr38/cox_ph_estimation_notebooks

Personal discovery work on estimating Cox Proportional hazards coefficients for for both breslow and efron ties, using both autograd and directly calculating the gradient and hessian

cox-regression dask data-science machine-learning numpy pytensor statistics survival-analysis

Last synced: 15 Apr 2026

https://github.com/akash1070/project---applied-statistics-

To dive deep into this data & find some valuable insights.

data-analysis data-science python statistics

Last synced: 30 Apr 2026

https://github.com/dindinyt37/roblox-stats-tracker

Automated statistics tracker for Roblox games and groups. Collects and logs visits, favorites, player counts, and more at configurable intervals

analytics automation csv data-collection game-analytics monitoring nodejs roblox roblox-api statistics

Last synced: 15 Apr 2026

https://github.com/llrs/biocstats

Analysis about the stats of packages in Bioconductor

bioconductor bioconductor-stats rmarkdown-document statistics

Last synced: 08 Feb 2026

https://github.com/enijkamp/notes

A set of notes on generative learning.

generative-models statistics

Last synced: 18 Mar 2026

https://github.com/matchaboy7/ngram-language-model

🧠 Build an N-gram language model to generate coherent text, predict next words, and evaluate performance with real-world data.

language-model laplace-smoothing machine-learning markov markov-assumption markov-chain model ngram ngram-language-model ngram-model nlp nltk perplexity pharo python smoothing-methods spell-checker statistics

Last synced: 16 Apr 2026

https://github.com/andrewmaksimchuk/spending

How much you spend money?

bash nodejs shell statistics

Last synced: 11 Feb 2026

https://github.com/froozeify/git-ladder

Github Contribution Hall of Fame accross multiple repositories

commits contributions halloffame ladders pull-requests repositories statistics

Last synced: 12 Feb 2026

https://github.com/andypicke/birthday-puzzle-paradox

Simulation of the Birthday Puzzle Problem in Python, including actual US births data

birthday-paradox python statistics

Last synced: 18 Apr 2026

https://github.com/stdlib-js/stats-base-dminabs

Calculate the minimum absolute value of a double-precision floating-point strided array.

abs absolute domain extent extremes javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib strided strided-array typed

Last synced: 14 Feb 2026

https://github.com/stdlib-js/random-strided-rayleigh

Fill a strided array with pseudorandom numbers drawn from a Rayleigh distribution.

continuous generator javascript math mathematics node node-js nodejs prng pseudorandom rand random rayleigh rng seed seedable statistics stats stdlib strided

Last synced: 15 Feb 2026

https://github.com/stdlib-js/datasets-harrison-boston-house-prices-corrected

A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).

boston data dataset datasets house housing javascript linear-regression node node-js nodejs prediction prices statistics stats stdlib value

Last synced: 15 Feb 2026

https://github.com/stdlib-js/random-strided-t

Fill a strided array with pseudorandom numbers drawn from a Student's t distribution.

continuous gaussian generator javascript math mathematics node node-js nodejs normal prng pseudorandom rand random rng statistics stats stdlib student t

Last synced: 15 Feb 2026

https://github.com/aurelliachristie/statistics-and-microsoft-excel-101

Materials about basic statistics & Microsoft Excel that I brought in my talk in Actuarial Science Day 2021 held by Business Mathematics program of Universitas Prasetiya Mulya.

excel statistics

Last synced: 28 Feb 2026

https://github.com/stdlib-js/random-array-normal

Create an array containing pseudorandom numbers drawn from a normal distribution.

continuous gaussian generator javascript math mathematics node node-js nodejs normal prng pseudorandom rand random rng seed seedable statistics stats stdlib

Last synced: 15 Feb 2026

https://github.com/predatorray/krew-index-tracker

Tracks the download stats of Krew plugins.

krew krew-index kubernetes statistics

Last synced: 01 Mar 2026

https://github.com/mundialis/r.change.stats

GRASS GIS addon that calculates change statistics from two discrete raster maps.

grass-addon grass-gis grass-gis-addons hermosa-earth incora statistics

Last synced: 02 Mar 2026

https://github.com/stdlib-js/blas-ext-base-ssumkbn

Calculate the sum of single-precision floating-point strided array elements using an improved Kahan–Babuška algorithm.

blas compensated extended javascript kahan kbn math mathematics node node-js nodejs statistics stats stdlib strided strided-array sum summation total typed

Last synced: 04 Mar 2026

https://github.com/dcs-training/intromachinelearning

This course is aimed at providing an introduction to machine learning for those with some beginner level python skills. Go to the readme file

data-analysis data-wrangling machine-learning python statistics

Last synced: 06 Mar 2026

https://github.com/abelsiqueira/dices.jl

Simple package defining dices, rolls, histograms, statistics, etc. in Julia.

dices julia julia-language rpg statistics

Last synced: 06 Mar 2026

https://github.com/pradeep-selva/github-language-visualizer

A website to see the statistics of languages used in your github repositories. Just enter and search for your github username!

analysis github language statistics svelte visualization

Last synced: 17 Apr 2026

https://github.com/vladaviedov/gh-lang-stats

Calculate Github language stats based on commit history

github github-api language-statistics statistics

Last synced: 03 Apr 2026

https://github.com/timmymatten/spikeball-stat-tracker

Spikeball stat tracking web app built with Streamlit and Python, designed to easily log and analyze player performance over multiple games.

data data-analysis data-visualization dataset matplotlib-pyplot multipage python spikeball statistics streamlit

Last synced: 18 Apr 2026

https://github.com/ayushsubedi/datainnews_v2

Data in News project, redone using Github Actions.

articles dataset flask github-actions heroku newspapers statistics twint work-in-progress

Last synced: 18 Apr 2026

https://github.com/assamirzafar/learning

My Roadmaps and challenges are in this repo...I will add my colab and kaggle notebook links along with py script files in here.

calculus convolutional-neural-networks deep-learning deep-neural-networks keras linear-algebra machine-learning numpy opencv probability python3 pytorch scikit-learn scipy statistics

Last synced: 05 Apr 2026

https://github.com/dimitri4788/git-snoop

:wrench: Command line tool for statistical analysis of a git repository.

command-line-tool git git-addons git-snoop statistics

Last synced: 21 Apr 2026

https://github.com/pronamic/wp-pronamic-telemetry

Pronamic Telemetry is a tool designed to collect and analyze usage data from WordPress websites that use WordPress solutions by Pronamic.

analytics insights pronamic statistics telemetry wordpress wordpress-development wordpress-plugin wordpress-site

Last synced: 22 Apr 2026

https://github.com/ganeshkumartk/ncov-2019

[EDA] Statistical modelling of Novel Coronavirus breakout nCoV-2019

corona data-analysis ncov ncov-2019 statistics wuhan wuhan-coronavirus wuhan-virus

Last synced: 05 Jun 2026

https://github.com/samirelanduk/fuzz

A lightweight Python utility providing values with associated uncertainty

mathematics statistics uncertainty uncertainty-propagation

Last synced: 06 Jun 2026

https://github.com/diehlpk/hpx_gsoc_stats

Google Summer of Code Statistics of Ste||ar group

gsoc hpx statistics

Last synced: 25 Apr 2026

https://github.com/yzimroni/spotifystreaminganalyzer

Analyze your Spotify's streaming history

python spotify statistics

Last synced: 06 Jun 2026

https://github.com/fynydd/fynydd.benford

Experiment with Benford's Law to find data anomalies in number lists and images (Windows, macOS, Linux, .NET 8.0, x64, Arm64, Apple Silicon)

analytics anomalous-numbers benfords-law dotnet fake fraud normal-distribution numbers patterns statistics

Last synced: 27 Apr 2026

https://github.com/stdlib-js/random-iter-pareto-type1

Create an iterator for generating pseudorandom numbers drawn from a Pareto (Type I) distribution.

continuous extreme-value generator javascript math mathematics node node-js nodejs pareto power-law prng pseudorandom rand random rng statistics stats stdlib type1

Last synced: 28 Apr 2026

https://github.com/nikhilbadyal/pgextras

Unofficial Python port of Heroku's pgextras that provides various statistics for a Postgres instance.

database heroku performance postgres postgresql statistics

Last synced: 29 Apr 2026

https://github.com/angelfqc/psphp

Process Survey With PHP

cli console php statistics surveys symfony

Last synced: 30 Apr 2026

https://github.com/alcestide/scianalytics

Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.

csv data-analysis data-science data-visualization pandas plotly python science-research statistics

Last synced: 30 Apr 2026

https://github.com/flazefy2/ds-airplane_crashes_dataset_since_1908

https://www.kaggle.com/datasets/landfallmotto/airplane-crashes-dataset-since-1908

csv data-science jupyter-notebook python statistics

Last synced: 01 May 2026

https://github.com/flazefy/kumande-ds

Kumande is consume management apps. So you can list and analyze your consume food or drink. Make a budget to set limit of spending. Analyze your health and sync it with your daily consume. Also you can create reminder to remind your daily food schedule or many more. Created using Python Jupiter Notebook

barchart chart csv data-science food jupiter-notebook matplotlib pie-chart python statistics

Last synced: 01 May 2026

https://github.com/italoseara/cet083

Medidas de Posição (ou separatrizes) - CET083

data-science matplotlib numpy pandas portuguese python statistics university

Last synced: 03 May 2026

https://github.com/fedemagnani/binance_watcher.js

A simple script that allows you to download historical price data from all pairs against USDT, BTC, BNB, ETH and to apply simple statistics on it, including the construction of the MVP and the OPTIMAL PORTFOLIO

allocation asset assets-management binance btc cryptocurrency portfolio statistics trading

Last synced: 04 May 2026

https://github.com/jose-jaen/facialrecognizer

Facial Recognition system with AI and Statistical Learning models

cnn computer-vision data-science deep-learning faces facial-recognition lda pca polars python pytorch statistics uc3m

Last synced: 04 May 2026

https://github.com/vedmaka/mediawiki-extension-metrica

Javascript-based statistics for Mediawiki

js mediawiki mediawiki-extension metrics php statistics

Last synced: 04 May 2026

https://github.com/flazefy2/ds-global_cybersecurity_threats

https://www.kaggle.com/datasets/atharvasoundankar/global-cybersecurity-threats-2015-2024

data-science data-visualization numpy python squarify statistics

Last synced: 05 May 2026

https://github.com/flazefy2/ds-customer_shopping

https://www.kaggle.com/datasets/bhadramohit/customer-shopping-latest-trends-dataset

data-science data-visualization jupiter-notebook numpy python sales shopping squarify statistics

Last synced: 05 May 2026

https://github.com/ujstor/streamlit-working-hours

The analysis pipeline involves combining the data forms, performing data cleaning, and starning streamlit server for statistical analysis.

data-pipeline numpy pandas statistics streamlit

Last synced: 06 May 2026

https://github.com/valatwork/statistics

Putting together study material from various sources, from linear algebra to machine learning

calculus learning linear-algebra machine-learning probability python pytorch statistics tensorflow

Last synced: 06 May 2026

https://github.com/mbjoseph/secmr

A research compendium for: “Using visual encounter data to improve capture-recapture abundance estimates”

bayesian capture-recapture ecology research-compendium statistics

Last synced: 09 Jun 2026

https://github.com/steviecurran/z-value

Python code for calculating Z-value from the p-value

numerical-methods p-value significance-testing statistics teaching-materials

Last synced: 06 May 2026

https://github.com/bessarodrigo/linear-regression-salaries

Análise dos fatores que influenciam os salários dos colaboradores de uma empresa, utilizando técnicas de regressão linear múltipla.

matplotlib pandas python regression regression-models seaborn statistics statsmodels

Last synced: 07 May 2026

https://github.com/cherouvim/imdb-stats

I had an argument with my wife on whether great movies are longer than 2 hours or not (I hate long films). In order to resolve this I had to geek it out.

bash imdb imdb-dataset mysql statistics

Last synced: 07 May 2026

https://github.com/emilhein/optifunc

NPM module to make optimizations and tests on your functions

helper-functions nodejs npm performance statistics testing

Last synced: 08 May 2026

https://github.com/subh888999/stackoverflow-tag-predtiction

A machine learning-powered Streamlit app that predicts relevant Stack Overflow tags based on question content, using NLP and multi-label classification for accurate and real-time tag suggestions.

machine-learning matplotlib multilabel-classification nlp nltk pandas python sns stackoverflow-api statistics webscraping

Last synced: 08 May 2026

https://github.com/ctsrc/simulated-victor-game

A simulation of a game that has characteristics similar to The Secretary Problem, but where the numbers are generated in a specific, known way

applied-probability css decision-theory html5 mathematics secretary-problem simulation statistics vanilla-js

Last synced: 08 May 2026

https://github.com/607011/1dollar

Simulation of a counterintuitive distribution problem

animation go golang matplotlib numpy plot python3 simulation statistics

Last synced: 09 May 2026