An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/mrousavy/piechart

A rich PieChart control for WPF which supports easy MVVM bindings and data access

analytics chart control library pie-chart statistics wpf xaml

Last synced: 02 May 2026

https://github.com/donmaruko/python-eda-toolkit

CLI-driven EDA toolkit with 30 commands covering text-related functions, statistical calculations, data visualization, and data manipulation.

data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud

Last synced: 06 May 2026

https://github.com/gabrielrufino/samplesize

📊 Package to determine the ideal sample size for a survey

math mathmatics samplesize statistics

Last synced: 24 Apr 2025

https://github.com/barabasz/primes

Python class that calculates primes, their statistics, and other related numbers in a specified range

prime-numbers primes statistics

Last synced: 05 Jul 2025

https://github.com/genietim/ache-analyzer

Principal component and other statistical analysis to detect correlations with aches

ache fitbit health statistics weather

Last synced: 08 Apr 2025

https://github.com/elkronos/feature_selection

This repo contains a series of functions aimed at helping to select features and reduce dimensionality.

data-science dataanalytics feature-selection r statistics

Last synced: 09 Apr 2025

https://github.com/lazernata/transport-problem

Bachelor's Thesis Work: Shiny app to solve the transport problem. Available in Spanish and English

operational-research rstudio shinyapps statistics

Last synced: 31 Mar 2025

https://github.com/stdlib-js/stats-array-min-by

Calculate the minimum value of an array via a callback function.

array domain extent extremes javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib

Last synced: 26 Apr 2026

https://github.com/beliavsky/fit-student-t-distribution

Fit a Student t distribution to univariate data via maximum likelihood

maximum-likelihood-estimation nelder-mead probability-distribution statistics t-distribution

Last synced: 27 Feb 2026

https://github.com/bozenne/lavareduce

Latent variable models with linear predictors

latent-variable-models lava-r-package r statistics

Last synced: 15 Jan 2026

https://github.com/lavakin/covid_positivity_statistics

Statistics exercise

statistics

Last synced: 11 Mar 2025

https://github.com/indianajaune/suicidator

Mini data science software for suicide statistics

c darknet data-science libcsv naivebayes statistics

Last synced: 09 Apr 2025

https://github.com/govind-prakash/r

A collection of R scripts designed for basic bioinformatics and biostatistics projects. This repository includes scripts for data analysis, visualization, and statistical modeling, catering to researchers and students in life sciences

data-science data-visualization r r-base rstudio statistics

Last synced: 05 Sep 2025

https://github.com/puravparab/claude-stats

Heatmaps for your claude.ai conversation history

claude heatmap nextjs python statistics

Last synced: 13 Apr 2026

https://github.com/cleoold/linearly_varying_binomial_distribution_calcs_python

a "binomial" distribution with linearly increasing chance.

probability python-c-extension statistics

Last synced: 16 Mar 2025

https://github.com/rodrigo5182/stats-strided-dnanvariance

Calculate the variance of a double-precision floating-point strided array ignoring NaN values.

deviation javascript mathematics nodejs sample-variance standard-deviation statistics stats stdlib strided-array unbiased var variance welford

Last synced: 02 Apr 2025

https://github.com/harmim/vut-msp-project

Statistics and Probability - Project

excel msp regression-analysis statistics vut vut-fit

Last synced: 08 Jan 2026

https://github.com/bt-88/deltasight-statistics

Provides efficient tracking of common statistical descriptors (mean, st. dev., sum, count) of a changing numeric sample

statistics

Last synced: 14 Jan 2026

https://github.com/ai-ahmed/bda-and-ds

Dual Course: EMC2 Data Scientist & Big Data Analytics & IBM Professional Data Science Course

analytics big-data data-science jupyter-notebook jupyterlab linear-algebra mathematics matplotlib ml numpy pandas probability seaborn sklearn statistics

Last synced: 07 May 2026

https://github.com/tharun-mahendra/statistics

A repository for statistics and data analysis projects.

statistics

Last synced: 07 May 2025

https://github.com/gearkite/yapperfinder

Finds the yappers in your Matrix chats and displays some statistics

matrix statistics

Last synced: 16 Mar 2025

https://github.com/gnikolovski/projects_stats

Drupal 8/9 - Projects Stats provides a block, which displays a table or a list with project names and downloads count

drupal drupal-8 drupal-9 drupal-module statistics

Last synced: 05 Aug 2025

https://github.com/mituskillologies/krai-sppu-mca

This repository contains all the practicals for the subject "Knowledge Representation in Artificial Intelligence" of the MCA program under Savitribai Phule Pune University, Pune. Programmed by Tushar B. Kute.

artificial-intelligence artificial-neural-networks convolutional-neural-network decision-trees deep-learning machine-learning neural-networks recurrent-neural-networks statistics support-vector-machine

Last synced: 06 Jul 2025

https://github.com/olafhaag/c3d-statistics

Analyze conditional values of C3D data generated by the Phasespace Impulse X2 motion capture system

c3d motion-capture phasespace statistics

Last synced: 14 Jun 2025

https://github.com/elb4rto/craplog

Creates statistics from Apache2 log files

apache2 apache2-logs apache2-statistics cli gui logs statistics

Last synced: 16 Oct 2025

https://github.com/juangesino/stats-distributions

Interactive probability distribution tables for easy handling.

meteor meteorjs probability probability-distributions statistics

Last synced: 16 Mar 2025

https://github.com/rustkas/statistics-with-rust

"Statistics with Rust" is your comprehensive resource to unlock Rust's true potential in modern statistical methods.

rust rust-example statistics

Last synced: 21 Mar 2025

https://github.com/mightymetrika/mmibain

Bayesian Informative Hypotheses Evaluation Web Applications

bayes-factor bayesian hypothesis informative r statistics

Last synced: 06 Jul 2025

https://github.com/zajichek/zajichek

Source code for my personal/professional website

consulting data-science statistics website

Last synced: 21 Feb 2026

https://github.com/ergo35/polsl-ms

Statistical Methods project in R

methods r rstudio statistics

Last synced: 04 Sep 2025

https://github.com/xstupi00/Theoretical-Assignments

Projects with theoretical assignments completed during the master's degree.

automata complexity information-security markov-chain petri-nets statistics storm vut vut-fit

Last synced: 11 Mar 2025

https://github.com/elb4rto/craplog-gui

Parse Apache2 logs to create statistics

apache2 apache2-logs gui java logs statistics

Last synced: 23 Apr 2025

https://github.com/beliavsky/starpac

Standards Time Series and Regression Package, a library of Fortran subroutines for statistical data analysis developed by the Statistical Engineering Division of the National Institute of Standards and Technology

arima autocorrelation fft forecasting gamma-function nonlinear-least-squares periodogram polynomial-regression rng statistics time-series-analysis

Last synced: 06 Jan 2026

https://github.com/ruancarllo/whatsranker

A tool to identify the most active conversations on your WhatsApp

messages statistics whatsapp

Last synced: 29 Apr 2026

https://github.com/joeloparco/laptop-analysis

Final project for COSC 3570, Intro to Data Science. The project aimed to find a relationship between laptop price and other laptop characteristics using linear regression.

juypter-notebook latex python statistical-analysis statistics

Last synced: 07 May 2026

https://github.com/sodascience/schools_synth_test

Test code for disaggregated synthetic control with simulated data

causal-inference simulation statistics synthetic-control

Last synced: 20 Mar 2025

https://github.com/deypadma2020/dataanalysis-mlalgo

Practice repository for data analysis, feature engineering, statistics, web scraping, and building ML model pipelines in Python.

data-analysis eda feature-engineering machine-learning-algorithms ml-pipeline statistics web-scraping

Last synced: 30 May 2026

https://github.com/chrismainey/chief_analyst_presentation

My presentation for Chief Analyst interview with the Health Economics Unit

data-science interview nhs rstats statistics xaringan

Last synced: 14 Jul 2025

https://github.com/diegopinate/docker-stats-viewer

Simple docker stats viewer made with copilot

docker html javascript npm statistics

Last synced: 12 Apr 2026

https://github.com/psygo/monte-carlo-ts

Monte Carlo (Gaussian) with TypeScript and SolidJS

monte-carlo simulation statistics

Last synced: 07 Dec 2025

https://github.com/srvanderplas/statistical_atlas

Framed Charts and the Statistical Atlas of 1870

census data ggplot2 graphics r statistics visualization

Last synced: 29 May 2026

https://github.com/maxgfr/regressio

Zero-dependency TypeScript regression, classification & statistics library. OLS, Ridge, Lasso, Elastic Net, Logistic, KNN, Neural Network + diagnostics + preprocessing. Optional Rust/WASM engine.

bun knn lasso linear-regression logistic-regression machine-learning neural-network ols regression ridge-regression statistics typescript wasm zero-dependencies

Last synced: 23 Jun 2026

https://github.com/zietzm/webgwas-analysis

Figures and analysis for the WebGWAS project paper

gwas python r statistics

Last synced: 07 May 2026

https://github.com/hannahgsimon/halmodeling2024graphs

Created code to develop and analyze statistical graphs for the spatial radiotherapy model, which can be found at https://github.com/hannahgsimon/HALModeling2024. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/crhntr/visual_stats

Some visualizations for Math352 Undergraduate Probability and Statistics

education probability statistics vuejs2

Last synced: 07 Sep 2025

https://github.com/dcs-training/introtostatistics

Repository containing all the materials for the Introduction to Statistics course. See the README file.

data-analysis r rmarkdown statistics

Last synced: 26 Mar 2025

https://github.com/galihap76/statistik-python

Mean, median, and mode in Python statistics for machine learning.

python-machine-learning statistics

Last synced: 02 Jul 2025

https://github.com/ayakovlenko/scalai

Artificial Intelligence in Scala

linear-algebra machine-learning statistics

Last synced: 17 Mar 2025

https://github.com/guusdias/programming-language-stats

Programming languages statistics in the last three years.

csv python statistics

Last synced: 02 Jul 2025

https://github.com/messente/messente-api-csharp

Messente API library: https://www.nuget.org/packages/com.Messente.Api

number-lookup omnichannel phonebook statistics

Last synced: 24 Mar 2025

https://github.com/messente/messente-api-java

Messente API library: https://bintray.com/messente/messente-api/messente-api

number-lookup omnichannel phonebook statistics

Last synced: 24 Mar 2025

https://github.com/elkronos/bmber

Bayesian Model Building and Evaluation Repository

bayesian-inference bayesian-statistics r rstan sensitivity-analysis statistics

Last synced: 17 Mar 2025

https://github.com/kddubey/microarray-kaggle

Analyze a dataset with 72 observations and 7,129 features

machine-learning statistics

Last synced: 08 Apr 2025

https://github.com/cjabradshaw/samebirthdayshiny

Probability two people in a group share the same birthday

mathematics probability statistics

Last synced: 21 Feb 2026

https://github.com/cjabradshaw/guns

Data on gun ownership and death/injury rates, plus associated R code

gun-deaths gun-ownership linear-models statistical-assumptions statistics

Last synced: 21 Feb 2026

https://github.com/odessaz/portfolio-projects

This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science

applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics

Last synced: 12 Apr 2026

https://github.com/thautwarm/bioinfoplus

A tool framework for bioinformatics written in multiple scientific languages.

bioinformatics scientific-computing statistics toolchain

Last synced: 08 May 2026

https://github.com/ritam2006/pmr-backend

A Java Spring Boot backend service for my Portfolio Management Risk project.

api financial-analysis java postgresql springboot sql statistics

Last synced: 02 Jul 2025

https://github.com/stla/gfilmm

Generalized fiducial inference for linear mixed models

fiducial linear-mixed-models r statistics

Last synced: 10 Mar 2026

https://github.com/ccrisc/metaanalysis

A meta-analysis of whether an increase in the minimum wage compresses wage inequality.

data-analysis-r learn meta-analysis statistics

Last synced: 23 Aug 2025

https://github.com/loryshamadache/kdd99-intrusion-detection

Network intrusion detection using the KDD Cup 1999 dataset. Explores multiple ML models and combines KNN with Random Forest for optimal results.

kdd99 ml network statistics

Last synced: 20 Jan 2026

https://github.com/drorata/z-vs-ch2-tests

Comparing the z-test and the chi-square test

statistics

Last synced: 05 Oct 2025

https://github.com/alexp11223/d3flightsdatavisualization

Some visualizations for Kaggle 2015 Flight Delays and Cancellations dataset using d3.js and Leaflet

d3 d3js dataset flight-map leaflet sql statistics visualization

Last synced: 31 Mar 2025

https://github.com/mightymetrika/gifistab

A Stability Assessment Framework

datascience r research statistics

Last synced: 05 Oct 2025

https://github.com/jedrzejszelc/my_projects

A collection of Jedrzej (Andrew) Szelc's projects in Python, Robot Framework, SQL, and R.

machine-learning python3 rlanguage robotframework sql statistics xml xml-parser

Last synced: 18 May 2026

https://github.com/m-dadej/pd_estimation

Estimation of probability of default on novel data from Orbis

data-science predictive-modeling statistics

Last synced: 02 Jul 2025

https://github.com/josepablodmg/python--linear-regression---housing-exercise

A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.

california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization

Last synced: 05 Oct 2025