An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/karsterr/repeated-measurement

An R-based workflow for conducting repeated measures ANOVA using the ez package, with data wrangling via tidyverse and visualization through ggplot2. Includes data import, transformation to long format, statistical analysis, and graphical summary.

anove data-analysis experimental-design ezanove ggplot2 r repeated-measurements rstats statistics tidyverse

Last synced: 18 Sep 2025

https://github.com/koulanurag/card-arrangement-game

Card Arrangement Game to introduce statistical notions in fun way :game_die: :black_joker: :slot_machine:

card-game game nodejs statistics

Last synced: 18 May 2026

https://github.com/kartikchugh/covid-case-model

Estimation of undetected COVID cases from mortality data (RaccoonHacks 2020 - Data Science Award)

coronavirus-tracking covid-case-count notebook-jupyter statistics

Last synced: 03 Apr 2025

https://github.com/lukaszlapaj/r

This repo contains code created on Probability Methods and Statistics course.

probability probability-statistics r statistics

Last synced: 26 Jul 2025

https://github.com/m0zgen/pyng3

⚙️ Python and Pyng3 remote server pinger with statistics

ip list ping ping3 python3 statistics

Last synced: 02 Mar 2025

https://github.com/ramapinnimty/udacity-mlfoundation-nanodegree

This is a repository containing solutions to the assignments that are a part of the Udacity Machine Learning Foundation Nanodegree program.

assignments data-analysis python3 statistics udacity-machine-learning-nanodegree

Last synced: 26 Jul 2025

https://github.com/spacebakery/crunchie-munchies-project

Analyze Data with Pyhton

numpy python statistics

Last synced: 27 Jul 2025

https://github.com/esteban2505j/analysisyeldreviews

This project focuses on developing a service designed to help businesses quickly obtain useful summaries of their customer reviews

analytics business-analytics jupyter-notebook nlp nltk-library plotting python review-analysis statistics

Last synced: 18 May 2026

https://github.com/mballarin97/bayesnetworks

Project for the course of Advanced statistics of physical analysis

bayesian-inference bayesian-networks k2 network-generator physical-analysis statistics

Last synced: 12 Sep 2025

https://github.com/sean-doody/gmu-chss-degrees

Analysis of labor market outcomes for humanities and social science college degree holders.

data-visualization r statistics

Last synced: 05 Jul 2025

https://github.com/cpmachado/r-programming-for-statistics-and-data-science

My source code for exercises of a given udemy course on statistics

r rlang statistical-learning statistics statistics-course

Last synced: 24 Aug 2025

https://github.com/nextzhou/lpfloat

low precision float

buckets golang statistics

Last synced: 17 Jan 2026

https://github.com/peterk/tableau-kolada-proxy

A Web Data Connector for Kolada <-> Tableau

govtech kolada statistics sweden tableau tableau-connector

Last synced: 27 Apr 2026

https://github.com/acmo0/meteo

Create graphics and statistics based on your meteorological data.

glade graphics gtk3 meteorology python3 statistics

Last synced: 16 May 2026

https://github.com/mituskillologies/numerical-methods-pgdhpcap-cdac-dec24

The program of Numerical Methods in course PGDHPCAP at CDAC. Conducted by Tushar B. Kute, December 2024

linear-algebra linear-regression mathematics numerical-methods statistics

Last synced: 27 Jul 2025

https://github.com/flazefy2/ds-100_healthiest_foods_nutrition_and_origin

https://www.kaggle.com/datasets/prajwaldongre/top-100-healthiest-food-in-the-world

data-science dataset health jupiter-notebook matplotlib python statistics

Last synced: 20 May 2026

https://github.com/aht205/supervised_learning_classification_bayesian_inference

Bayesian inference with JAGS project along with supervised learning (KNN, QDA, RF, SVM) in R; reproducible coursework

bayesian-inference classification jags machine-learning markdown mcmc r random-forest-classifier roc statistics

Last synced: 18 May 2026

https://github.com/timrijckaert/soccerstatisticview

Simple view that displays statistics for two soccer teams

android soccer statistics view

Last synced: 18 May 2026

https://github.com/tupui/hdr-boxplot

Functional highest density region boxplot

python statistics uncertainty-analysis visualization

Last synced: 27 Mar 2025

https://github.com/zenklinov/monte-carlo

This project is a simple implementation of the Monte Carlo Simulation method using Python. Its primary purpose is to demonstrate how Monte Carlo simulations can be used to estimate expected values from random distributions or stochastic processes.

monte-carlo sampling simulation statistics

Last synced: 30 Jun 2026

https://github.com/merkys/mixturefitting

R package for fitting mixture distributions to data using various approaches

expectation-maximization mixture-modelling r statistics

Last synced: 01 Apr 2025

https://github.com/favstats/delib_slides

Slides for the Deliberation Across the World Paper

analysis deliberation paper slides statistics

Last synced: 11 Jan 2026

https://github.com/xuender/kstats

Golang statistics library package that supports v1.18+.

algorithms analytics data go golang kstats machine-learning math rounding statistics

Last synced: 20 Jul 2025

https://github.com/juanjfarina/git-stats-report

Git Stats Report tool for obtaining the different authors and contributions for a given project in certain timelapse.

2024 analysis cicd cli git product-idea released report statistics tool

Last synced: 09 Apr 2026

https://github.com/stdlib-js/stats-strided-dsvariancepn

Calculate the variance of a single-precision floating-point strided array using a two-pass algorithm with extended accumulation and returning an extended precision result.

array deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 20 May 2026

https://github.com/fawadeqbal/data-science

A comprehensive repository covering essential Data Science concepts using PyTorch, including anomaly detection, classification, clustering, regression, and more. Includes hands-on implementations and tutorials for each concept.

calculus calculus-2 data-science data-visualization dataset machine-learning python pytorch statistics

Last synced: 29 Jul 2025

https://github.com/zeeshier/probability-and-statistics

Probability and Statistics for Machine Learning & Data Science

data-science machine-learning math mathematics probability statistics

Last synced: 29 Jul 2025

https://github.com/neoncitylights/nist-ehandbook-datasets

Open-source datasets from the NIST/SEMATECH e-Handbook of Statistical Methods

datasets nist statistics

Last synced: 20 Sep 2025

https://github.com/priyanshul28/ml_classification_eda_parkinsonsdisease

A guided Machine Learning Classification exercise on the Parkinson's Disease dataset demonstrating the use of Logistic Regression, Neural Network Classifiers, Decision Trees, Random Forests and XGBoost algorithms, as well as Data Preprocessing and Exploratory Data Analysis.

classification machine-learning pandas python scikit-learn statistics

Last synced: 16 Apr 2026

https://github.com/blakegearin/extreme_overclocking_client

Ruby client for Extreme Overclocking's Folding@home Data Export

client eoc extreme folding-at-home foldingathome overclocking ruby ruby-client ruby-gem statistics

Last synced: 29 Jul 2025

https://github.com/hauntedhost/modern-drive

ModernDive: An Introduction to Statistical and Data Sciences via R at http://www.moderndive.com

data-science data-visualization r statistics

Last synced: 29 Jul 2025

https://github.com/poeschl/homeassistant-addon-stats

A alternative frontend for the official addon statistics from Home Assistant which lists all available addons.

alternative home-assistant-addons statistics

Last synced: 19 Mar 2026

https://github.com/cyprianfusi/data-scientist-technical-exercise-10ds

With recommendations to UK Department for Education of 10 Local Authorities where National Tutoring Programme (NTP) should be intensified and a response to UK Secretary of Health regarding a 76% Accident and Emergency (A&E) performance target which seems far-fetched.

data-analysis data-cleaning data-visualization hypothesis-testing pandas-python policy statistics

Last synced: 21 Sep 2025

https://github.com/bessarodrigo/hypothesis_test_human_development_index

Testes de Hipóteses | Média e Variância de duas populações - Índice de Desenvolvimento Humano de municípios do Nordeste, Norte e Sul.

hypothesis-testing hypothesis-tests python statistics

Last synced: 18 Apr 2026

https://github.com/marcomadera/test-for-random-numbers

Test for random number between 0 and 1

data-analysis statistics

Last synced: 09 Jul 2025

https://github.com/jabulente/tukey-s-hsd-for-pairwise-group-comparisons

This repository contains a Python project dedicated to performing Tukey’s Honest Significant Difference (HSD) test for pairwise group comparisons.

ai anova-analysis anova-test data-science data-visualization machine-learning math matplotlib-pyplot post-hoc post-hoc-analysis re real-world-problem-solving scipy-stats seaborn-plots statistics statsmodels string turkey-hsd

Last synced: 29 Jul 2025

https://github.com/nicolay-r/semeval2024-task3

The supplementary sevice over THoR Chain-of-Thought framework as part of SemEval-2024 Task 3 paper

analysis datasets-preparation emotion-analysis semeval semeval-2024 span statistics

Last synced: 05 Apr 2025

https://github.com/eivindarvesen/ssb-table

A sortable table presenting data from SSB.

api-client javscript statistics

Last synced: 29 Jul 2025

https://github.com/nalderto/custom-dice-simulation

An AP Statistics demonstration with various amounts of dice

python3 statistics tkinter

Last synced: 29 Jul 2025

https://github.com/sidratulmuntahasara/automax-recommendation-system-ai

A recommendation system that selects the top 3 most comparable properties for a given subject property. It analyzes a subject property and hundreds of nearby candidates to recommend the best comps using a labeled dataset of 100 appraisals where each has a subject property, candidate properties, and selected comps.

llms machine-learning python statistics

Last synced: 25 Jun 2025

https://github.com/j-sephb-lt-n/understanding

Understanding the variability of data by visualising a simulated a/b/n test

ab-test ab-testing data-vis dataviz statistics tutorial

Last synced: 07 Nov 2025

https://github.com/karthik9273/electric-vehicle-market-segmentation

Electric Vehicle market in India using Segmentation analysis and come up with a feasible strategy to enter the market, targeting the segments most likely to use Electric vehicles.

data-science machine-learning-algorithms matplotlib numpy numpy-arrays pandas seaborn statistics

Last synced: 18 Apr 2026

https://github.com/andreea-sindrilaru/statistics-research-project

Within this project, me and a teammate wanted to see if there is any relation between less hours of sleep and school performance.

pandas python statistics

Last synced: 09 May 2026

https://github.com/warrenweckesser/yanova

Functions for one-way and two-way ANOVA.

anova python statistics

Last synced: 27 Mar 2025

https://github.com/asqiriba/comp-578-data-mining

AIl projects for COMP 578 Data Mining.

datamining r statistics

Last synced: 20 May 2026

https://github.com/stdlib-js/stats-array-nanstdevwd

Calculate the standard deviation of an array ignoring `NaN` values and using Welford's algorithm.

array deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib unbiased var variance

Last synced: 19 Apr 2026

https://github.com/matthewfeickert/pydistcore

The interface library for probabilistic modeling in HEP

high-energy-physics probabilistic-modeling python statistical-modeling statistics

Last synced: 18 Oct 2025

https://github.com/reconditematter/ons2

Operations on points on the unit sphere S².

go golang math mathematics statistics

Last synced: 01 Feb 2026

https://github.com/crispengari/data-visualisation-python

💎Visualization of data in python using seaborn and matplotlib

matplotlib python seaborn statistics visualization word-clouds wordcloud

Last synced: 18 May 2026

https://github.com/baslinders/happyhorizon_statstoolkit

An ongoing project for an online toolkit to analyze online controlled experiments. Its mission: To make inferential statistics accessible for everyone.

calculator-python data-science inference inferential-statistics statistics statistics-for-data-science

Last synced: 20 Jan 2026

https://github.com/lastunicorn/ins-toolkit

A .NET library that helps working with files and data from INS (Institutul Național de Statistică din România).

consumer-price-index inflation ins statistics

Last synced: 01 Jul 2026

https://github.com/kashicode/datasight

Data Visualization and Analytics Application

r rshinyapp statistics

Last synced: 07 May 2025

https://github.com/ferdos-coder/instacartmarketbasketanalysisportflio

a repository for statistical and visual analyses utilizing tools like Pandas, NumPy, SQL, and Power BI using InstaCart Market Basket Data

jupyter-notebook machine-learning numpy pandas python statistics

Last synced: 01 May 2026

https://github.com/mattip/presentations

Different presentations I have made

python statistics

Last synced: 18 May 2026

https://github.com/okwilkins/retailanalysis

A comprehensive exploratory analysis and implementation of kmeans/hierarchical clustering on online retail data.

data-analysis data-science machine-learning statistics

Last synced: 18 Oct 2025

https://github.com/ruivieira/oresme

Numerical library for Scala

numerical scala statistics

Last synced: 21 Jul 2025

https://github.com/erictleung/poofsi

:zap: Learn concepts from Nature Methods' series "Points of Significance"

learning nature package r statistics

Last synced: 20 Jan 2026

https://github.com/cyprianfusi/complete-statistical-hypothesis-tests

Complete Statistical Hypothesis Test using real-world data is a blueprint for hypothesis testing! It covers almost all the hypothesis tests commonly used.

hypothesis-testing pandas-python statistics visualization

Last synced: 21 Mar 2025

https://github.com/palashmoon/champange-sales-forecasting

In this notebook, I will use time series forecasting to forecast champagne sales. The sales data of champagnes of a company named 'Perrin Freres' is available

machine-learning python statistics time-series

Last synced: 12 May 2025

https://github.com/zanottipaolo/sms1-regression

Case Study about air quality for Statistics 1 exam @ UniBg - Computer Engineering 2021/2022.

air-quality matlab regression-model statistics

Last synced: 30 Jul 2025

https://github.com/guilherme-marcello/r-ecommercedata-analysis

Reading CSV, using descriptive statistics, and exporting boxplots

boxplot csv r statistics

Last synced: 30 Jul 2025

https://github.com/caterinatasinato/excel-projects

Projects I worked on as Trainee in Data Analytics at ProfessionAI

excel statistics

Last synced: 09 Mar 2026

https://github.com/ruivieira/java-naive-bayes

A Java naive bayes classifier implementation.

classification java machine-learning naive-bayes statistics

Last synced: 21 Jul 2025

https://github.com/oelin/github-404s

A dataset containing non-existant GitHub usernames. Useful for generating usernames that haven't already been taken.

data-science dataset github nlp statistics

Last synced: 22 Sep 2025

https://github.com/davidalexandermoe/economic-time-series-analysis-and-forecasting-arima

Economic Time Series Analysis, Prediction and Forecasting using advanced Statistical methods and an ad-hoc estimated ARIMA (SARIMAX) model in R.

analysis arima arima-model forecasting statistics time-series

Last synced: 21 Jul 2025

https://github.com/iankitnegi/statistically_speaking

Explore diverse projects showcasing statistical techniques with real-world data, comprehensive docs, and interactive visualizations.

data excel statistical-analysis statistics

Last synced: 09 Feb 2026

https://github.com/clemix37/work-stats

🔎 Web based statistics for my work session as a personal project to have stats on my work & productivity ✅

javascript statistics

Last synced: 31 Jul 2025

https://github.com/kxlmtdx/atlantis

A simple template for building your own discord bot with cogs and an initial database 🔥

cogs discord-bot discord-py discord-py-bot python slash-commands sqlite sqlite3 statistics stats

Last synced: 20 Apr 2026

https://github.com/curegit/dentakun

総和計算や数値積分などを含む多機能関数電卓

awt calculator numerical-analysis statistics swing

Last synced: 11 May 2025

https://github.com/egarpor/nonpar

Website for the "First Nonparametric UC3M Workshop"

carlos-iii-university-of-madrid nonparametrics statistics website

Last synced: 11 Feb 2026

https://github.com/brianlesko/machine_learning_5307

This repository concerns Machine Learning concepts Contents: Written by Brian Lesko, the repository contains Python Notebooks demonstrating Statistical Machine Learning theories largely originating from the book, An Introduction to Statistical Learning, by Gareth James.

linear-regression machine-learning ml multiple-linear-regression regression statistics supervised-learning

Last synced: 18 May 2026

https://github.com/messente/messente-api-python

Messente API library: https://pypi.org/project/messente-api

number-lookup omnichannel phonebook statistics

Last synced: 21 Jul 2025

https://github.com/stephaneguerrier/pempi

Proportion estimation with marginal proxy information

covid prevalence r rare-infectious-diseases statistics

Last synced: 17 Mar 2025

https://github.com/stdlib-js/stats-strided-sstdevpn

Calculate the standard deviation of a single-precision floating-point strided array using a two-pass algorithm.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 17 May 2026

https://github.com/humburg/pvalue-distribution

A shiny app to visualise p-value distributions. Intended to facilitate the discussion of how to interpret p-values.

p-values shiny-apps statistics

Last synced: 08 Apr 2025

https://github.com/kynaaaan/rscopula

A simple library implementing copulas in rust

copula rust statistics

Last synced: 23 Sep 2025

https://github.com/theduardomaciel/cc-pe

Conteúdos, scripts em R e datasets utilizados durante a matéria de Probabilidade e Estatística.

data probability r statistics

Last synced: 27 Mar 2025

https://github.com/mehradi-github/ref-jupyter-2510

using python in machine learning

matplotlib numpy pandas python sklearn statistics

Last synced: 11 Apr 2026

https://github.com/lilyreber/cfinversion

Python package for characteristic functions inversion

characteristic-functions numerical-methods python statistics

Last synced: 18 May 2026

https://github.com/lucianotres/ltres.oltui

A GUI to administer the OLT API monitoring tool and provide access to OLT information.

blazor fiber-optic management olt statistics ui webapi

Last synced: 08 May 2026