An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/orvn/some-visualizations

Just some visualizations of concepts and data

d3js data-visualization math statistics

Last synced: 24 Jun 2026

https://github.com/vinodbaste/hr-analytics-employee-attrition-and-performance-prediction

In this project, we enlisted the numerical and categorical attributes present in the publicly available dataset. Missing values were dropped to give better insights in data analysis. ANOVA and Chi-Square tests were carried out during statistical analysis. Machine Learning algo's were applied to understand, manage, and mitigate employee attrition.

data-science dataanalytics datavisualization machine-learning statistics

Last synced: 24 Mar 2025

https://github.com/oelin/parametric-complexity

Estimating the parametric complexity (minimum description length) of binary classifiers.

bias-variance-tradeoff machine-learning minimum-description-length model-selection statistics

Last synced: 29 Apr 2026

https://github.com/shenxianpeng/gitstats-action

GitHub Action that generates insightful visual reports from Git repositories using GitStats

composite-action git git-stats github-actions report statistics

Last synced: 27 May 2026

https://github.com/sukuasoft/stat-js

It is a lightweight and easy-to-use library for Node.js that offers basic functions essential for simple statistical analysis.

basic nodejs statistics

Last synced: 03 May 2026

https://github.com/mauriciogtec/statsmodelling2

See the README for a link to the solutions

bayesian-inference statistics student-project

Last synced: 24 Feb 2026

https://github.com/stdlib-js/stats-strided-sstdevch

Calculate the standard deviation of a single-precision floating-point strided array using a one-pass trial mean algorithm.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 16 May 2026

https://github.com/pblischak/zig-ndarray

N-Dimensional Arrays in Zig

data-science ndarray statistics ziglang

Last synced: 06 Feb 2026

https://github.com/subh888999/car-prices--analysis-projects

This repository houses projects focused on data collection, assessment, cleaning, visualization, and analysis. It includes workflows and methodologies for handling data, from initial gathering and evaluation to processing, visualizing insights, and performing in-depth analysis

jupyter-notebook matplotlib numpy panda seaborn statistics

Last synced: 03 May 2026

https://github.com/rustkas/statistics-with-rust

"Statistics with Rust" is your comprehensive resource to unlock Rust's true potential in modern statistical methods.

rust rust-example statistics

Last synced: 21 Mar 2025

https://github.com/quentin18/pga-tour

PGA Tour data analysis from 2010 to 2020

classification dataanalysis golf rlang rmd scraping statistics

Last synced: 29 Jan 2026

https://github.com/anikov/filestats

Program that update folder statistic in excel book

excel files statistics

Last synced: 05 Mar 2026

https://github.com/m-clark/connections

connections among various statistical methods

graph statistical-methods statistical-models statistics

Last synced: 23 Mar 2025

https://github.com/nfaltir/law-of-averages

:books: A simple script that explains the relationship between the results of a coin tosses and the Law of Averages

datascience python statistics

Last synced: 31 Mar 2025

https://github.com/digital-wellbeing/paradigm-comments

Commentary on proposed new paradigm(s) in social media effects research

psychology social-media statistics well-being

Last synced: 30 Jan 2026

https://github.com/mksingh431/r-programming-language

Free R programming notes pdf are provided here for R programming students so that they can prepare and score high marks in their R programming exam

note notes r static static-site-generator statistics

Last synced: 01 Apr 2025

https://github.com/nakshjainsonigara/football-eda

This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.

dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics

Last synced: 07 Jan 2026

https://github.com/gastonstat/stat2

Introduction to Statistics

introduction-to-statistics statistics syllabus

Last synced: 25 Feb 2026

https://github.com/shaheennabi/maths-for-data-science-explained

📚 Maths for Data Science Explained ✨🔢 A dedicated space where I explore and explain the mathematics behind data science, machine learning, deep learning, and algorithms. 🚀💡 Each topic comes with a detailed explanation, covering key concepts, step-by-step derivations, and practical insights. 🧠⚡ This repo serves as my personal learning journey.

linear-algebra maths-behind-reinforcement-learning maths-for-computer-vision maths-for-deep-learning maths-for-machine-learning maths-for-nlp neural-networks probability statistics

Last synced: 18 Jan 2026

https://github.com/jakubfr4czek/apartment-prices-analysis

This repository contains an analysis of apartments prices in Krakow. It was made for "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course i took at AGH University of Science.

agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning

Last synced: 04 May 2026

https://github.com/denisecase/datafun-02-functions

Practice with builtin functions, creating functions, creating methods (functions in a class), and employing statistical functions

beginner functions methods python statistics

Last synced: 30 Oct 2025

https://github.com/gemmaro/msgstat

gettext PO file statistics (to learn C gettext PO library)

gettext po-file statistics

Last synced: 18 Apr 2026

https://github.com/gbrsales/kll

Streaming Quantile Approximation for R

r statistics stream-processing

Last synced: 31 Mar 2025

https://github.com/sroman0/data-analytics

Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.

data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics

Last synced: 15 Apr 2026

https://github.com/fabnavarro/makedensity2d

2-dimensional Normal Mixture Models

bivariate-analysis matlab statistics

Last synced: 14 Jun 2026

https://github.com/josericodata/statisticsapp

Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.

alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test

Last synced: 26 Feb 2026

https://github.com/rahulbhadani/statistical-sauce

A curated list of definitions and concepts from statistics

statistics

Last synced: 09 Feb 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 11 Apr 2026

https://github.com/uscbiostats/hpc-with-r

Workshop: Introduction to R (for HPC users)

datascience hpc parallel-computing rstats statistics workshop

Last synced: 03 Mar 2026

https://github.com/linggarm/statistics

My personal repository where I can keep files associated with my learning of Statistics

correlation-coefficient covariance pearson-correlation spearman-rank-correlation standard-deviation statistics variance

Last synced: 17 Jun 2026

https://github.com/beliavsky/regression_spaeth

Fortran 90 library of John Burkardt for regression using least-squares and other criteria, based on Spaeth's codes

linear-regression multiple-linear-regression regression robust-regression statistics

Last synced: 10 Feb 2026

https://github.com/stdlib-js/stats-strided-smin

Calculate the minimum value of a single-precision floating-point strided array.

array domain extent extremes float32 javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib strided strided-array typed

Last synced: 05 May 2026

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/alan-y/blogdown-website

This is my personal website and blog built using the blogdown R package and deployed with Netlify.

r statistics

Last synced: 27 May 2026

https://github.com/spikehd/worldstat

All-in-one CLI tool and Rust library for interfacing with Minecraft world information

minecraft statistics world

Last synced: 11 Feb 2026

https://github.com/rodrigojunqueiradev/python-exercises

Repositório para armazenar exercícios realizados na linguagem Python / Repository to organize exercises with Python language

data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics

Last synced: 16 Apr 2026

https://github.com/marberts/pysps

Sequential Poisson sampling in Python

python sampling statistics

Last synced: 24 Mar 2025

https://github.com/stdlib-js/stats-strided-dstdevyc

Calculate the standard deviation of a double-precision floating-point strided array using a one-pass algorithm proposed by Youngs and Cramer.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 05 May 2026

https://github.com/toro-nicolas/toro-nicolas

The readme of my profile.

curriculum-vitae profil readme statistics

Last synced: 12 Feb 2026

https://github.com/beliavsky/r_and_fortran

Examples of simple R and Fortran programs that calculate descriptive statistics and of equivalent R and Fortran syntax

fortran fortran-tutorial language-comparison r r-tutorial simulation statistics

Last synced: 25 Mar 2025

https://github.com/nabilshadman/multiprocessing-time-series-data-simulation

A simulation using multiprocessing to generate 10,000 time dependent samples from an initial dataset of 20 samples

multiprocessing numpy pandas scipy simulation statistics time-series

Last synced: 05 May 2026

https://github.com/marcpinet/mtsad-benchmarks-are-mostly-univariate

🔍 Evidence that current multivariate time series anomaly detection benchmarks don't actually test cross-channel modeling.

anomaly-detection benchmarks data-science datasets machine-learning statistics synthetic-data

Last synced: 28 Jun 2026

https://github.com/rici4kubicek/statistics

Implementation of simple statistics functionality for embedded systems.

embedded embedded-systems math max min statistics

Last synced: 27 Feb 2026

https://github.com/diegopinate/docker-stats-viewer

Simple docker stats viewer made with copilot

docker html javascript npm statistics

Last synced: 12 Apr 2026

https://github.com/m-dadej/pd_estimation

Estimation of probability of default on novel data from Orbis

data-science predictive-modeling statistics

Last synced: 02 Jul 2025

https://github.com/aliramazanyildirim/aliramazanyildirim

My GitHub contribution graph is not statistics this time, but a space battle. As I code, the enemies grow stronger, and the game updates itself every day 🎮✨

contributions github graph space-battles statistics

Last synced: 13 Feb 2026

https://github.com/artginzburg/npmstalk

JS module · Total downloads of an NPM maintainer

badge github npm npm-package statistics

Last synced: 14 Feb 2026

https://github.com/a-herzog/multimodalfit

MultimodalFit is a Python package for fitting a combination of multiple distributions to one measurement series.

distribution-fitting jupyter-notebook python statistics

Last synced: 28 Feb 2026

https://github.com/cybcon/docker.dockerhubstats2mqtt

Collect repository statistics from Dockerhub and publish them to a MQTT topic.

docker-image dockerhub mosquitto-pub statistics

Last synced: 14 Apr 2026

https://github.com/dideler/course-level-3-certificate-data-science

Level 3 Certificate in Data - Spring 2025 - City of London (Guildhall)

data-science statistics

Last synced: 24 Mar 2025

https://github.com/dav-idka-j/stash-plugins

A collection of plugins for stash

javascript stash stash-plugin stashapp statistics

Last synced: 28 Feb 2026

https://github.com/ptfonseca/inspector

inspector: Validation of arguments and objects in user-defined functions

input-validation r r-package statistics validation validations

Last synced: 21 Feb 2026

https://github.com/democritus-project/d8s-stats

Democritus functions for working with statistics.

democritus python statistics statistics-utility utility

Last synced: 01 Apr 2025

https://github.com/mncube/mxsrquick

Streamline workflows for Bayesian mixing model and MixSIAR projects

bayesian mixsiar r statistics

Last synced: 27 May 2026

https://github.com/aleksandrhovhannisyan/statisticalinferencesinr

Custom package for performing statistical inferences in the R programming language. Written for STA3032 Engineering Statistics to make my life easier.

r statistics

Last synced: 18 Apr 2025

https://github.com/mattsebastianh/AB-Testing-at-Nosh-Mish-Mosh-Project

Analyze Data with Python | Hypothesis Testing with Scipy | Sample Size Determination

ab-testing lift sample-size-determination statistics

Last synced: 18 Jun 2026

https://github.com/alexp11223/d3flightsdatavisualization

Some visualizations for Kaggle 2015 Flight Delays and Cancellations dataset using d3.js and Leaflet

d3 d3js dataset flight-map leaflet sql statistics visualization

Last synced: 31 Mar 2025

https://github.com/joeribreedveld/strong-stats

Strong workout insights, private, free and simple.

free gym nextjs originui recharts shadcn-ui statistics stats strong tailwindcss visualize workout

Last synced: 07 May 2026

https://github.com/sodascience/workshop_efficient_microdata

Workshop for efficiently doing projects with CBS microdata.

computing microdata project-organisation statistics

Last synced: 19 Mar 2026

https://github.com/joeloparco/laptop-analysis

Final Project for COSC 3570 intro to Data Science. Project aimed to find a relationship between laptop price and other laptop characteristics using linear regression.

juypter-notebook latex python statistical-analysis statistics

Last synced: 07 May 2026

https://github.com/zietzm/webgwas-analysis

Figures and analysis for the WebGWAS project paper

gwas python r statistics

Last synced: 07 May 2026

https://github.com/juanfranciscocis/probability-and-statistics

University project, using a poll data base and python make a full Probability and Statistics data research.

googlecolab latex probability python3 statistics

Last synced: 08 May 2026

https://github.com/eric15342335/stat2602

STAT2602 Probability and statistics II [Section 1A, 2024]

statistics

Last synced: 02 Mar 2026

https://github.com/oncoray/power.transform

Repository for the power.transform R package

data-preprocessing statistics

Last synced: 02 Mar 2026

https://github.com/alexander-ignition/stepic-statistics

Fundamentals of Statistics

python python3 statistics stepic

Last synced: 08 May 2026

https://github.com/qbarthelemy/stats-simple-cpp

Library for statistics in simple C++, for different sequence containers of different numeric data types.

c-plus-plus cplusplus machine-learning scientific-computing statistics

Last synced: 28 Mar 2025

https://github.com/bzubs/mlzero

Implementation of widely used ML algorithms in vanilla python

machine-learning machine-learning-algorithms machinelearning numpy python statistics

Last synced: 03 Mar 2026

https://github.com/justdvnsh/algorithms

A list of all major algorithms, their tests and explanations ranging all the way from basic to advanced.

advanced algorithms basic data-structures intermediate javascript machine-learning-algorithms statistics

Last synced: 12 Jul 2025

https://github.com/sahiltiwariiii/dssp

Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 27 Mar 2026

https://github.com/intranda/goobi-plugin-statistics-sudan-memory

This Statistics plugin for Goobi workflow determines the activity of edits to translations within specific metadata fields.

digitisation goobi goobi-workflow plugin statistics

Last synced: 14 Mar 2026

https://github.com/thautwarm/bioinfoplus

A tool framework for bioinformatics written in multiple scientific languages.

bioinformatics scientific-computing statistics toolchain

Last synced: 08 May 2026

https://github.com/queelius/compositional.mle

Composable MLE solvers: a DSL for maximum likelihood estimation where solvers are first-class functions that combine via chaining, racing, and restarts

composable dsl estimation maximum-likelihood mle mle-estimation numerical-methods optimization r-package statistics

Last synced: 04 Mar 2026

https://github.com/saurabh274/aerofit-statistics-and-probability

The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.

matplotlib-pyplot numpy pandas probability python seaborn statistics

Last synced: 08 May 2026

https://github.com/stdlib-js/stats-strided-dnanmin

Calculate the minimum value of a double-precision floating-point strided array, ignoring NaN values.

array domain extent extremes float64 javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib strided strided-array typed

Last synced: 08 May 2026

https://github.com/elgohr/github-action-analyzer

Analyzer for the usage of Github Actions

actions analyzer github statistics usage

Last synced: 09 May 2026

https://github.com/banghasan/telegram-statistic-bot

Telegram Bot Statistic

bot statistics telegram

Last synced: 16 Apr 2026

https://github.com/nikhilfuke1/hypothesis-testing-analysis-python-statistics

This project explores hypothesis testing techniques in Python, focusing on analyzing real-world data to draw meaningful conclusions. project also emphasizes presenting findings effectively through data storytelling and impactful visual presentations.

hypothesis-testing pandas python statistics

Last synced: 09 May 2026