An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/mikma03/r_lang_progr

Coursework for a subject taken during MA studies at SGH. Examples of using basic libraries in practice.

distribution programing r statistics

Last synced: 26 Feb 2025

https://github.com/shaheennabi/maths-for-data-science-explained

📚 Maths for Data Science Explained ✨🔢 A dedicated space where I explore and explain the mathematics behind data science, machine learning, deep learning, and algorithms. 🚀💡 Each topic comes with a detailed explanation, covering key concepts, step-by-step derivations, and practical insights. 🧠⚡ This repo serves as my personal learning journey.

linear-algebra maths-behind-reinforcement-learning maths-for-computer-vision maths-for-deep-learning maths-for-machine-learning maths-for-nlp neural-networks probability statistics

Last synced: 18 Jan 2026

https://github.com/airdipu/household-survey-analysis

This project is a baseline study, based on a household survey, of healthcare services for pregnant women and newborns, including family planning services, analyzed using SPSS.

spss statistics survey-analysis

Last synced: 30 Jan 2026

https://github.com/lvmalware/lsm-module

A simple statistics module that provides four basic types of regression using the Least Squares Method (LSM).

data-analysis least-square-regression regression regression-analysis statistics

Last synced: 31 Mar 2025

https://github.com/gastonstat/stat2

Introduction to Statistics

introduction-to-statistics statistics syllabus

Last synced: 25 Feb 2026

https://github.com/stdlib-js/stats-strided-dmskmax

Calculate the maximum value of a double-precision floating-point strided array according to a mask.

domain extent extremes javascript mask masked masked-array math mathematics max maximum node node-js nodejs range statistics stats stdlib strided strided-array

Last synced: 17 May 2026

https://github.com/lazernata/deep-learning-course-exercises

Exercises from ADR Formación "Deep Learning. Redes Neuronales. Tensorflow.Python" Course

colab-notebook data-science deep-learning neural-network python statistics

Last synced: 01 Feb 2026

https://github.com/aaishikasb/surfs-up-hacks

Team megaBite's Official Submission for Surfs Up Hacks by MLH.

beach statistics weather

Last synced: 01 Feb 2026

https://github.com/billimarie/reentry

A repository of various articles, abstracts, essays, and studies which analyze successful reentry strategies and report methods of avoiding recidivism.

men-of-color prison prisoners prisoners-rights recidivism reentry research statistics women women-of-color

Last synced: 02 Feb 2026

https://github.com/stdlib-js/blas-ext-linspace

Return a new ndarray filled with linearly spaced values over a specified interval along one or more ndarray dimensions.

arange arrange javascript linear linspace math mathematics matlab ndarray node node-js nodejs numpy seq sequence statistics stats stdlib

Last synced: 04 May 2026

https://github.com/sroman0/data-analytics

Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.

data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics

Last synced: 15 Apr 2026

https://github.com/chmue/vam-bayes-intro

A brief introduction to Bayesian Statistics for Very Applied Methods

bayes introduction statistics

Last synced: 04 Jan 2026

https://github.com/uscbiostats/hpc-with-r

Workshop: Introduction to R (for HPC users)

datascience hpc parallel-computing rstats statistics workshop

Last synced: 03 Mar 2026

https://github.com/lazerlambda/udl-negation

Comparing Data-Driven Techniques for Enhancing Negation Sensitivity in MLM-Based Language Models

bert computational-linguistics deep-learning deeplearning dl encoder machine-learning masked-language-modeling ml negation nlp nlu python research statistics torch transformers

Last synced: 15 Apr 2026

https://github.com/haroontrailblazer/machine_learning

A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.

data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics

Last synced: 16 Apr 2026

https://github.com/danpoynor/python-number-guessing-game-with-stats

A number guessing game written in Python 3 that presents median, mode, and mean statistics

console-game data-analysis number-guessing-game python3 statistics

Last synced: 26 May 2026

https://github.com/beliavsky/regression_spaeth

Fortran 90 library of John Burkardt for regression using least-squares and other criteria, based on Spaeth's codes

linear-regression multiple-linear-regression regression robust-regression statistics

Last synced: 10 Feb 2026

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent project for a Kaggle competition: I worked on the obesity classification data set as part of the Kaggle competition of the same name, achieving an accuracy score above 0.9.

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/venkat-a/olympiananalytics

OlympianAnalytics uncovers Olympic trends, focusing on medal distribution by region, gender, and athlete attributes. It highlights dominance shifts and the role of inclusivity in sports, offering key insights through data visualization.

data-engineering data-visualization statistics tableau

Last synced: 10 Feb 2026

https://github.com/alan-y/blogdown-website

This is my personal website and blog built using the blogdown R package and deployed with Netlify.

r statistics

Last synced: 27 May 2026

https://github.com/spikehd/worldstat

All-in-one CLI tool and Rust library for interfacing with Minecraft world information

minecraft statistics world

Last synced: 11 Feb 2026

https://github.com/andrii-zapukhlyi/otomoto_visualization

Scraping, data visualization, and building a price prediction model with data from the car classifieds website otomoto.pl

data-analysis machine-learning r scraping statistics visualization

Last synced: 26 Jul 2025

https://github.com/ptfonseca/inspector

inspector: Validation of arguments and objects in user-defined functions

input-validation r r-package statistics validation validations

Last synced: 21 Feb 2026

https://github.com/mrtkp9993/fred-downloader

Download econometric time series data from FRED.

fred-api java statistics swing time-series

Last synced: 13 Jun 2026

https://github.com/callmemaverick/deyields_mro

Interactive Streamlit app analyzing German 10-Year Bond Yields & ECB’s MRO Rate with visualizations, regression analysis, and correlation insights.

data-science python python3 statistics

Last synced: 16 May 2025

https://github.com/fn-jakubkarp/digit-span-experiment

This repository contains a replication of Miller's Magic Number experiment.

jasp magic-number opensesame psychology-experiments statistics

Last synced: 15 Mar 2025

https://github.com/mncube/mxsrquick

Streamline workflows for Bayesian mixing model and MixSIAR projects

bayesian mixsiar r statistics

Last synced: 27 May 2026

https://github.com/murilobsd/capybara

:neckbeard: Capybara: C DataFrames for geeks who like numbers. :alien:

analytics bigdata c-plus-plus dataframe geeksforgeeks pandas statistics

Last synced: 16 Apr 2026

https://github.com/aleksandrhovhannisyan/statisticalinferencesinr

Custom package for performing statistical inferences in the R programming language. Written for STA3032 Engineering Statistics to make my life easier.

r statistics

Last synced: 18 Apr 2025

https://github.com/pottekkat/best-stats-you-have-ever-seen

These are the best stats you've ever seen. Website offline.

data-science data-visualization graphs open-source statistics

Last synced: 28 Feb 2026

https://github.com/adijo/ph525-statistics-and-r

Exercises from the Statistics and R course on edX.

inference r statistics

Last synced: 05 May 2026

https://github.com/evgnbch/esports-steam-tools

🎮 Steam tools for esports and automation

analysis api automation counter-strike csgo cybersport dota2 esports esports-analytics gaming gaming-tools python statistics steam steam-api tools valve

Last synced: 01 Mar 2026

https://github.com/stdlib-js/stats-strided-smeanwd

Calculate the arithmetic mean of a single-precision floating-point strided array using Welford's algorithm.

arithmetic-mean array average avg central-tendency float32 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed welford

Last synced: 05 May 2026

https://github.com/intranda/goobi-plugin-statistics-sudan-memory

This Statistics plugin for Goobi workflow determines the activity of edits to translations within specific metadata fields.

digitisation goobi goobi-workflow plugin statistics

Last synced: 14 Mar 2026

https://github.com/analyticbastard/statistical-independence-financial-forecasting

Statistical independence for (impossible) financial forecasting

cryptocurrency finance machine-learning statistics trading

Last synced: 27 May 2026

https://github.com/nabilshadman/multiprocessing-time-series-data-simulation

A simulation using multiprocessing to generate 10,000 time-dependent samples from an initial dataset of 20 samples.

multiprocessing numpy pandas scipy simulation statistics time-series

Last synced: 05 May 2026

https://github.com/imnotannamaria/ia_statistics_for_devs

Repository focused on learning statistics for working with AI, using pandas.

ia pandas python statistics

Last synced: 11 Apr 2026

https://github.com/renatomaynard/time-series-forecasting-methods

Time Series Forecasting Methods: a collection of Python implementations of essential time series forecasting techniques, including Simple, Double, and Triple Exponential Smoothing and Moving Averages.

data-science demand-forecasting exponential-smoothing forecasting holt-winters holt-winters-forecasting moving-average python python-for-everybody seasonality statistics time-series time-series-analysis timeseries-forecasting trend-analysis

Last synced: 17 May 2026

https://github.com/purplem0n/anicount

anime profile degen counter

anilist anime myanimelist statistics

Last synced: 05 May 2026

https://github.com/pointlander/txt

A natural language model based on context mixing

machine-learning self-attention statistics

Last synced: 03 Feb 2026

https://github.com/eric15342335/stat2602

STAT2602 Probability and statistics II [Section 1A, 2024]

statistics

Last synced: 02 Mar 2026

https://github.com/oncoray/power.transform

Repository for the power.transform R package

data-preprocessing statistics

Last synced: 02 Mar 2026

https://github.com/asuquoaa/bar_chart_visualization_with_confidence_intervals_and_interactive_slider

This project visualizes probabilistic data using bar charts with 95% confidence intervals, allowing users to explore deviations from a Value of Interest (V of I) interactively.

data-visualization interactive-visualizations statistics

Last synced: 01 Sep 2025

https://github.com/xnacly/statlib

Go library for commonly used statistical computations, optimized for performance with zero allocations.

go math statistics zero-allocation

Last synced: 27 May 2026

https://github.com/bzubs/mlzero

Implementation of widely used ML algorithms in vanilla Python

machine-learning machine-learning-algorithms machinelearning numpy python statistics

Last synced: 03 Mar 2026

https://github.com/the-tave/stats-picker

The Stats Picker (Statistics Picker) is now openly available on GitHub!

learning psychology r shinyapps simulation statistics teaching

Last synced: 21 Feb 2026

https://github.com/stdlib-js/stats-strided-smeanli

Calculate the arithmetic mean of a single-precision floating-point strided array using a one-pass trial mean algorithm.

arithmetic-mean array average avg central-tendency javascript math mathematics mean node node-js nodejs one-pass statistics stats stdlib strided strided-array typed welford

Last synced: 28 Apr 2026

https://github.com/giladturok/drghmc-old

Delayed Rejection Generalized HMC sampler

hamiltonian-monte-carlo markov-chain-monte-carlo statistics

Last synced: 01 Sep 2025

https://github.com/linggarm/statistics

My personal repository where I can keep files associated with my learning of Statistics

correlation-coefficient covariance pearson-correlation spearman-rank-correlation standard-deviation statistics variance

Last synced: 17 Jun 2026

https://github.com/ai-ahmed/bda-and-ds

Dual Course: EMC2 Data Scientist & Big Data Analytics & IBM Professional Data Science Course

analytics big-data data-science jupyter-notebook jupyterlab linear-algebra mathematics matplotlib ml numpy pandas probability seaborn sklearn statistics

Last synced: 07 May 2026

https://github.com/joeloparco/laptop-analysis

Final project for COSC 3570, Intro to Data Science. The project aimed to find a relationship between laptop price and other laptop characteristics using linear regression.

juypter-notebook latex python statistical-analysis statistics

Last synced: 07 May 2026

https://github.com/mattsebastianh/AB-Testing-at-Nosh-Mish-Mosh-Project

Analyze Data with Python | Hypothesis Testing with Scipy | Sample Size Determination

ab-testing lift sample-size-determination statistics

Last synced: 18 Jun 2026

https://github.com/yandexdataschool/ml-sweights-experiments

Experiments for the "Machine Learning on data with sPlot background subtraction" paper

data-analysis high-energy-physics machine-learning statistics

Last synced: 15 May 2025

https://github.com/silvio2402/whatsappstats

A simple Python script to generate statistics from exported WhatsApp chats.

matplotlib python statistics whatsapp

Last synced: 11 Apr 2026

https://github.com/brunos3d/monty-hall-game-simulation

A web-based implementation of the famous Monty Hall problem, a probability puzzle based on a game show scenario.

advent-of-code algorithms demo game playable statistics study

Last synced: 28 Mar 2025

https://github.com/jfaccioli/school-performance-pandas

A data analysis to showcase trends in school performance.

jupyter-notebook pandas python statistics

Last synced: 04 May 2026

https://github.com/komangandika/time-series

I'm really interested in the world of trading, so why not create a repo dedicated to time series?

quant statistics time-series

Last synced: 15 Mar 2025

https://github.com/ilhanyumer/simplestatistics

Generate simple statistics from a given set of numbers.

java statistics

Last synced: 23 Mar 2025

https://github.com/loryshamadache/tennis-betting-analysis

A program for statistical analysis of tennis matches aimed at informing betting decisions. Includes data exploration, processing, and predictive modeling tools.

betting betting-bot statistics

Last synced: 31 Aug 2025

https://github.com/alexander-ignition/stepic-statistics

Fundamentals of Statistics

python python3 statistics stepic

Last synced: 08 May 2026

https://github.com/thautwarm/bioinfoplus

A tool framework for bioinformatics written in multiple scientific languages.

bioinformatics scientific-computing statistics toolchain

Last synced: 08 May 2026

https://github.com/stdlib-js/stats-strided-dstdevwd

Calculate the standard deviation of a double-precision floating-point strided array using Welford's algorithm.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array unbiased var variance welford

Last synced: 17 Apr 2026

https://github.com/fonnesbeck/uterine_fibroids_ma

Meta-analysis of uterine fibroids treatment outcomes

clinical-research meta-analysis pymc3 python statistics

Last synced: 08 May 2026

https://github.com/legolasbo/go-mqtt-system-monitor

A daemon that periodically reads PC sensor values and publishes them to an MQTT broker.

homeassistant mqtt sensor statistics

Last synced: 17 Apr 2026

https://github.com/fauzancodes/statistic-calculator

Calculate root mean square, variance, standard deviation, skewness, percentile, covariance, Pearson product-moment correlation coefficient, Spearman correlation coefficient, Kendall correlation coefficient, coefficient of determination, and the slope, equation, and plot of linear and polynomial (degree 2 and 3) regression in various ways, using the Python libraries math, statistics, NumPy, SciPy, and scikit-learn.

covariance determination-coefficient equation intercept kendall-correlation-coefficient linear-regression mean median mode pearson-correlation percentile plot polynomial-regression root-mean-square skewness slope spearman-correlation-coefficient standard-deviation statistics variance

Last synced: 18 Apr 2026

https://github.com/charlenry/python_math_machine_learning

My hands-on practice notebooks on Python, NumPy, SimPy, SciPy, Matplotlib, Plotly, Seaborn, and the mathematics for machine learning

algebra derivatives functions jupyter linear matplotlib matrix numpy plotly probabilities pyplot python pytorch scipy seaborn sklearn statistics sympy tensorflow time

Last synced: 25 Jun 2026

https://github.com/nikhilfuke1/hypothesis-testing-analysis-python-statistics

This project explores hypothesis testing techniques in Python, focusing on analyzing real-world data to draw meaningful conclusions. The project also emphasizes presenting findings effectively through data storytelling and impactful visual presentations.

hypothesis-testing pandas python statistics

Last synced: 09 May 2026

https://github.com/ram1103/python-101

Python playground: a beginner's playground for experimenting with code and understanding the different functionalities of Python.

beginner-friendly iitmod ml python python-beginner python-beginners statistics

Last synced: 15 May 2025

https://github.com/shauryashaurya/learning-how-machines-learn

Practical notes and references on common machine learning algorithms. Let's Go!

artificial-intelligence jupyter-notebook machine-learning pandas python statistics

Last synced: 20 Apr 2026

https://github.com/mohamedawnallah/costpro-quarterly-sales-metrics-dashboard

This is a dashboard that displays sales metrics for the CostPro retail store.

dashboard e-commerce statistics streamlit

Last synced: 20 Apr 2026

https://github.com/abinashsahoo007/project-bankruptcy-prevention

This project aims to create a classification model that predicts the chance of a business facing bankruptcy based on key features such as Industrial Risk, Management Risk, Financial Flexibility, Credibility, Competitiveness, and Operating Risk.

data-analysis data-mining data-visualization deployments eda machine-learning pickle python statistics streamlit

Last synced: 20 Apr 2026

https://github.com/zhukovanan/stepik_

Completed tasks from various data science and computer science fields on Stepik.

data statistical-learning statistics stepik-course

Last synced: 21 Apr 2026

https://github.com/rexlmanu/climbchallenge

League of Legends Climb Challenge Tracking website built for my friend group

laravel league-of-legends riot-api shadcn statistics

Last synced: 09 May 2026