An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/secary/stats7022

Data Science PG

r rstudio statistics

Last synced: 08 Oct 2025

https://github.com/shenxianpeng/gitstats-action

GitHub Action that generates insightful visual reports from Git repositories using GitStats

composite-action git git-stats github-actions report statistics

Last synced: 27 May 2026

https://github.com/ndomah1/learning-probability-and-statistics

This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.

correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics

Last synced: 18 Jan 2026

https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report

This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.

dashboard data-analysis data-visualization financial-analysis powerbi statistics

Last synced: 21 Jan 2026

https://github.com/stdlib-js/stats-strided-dnanmin

Calculate the minimum value of a double-precision floating-point strided array, ignoring NaN values.

array domain extent extremes float64 javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib strided strided-array typed

Last synced: 08 May 2026
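
For readers unfamiliar with the term, a "strided" minimum that ignores NaN values can be sketched as below. This is only an illustration in Python under assumed names (`strided_nanmin`, `n`, `stride`, `offset` are hypothetical), not the stdlib-js API or implementation.

```python
import math


def strided_nanmin(n, x, stride, offset=0):
    """Return the minimum of n elements of x, visiting every `stride`-th
    element starting at `offset` and skipping NaN values.

    Returns NaN when n <= 0 or when every visited element is NaN.
    The function and parameter names are illustrative assumptions.
    """
    result = math.nan
    idx = offset
    for _ in range(n):
        value = x[idx]
        # Keep the value if it is a number and smaller than the current minimum
        if not math.isnan(value) and (math.isnan(result) or value < result):
            result = value
        idx += stride
    return result


data = [1.0, 2.0, math.nan, 5.0, -3.0, 0.0]
# Visits indices 0, 2, 4: the values 1.0, NaN, -3.0
print(strided_nanmin(3, data, 2))
```

The stride lets a caller scan every k-th element of a flat buffer (for example one column of a row-major matrix) without copying it.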

https://github.com/exp98/statistical-information-processing-with-r

Practical assignments for the course "Methods of Statistical Information Processing" (1st year of the master's program, Faculty of Mathematics and Mechanics, SPbU) + "Data Science: Tools and Project Life Cycle" (2nd year)

r statistics

Last synced: 09 Oct 2025

https://github.com/milad-rasouli/toker

Toker is a lightweight app that clones GitHub projects and analyzes their codebase with Tokei, offering insightful statistics and details about project structure.

code-analysis code-metrics statistics tokei

Last synced: 14 Jan 2026

https://github.com/bhavnanahar/breast.cancer.detection

This project aims to build a machine learning model to predict breast cancer using a dataset containing various medical features

colab-notebook machine-learning python statistics

Last synced: 10 Oct 2025

https://github.com/j-sephb-lt-n/joes_giant_toolbox

A large collection of general python functions and classes that I use in my daily work

ascii browser classifier data dataviz gcp mime nlp python regex search statistics supervised web-scraping

Last synced: 10 Oct 2025

https://github.com/stdlib-js/stats-strided-sstdevch

Calculate the standard deviation of a single-precision floating-point strided array using a one-pass trial mean algorithm.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 16 May 2026

https://github.com/kasworld/genenum

generate enumeration and statistics

enum enumeration golang metaprogramming statistics vector

Last synced: 09 May 2026

https://github.com/klima7/statistics-in-data-analysis-exercises

Exercises for the Statistics in Data Analysis course: NN, KNN, NM.

knn nm nn statistics

Last synced: 11 Oct 2025

https://github.com/montanaz0r/testing-if-mma-math-deduction-works-using-ufc-fighters-data

Probabilistic reasoning about the phenomenon known as MMA math, using UFC fighter data and Python.

bayesian-inference data-analysis data-science graphviz jupyter-notebook pandas python scipy statistics

Last synced: 14 Apr 2026

https://github.com/paragpvyas/projects

Good, sometimes ugly, functional code: a first program to mine frequent patterns, a smart pill organizer Python app, and tests of the global randomness of RNGs.

embedded-systems frequent-pattern-mining guizero infrared-sensors java json mime mqtt-protocol normalization-score paho-mqtt python random-number-generators raspberry-pi sms-api state-machine statistics

Last synced: 11 Apr 2026

https://github.com/leflon/pile-ou-face

Coin flip (heads or tails) simulator.

css html javascript probability statistics website

Last synced: 09 May 2026

https://github.com/m-clark/connections

connections among various statistical methods

graph statistical-methods statistical-models statistics

Last synced: 23 Mar 2025

https://github.com/best-doctor/we_are_venom

Checks which modules developer contributed using git history.

developer-tools git onboarding statistics

Last synced: 12 Oct 2025

https://github.com/sasai-lab/statplay-opensource

A tool for understanding statistics visually. Interactive statistics visualizer - learn by doing. Vanilla JS, Canvas 2D, zero dependencies

bayesian bilingual canvas data-visualization educational interactive-visualization javascript oer open-educational-resources probability pwa regression statistics vanilla-js

Last synced: 30 May 2026

https://github.com/casaper/swiss_wastewater_covid_virus_load

R Notebook with Swiss COVID wastewater analysis curves for different cities

covid-19 public-health statistics switzerland wastewater-surveillance

Last synced: 12 Oct 2025

https://github.com/rrohitramsen/transaction-statistics

Transaction-Statistics + LRU Cache + Java 8 + Spring Boot + Microservice

java-8 junit-test lru-cache microservice restful-api springboot2 statistics

Last synced: 04 Apr 2026

https://github.com/coder5omkar/logistic-regression-customer-churn-prediction

This project uses Logistic Regression to predict customer churn in the telecom industry. To run, clone the repository, install dependencies, and run the Jupyter notebook for full analysis and predictions.

logistic-regression ml pandas scikit-learn seaborn statistics

Last synced: 20 Apr 2026

https://github.com/mikma03/r_lang_progr

Coursework from MA studies at SGH. Practical examples of using basic libraries.

distribution programing r statistics

Last synced: 26 Feb 2025

https://github.com/stdlib-js/stats-strided-dsnanmean

Calculate the arithmetic mean of a single-precision floating-point strided array, ignoring NaN values, using extended accumulation, and returning an extended precision result.

arithmetic-mean array average avg central-tendency float32 javascript math mathematics mean node node-js nodejs single statistics stats stdlib strided strided-array typed

Last synced: 09 May 2026

https://github.com/bmaitner/statistical_ecology_course

Content for a Statistical Ecology course

course ecology open-educational-resources statistics

Last synced: 13 Mar 2026

https://github.com/stolsky/popular-baby-names

Interactive app listing popular baby names by year

baby-names birth-rates d3js destatis-data statistics

Last synced: 14 Oct 2025

https://github.com/eva-kaushik/probnetx

ProbNetX is a research-grade Bayesian network framework that learns conditional dependencies from discrete datasets and performs exact inference for predicting missing values.

algorithms-and-data-structures machne-learning naive-bayes-classifier statistics

Last synced: 14 Oct 2025

https://github.com/shaheennabi/maths-for-data-science-explained

📚 Maths for Data Science Explained ✨🔢 A dedicated space where I explore and explain the mathematics behind data science, machine learning, deep learning, and algorithms. 🚀💡 Each topic comes with a detailed explanation, covering key concepts, step-by-step derivations, and practical insights. 🧠⚡ This repo serves as my personal learning journey.

linear-algebra maths-behind-reinforcement-learning maths-for-computer-vision maths-for-deep-learning maths-for-machine-learning maths-for-nlp neural-networks probability statistics

Last synced: 18 Jan 2026

https://github.com/egjfour/dsti-course-notes

Notes for classes taken at DSTI stored in an Obsidian vault and backed up to Github. Includes notes for all courses taken during my Master's program

aws calculus cloud graph law linear-algebra mlops neo4j optimization-algorithms owl-ontology project-management rdf software-engineering sql statistics

Last synced: 19 Apr 2026

https://github.com/coatless-textbooks/timeseriesisgreat

Notes from my odyssey in Time Series

bookdown notes r statistics time-series

Last synced: 15 Oct 2025

https://github.com/lvmalware/lsm-module

A simple statistics module, which provides 4 basic types of regressions using the Least Squares Method (LSM)

data-analysis least-square-regression regression regression-analysis statistics

Last synced: 31 Mar 2025

https://github.com/florasteve/ml-foundations-day2

Day-2 ML foundations: probability/stats refresh and NumPy logistic regression; notebooks with visuals.

data-science jupyter-notebook logistic-regression machine-learning matplotlib numpy statistics

Last synced: 10 May 2026

https://github.com/itsn1x/8047_rainin_0xfff

Air Atomized Di-Hydrogen Peroxide on the Blockchain | N1X ( @ i t s N 1 X )

autocomplete automation blockchain day8047 finance fintech itsn1x n1x nikhil-pandita scala scalability scalajs stablecoin statistics

Last synced: 23 Jun 2026

https://github.com/mohdrasmil7/ml-notebooks

This repository contains Jupyter notebooks demonstrating machine learning exercises using both supervised classification and unsupervised algorithms. Each notebook offers a hands-on approach to understanding and applying ML techniques to real-world datasets, providing valuable insights and practical skills for data analysis and predictive modeling.

machine-learning-algorithms neural-networks statistics supervised-learning unsupervised-learning

Last synced: 15 Oct 2025

https://github.com/stdlib-js/blas-ext-linspace

Return a new ndarray filled with linearly spaced values over a specified interval along one or more ndarray dimensions.

arange arrange javascript linear linspace math mathematics matlab ndarray node node-js nodejs numpy seq sequence statistics stats stdlib

Last synced: 04 May 2026

https://github.com/anras5/criteo-search-data

EDA and statistical tests on CriteoSearchData dataset

data-science pandas scikit-learn statistics

Last synced: 11 May 2026

https://github.com/gastonstat/rcompendium

Comprehensive collection of slides about R

data-science introduction-to-r r-language r-programming slides statistics

Last synced: 24 Feb 2026

https://github.com/uscbiostats/hpc-with-r

Workshop: Introduction to R (for HPC users)

datascience hpc parallel-computing rstats statistics workshop

Last synced: 03 Mar 2026

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/alan-y/blogdown-website

This is my personal website and blog built using the blogdown R package and deployed with Netlify.

r statistics

Last synced: 27 May 2026

https://github.com/andrii-zapukhlyi/otomoto_visualization

Scraping, data visualization, and building a price prediction model with data from the car classifieds website otomoto.pl

data-analysis machine-learning r scraping statistics visualization

Last synced: 26 Jul 2025

https://github.com/sagnac/fresdet

Simple analysis tool for estimating the original resolution of standard upscaled images using FFTs and basic statistical metrics.

fft image-processing julia statistics

Last synced: 22 Oct 2025

https://github.com/ptfonseca/inspector

inspector: Validation of arguments and objects in user-defined functions

input-validation r r-package statistics validation validations

Last synced: 21 Feb 2026

https://github.com/callmemaverick/deyields_mro

Interactive Streamlit app analyzing German 10-Year Bond Yields & ECB’s MRO Rate with visualizations, regression analysis, and correlation insights.

data-science python python3 statistics

Last synced: 16 May 2025

https://github.com/mncube/mxsrquick

Streamline workflows for Bayesian mixing model and MixSIAR projects

bayesian mixsiar r statistics

Last synced: 27 May 2026

https://github.com/murilobsd/capybara

:neckbeard: Capybara: C DataFrames for geeks who like numbers. :alien:

analytics bigdata c-plus-plus dataframe geeksforgeeks pandas statistics

Last synced: 16 Apr 2026

https://github.com/aleksandrhovhannisyan/statisticalinferencesinr

Custom package for performing statistical inferences in the R programming language. Written for STA3032 Engineering Statistics to make my life easier.

r statistics

Last synced: 18 Apr 2025

https://github.com/stevensolleder/passengersonlinetransportationinselectedgermanstates

This is a website to visualize the number of passengers in regular transport in selected states of Germany over the last years.

chart chartjs corona css germany html javascript public-transport statistics

Last synced: 23 Oct 2025

https://github.com/andrewrporter/text-stats

Displays various text statistics for the currently opened document in Visual Studio Code

extension nodejs statistics typescript visual-studio-code vscode vscode-extension

Last synced: 12 Apr 2026

https://github.com/intranda/goobi-plugin-statistics-sudan-memory

This Statistics plugin for Goobi workflow determines the activity of edits to translations within specific metadata fields.

digitisation goobi goobi-workflow plugin statistics

Last synced: 14 Mar 2026

https://github.com/analyticbastard/statistical-independence-financial-forecasting

Statistical independence for (impossible) financial forecasting

cryptocurrency finance machine-learning statistics trading

Last synced: 27 May 2026

https://github.com/mkstratos/detectable_climate

Design and test improvements to MVK from evv4esm

climate climate-model-evaluation climate-modelling statistics

Last synced: 25 Oct 2025

https://github.com/imnotannamaria/ia_statistics_for_devs

Repository focused on learning statistics for working with AI, using pandas.

ia pandas python statistics

Last synced: 11 Apr 2026

https://github.com/renatomaynard/time-series-forecasting-methods

Time Series Forecasting Methods — A collection of Python implementations for essential time series forecasting techniques, including Simple, Double, Triple Exponential Smoothing, and Moving Averages.

data-science demand-forecasting exponential-smoothing forecasting holt-winters holt-winters-forecasting moving-average python python-for-everybody seasonality statistics time-series time-series-analysis timeseries-forecasting trend-analysis

Last synced: 17 May 2026
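
As a point of reference for the simplest technique named in the entry above, simple exponential smoothing can be sketched as follows. This is a minimal illustration, not code from the listed repository; the function name and the choice to seed the series with its first observation are assumptions.

```python
def simple_exponential_smoothing(series, alpha):
    """Smooth a sequence with s[0] = x[0] and
    s[t] = alpha * x[t] + (1 - alpha) * s[t - 1].

    alpha in (0, 1] controls how quickly older observations are forgotten.
    Seeding with the first observation is one common convention (assumed here).
    """
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    if not series:
        return []
    smoothed = [float(series[0])]
    for value in series[1:]:
        smoothed.append(alpha * value + (1 - alpha) * smoothed[-1])
    return smoothed


print(simple_exponential_smoothing([10, 20, 30], alpha=0.5))
```

Double and triple exponential smoothing extend the same recursion with additional trend and seasonal components.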

https://github.com/sofiia-chorna/estimation-et-identification-statistique-project

Estimation et identification statistique - Course Project (MA201)

matlab statistics

Last synced: 23 Jun 2026

https://github.com/asuquoaa/bar_chart_visualization_with_confidence_intervals_and_interactive_slider

This project visualizes probabilistic data using bar charts with 95% confidence intervals, allowing users to explore deviations from a Value of Interest (V of I) interactively.

data-visualization interactive-visualizations statistics

Last synced: 01 Sep 2025

https://github.com/xnacly/statlib

Go library for commonly used statistical computations, optimized for performance with zero allocations

go math statistics zero-allocation

Last synced: 27 May 2026

https://github.com/peterbenoit/maths.js

Maths.js is a lightweight JavaScript library that extends the basic mathematical capabilities of JavaScript

javascript lightweight math math-library statistics utility-function

Last synced: 12 May 2026

https://github.com/stdlib-js/stats-strided-smeanli

Calculate the arithmetic mean of a single-precision floating-point strided array using a one-pass trial mean algorithm.

arithmetic-mean array average avg central-tendency javascript math mathematics mean node node-js nodejs one-pass statistics stats stdlib strided strided-array typed welford

Last synced: 28 Apr 2026

https://github.com/giladturok/drghmc-old

Delayed Rejection Generalized HMC sampler

hamiltonian-monte-carlo markov-chain-monte-carlo statistics

Last synced: 01 Sep 2025

https://github.com/yandexdataschool/ml-sweights-experiments

Experiments for the "Machine Learning on data with sPlot background subtraction" paper

data-analysis high-energy-physics machine-learning statistics

Last synced: 15 May 2025

https://github.com/silvio2402/whatsappstats

A simple python script to generate statistics from exported WhatsApp chats.

matplotlib python statistics whatsapp

Last synced: 11 Apr 2026

https://github.com/matbesancon/distributionsexamples.jl

Examples of analyses run using Distributions.jl

julialang probability-distribution statistics tutorial

Last synced: 04 Jul 2026

https://github.com/m-jahn/europe-by-numbers

The content of my blog, https://europebynumbers.wordpress.com/.

blog computational-biology europe r-markdown statistics

Last synced: 06 Feb 2026

https://github.com/vincentlaucsb/statistical-models

A collection of notes detailing statistical models, including both their theoretical aspects and applications (in R).

logistic-regression statistics

Last synced: 24 Jan 2026

https://github.com/brunos3d/monty-hall-game-simulation

A web-based implementation of the famous Monty Hall problem, a probability puzzle based on a game show scenario.

advent-of-code algorithms demo game playable statistics study

Last synced: 28 Mar 2025
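
The Monty Hall problem mentioned in the entry above is easy to check by simulation. The sketch below is an independent illustration in Python, not code from the listed repository; the function names and the default trial count are assumptions.

```python
import random


def play(switch, rng):
    """Play one round; return True if the contestant wins the car."""
    doors = [0, 1, 2]
    car = rng.choice(doors)
    pick = rng.choice(doors)
    # The host opens a door that hides a goat and was not picked
    opened = rng.choice([d for d in doors if d != pick and d != car])
    if switch:
        # Move to the only door that is neither picked nor opened
        pick = next(d for d in doors if d != pick and d != opened)
    return pick == car


def win_rate(switch, trials=100_000, seed=0):
    """Estimate the win probability for the given strategy."""
    rng = random.Random(seed)
    return sum(play(switch, rng) for _ in range(trials)) / trials


print("switch:", win_rate(True))
print("stay:  ", win_rate(False))
```

With enough trials the estimates settle near 2/3 for switching and 1/3 for staying, which matches the standard analysis of the puzzle.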

https://github.com/jfaccioli/school-performance-pandas

A data analysis to showcase trends in school performance.

jupyter-notebook pandas python statistics

Last synced: 04 May 2026

https://github.com/ilhanyumer/simplestatistics

Generate simple statistics from a given set of numbers.

java statistics

Last synced: 23 Mar 2025

https://github.com/loryshamadache/tennis-betting-analysis

A program for statistical analysis of tennis matches aimed at informing betting decisions. Includes data exploration, processing, and predictive modeling tools.

betting betting-bot statistics

Last synced: 31 Aug 2025