Statistics
Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.
- GitHub: https://github.com/topics/statistics
- Wikipedia: https://en.wikipedia.org/wiki/Statistics
- Related Topics: data-science, machine-learning, deep-learning, neural-network,
- Last updated: 2026-06-29 00:25:57 UTC
- JSON Representation
https://github.com/orvn/some-visualizations
Just some visualizations of concepts and data
d3js data-visualization math statistics
Last synced: 24 Jun 2026
https://github.com/vinodbaste/hr-analytics-employee-attrition-and-performance-prediction
In this project, we enlisted the numerical and categorical attributes present in the publicly available dataset. Missing values were dropped to give better insights in data analysis. ANOVA and Chi-Square tests were carried out during statistical analysis. Machine Learning algo's were applied to understand, manage, and mitigate employee attrition.
data-science dataanalytics datavisualization machine-learning statistics
Last synced: 24 Mar 2025
https://github.com/oelin/parametric-complexity
Estimating the parametric complexity (minimum description length) of binary classifiers.
bias-variance-tradeoff machine-learning minimum-description-length model-selection statistics
Last synced: 29 Apr 2026
https://github.com/lempiji/lantern
d-programming-language datatable dlang statistics
Last synced: 24 Jan 2026
https://github.com/shenxianpeng/gitstats-action
GitHub Action that generates insightful visual reports from Git repositories using GitStats
composite-action git git-stats github-actions report statistics
Last synced: 27 May 2026
https://github.com/sukuasoft/stat-js
It is a lightweight and easy-to-use library for Node.js that offers basic functions essential for simple statistical analysis.
Last synced: 03 May 2026
https://github.com/mauriciogtec/statsmodelling2
See the README for a link to the solutions
bayesian-inference statistics student-project
Last synced: 24 Feb 2026
https://github.com/stdlib-js/stats-strided-sstdevch
Calculate the standard deviation of a single-precision floating-point strided array using a one-pass trial mean algorithm.
deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance
Last synced: 16 May 2026
https://github.com/rodrigojunqueiradev/rodrigojunqueiradev.github.io
Professional Portfolio - Rodrigo Junqueira
analytics artificial-intelligence data-analysis data-engineering data-science data-visualization machine-learning mathematics nosql powerbi python r sql statistics
Last synced: 15 May 2026
https://github.com/pblischak/zig-ndarray
N-Dimensional Arrays in Zig
data-science ndarray statistics ziglang
Last synced: 06 Feb 2026
https://github.com/subh888999/car-prices--analysis-projects
This repository houses projects focused on data collection, assessment, cleaning, visualization, and analysis. It includes workflows and methodologies for handling data, from initial gathering and evaluation to processing, visualizing insights, and performing in-depth analysis
jupyter-notebook matplotlib numpy panda seaborn statistics
Last synced: 03 May 2026
https://github.com/richardwarepam16/learning-statistics-using-python-for-data-analysis
Data Analyst: Learning Statistics using Python (Descriptive Stats, Inferential Stats, ML)
data-analysis data-analysis-python data-science descriptive-statistics inferential-statistics statistical-learning statistics
Last synced: 15 Mar 2025
https://github.com/rustkas/statistics-with-rust
"Statistics with Rust" is your comprehensive resource to unlock Rust's true potential in modern statistical methods.
Last synced: 21 Mar 2025
https://github.com/quentin18/pga-tour
PGA Tour data analysis from 2010 to 2020
classification dataanalysis golf rlang rmd scraping statistics
Last synced: 29 Jan 2026
https://github.com/anikov/filestats
Program that update folder statistic in excel book
Last synced: 05 Mar 2026
https://github.com/m-clark/connections
connections among various statistical methods
graph statistical-methods statistical-models statistics
Last synced: 23 Mar 2025
https://github.com/aliciagilmatute/simulacion-estadistica
en construcción...
data-analysis data-science distribution-simulation distributions r rstats rstatses rstudio simulation simulation-studies statistics statistics-simulation
Last synced: 24 Mar 2025
https://github.com/nfaltir/law-of-averages
:books: A simple script that explains the relationship between the results of a coin tosses and the Law of Averages
Last synced: 31 Mar 2025
https://github.com/digital-wellbeing/paradigm-comments
Commentary on proposed new paradigm(s) in social media effects research
psychology social-media statistics well-being
Last synced: 30 Jan 2026
https://github.com/mksingh431/r-programming-language
Free R programming notes pdf are provided here for R programming students so that they can prepare and score high marks in their R programming exam
note notes r static static-site-generator statistics
Last synced: 01 Apr 2025
https://github.com/cxw42/games-dice-tester
Statistical tests for random dice-rolling programs
dice dice-rolling-statistics dice-rolls perl random-number random-number-distributions random-numbers statistics
Last synced: 09 Jun 2026
https://github.com/nakshjainsonigara/football-eda
This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.
dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics
Last synced: 07 Jan 2026
https://github.com/gastonstat/stat2
Introduction to Statistics
introduction-to-statistics statistics syllabus
Last synced: 25 Feb 2026
https://github.com/shaheennabi/maths-for-data-science-explained
📚 Maths for Data Science Explained ✨🔢 A dedicated space where I explore and explain the mathematics behind data science, machine learning, deep learning, and algorithms. 🚀💡 Each topic comes with a detailed explanation, covering key concepts, step-by-step derivations, and practical insights. 🧠⚡ This repo serves as my personal learning journey.
linear-algebra maths-behind-reinforcement-learning maths-for-computer-vision maths-for-deep-learning maths-for-machine-learning maths-for-nlp neural-networks probability statistics
Last synced: 18 Jan 2026
https://github.com/jakubfr4czek/apartment-prices-analysis
This repository contains an analysis of apartments prices in Krakow. It was made for "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course i took at AGH University of Science.
agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning
Last synced: 04 May 2026
https://github.com/lunatic-fox/github-interface-app-plus
GitHub Interface App + GitHub Colors
api colors github languages percentage react statistics
Last synced: 15 Apr 2026
https://github.com/denisecase/datafun-02-functions
Practice with builtin functions, creating functions, creating methods (functions in a class), and employing statistical functions
beginner functions methods python statistics
Last synced: 30 Oct 2025
https://github.com/pawal/tldmonitor-ui-go
Web UI for TLDMonitor
analysis data dns go golang mongodb statistics webapp website
Last synced: 16 Jan 2026
https://github.com/shlokarora2709/mentorness
Repo for mentorness internship tasks
anova-test arima chi2-contingency data-cleaning data-engineering data-science django machine-learning pandas python sklearn statistics time-series xgboost
Last synced: 12 Apr 2026
https://github.com/sshaplygin/abcs
Adaptive bandit cache selection
2q-cache arc-cache golang lfu-cache lfuda-cache lru-cache mab statistics
Last synced: 15 Mar 2025
https://github.com/abideen-olawuwo/georgiatimeseries
A times series forcasting
arima itertools linear-regression matplotlib numpy pandas python seaborn statistics statsmodels time-series
Last synced: 11 Apr 2026
https://github.com/gemmaro/msgstat
gettext PO file statistics (to learn C gettext PO library)
Last synced: 18 Apr 2026
https://github.com/gbrsales/kll
Streaming Quantile Approximation for R
r statistics stream-processing
Last synced: 31 Mar 2025
https://github.com/sroman0/data-analytics
Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.
data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics
Last synced: 15 Apr 2026
https://github.com/fabnavarro/makedensity2d
2-dimensional Normal Mixture Models
bivariate-analysis matlab statistics
Last synced: 14 Jun 2026
https://github.com/polymathorg/project-proposals
Project proposals and idea list for PolyMath community
data-science ideas math mathematics numerical-methods pharo pharo-smalltalk project-proposal smalltalk statistics
Last synced: 15 Mar 2025
https://github.com/josericodata/statisticsapp
Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.
alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test
Last synced: 26 Feb 2026
https://github.com/rahulbhadani/statistical-sauce
A curated list of definitions and concepts from statistics
Last synced: 09 Feb 2026
https://github.com/armahdavi/data_pipeline_analytics_statistics_ml_pm_psd_residential_qff
Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782
data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics
Last synced: 11 Apr 2026
https://github.com/uscbiostats/hpc-with-r
Workshop: Introduction to R (for HPC users)
datascience hpc parallel-computing rstats statistics workshop
Last synced: 03 Mar 2026
https://github.com/linggarm/statistics
My personal repository where I can keep files associated with my learning of Statistics
correlation-coefficient covariance pearson-correlation spearman-rank-correlation standard-deviation statistics variance
Last synced: 17 Jun 2026
https://github.com/beliavsky/regression_spaeth
Fortran 90 library of John Burkardt for regression using least-squares and other criteria, based on Spaeth's codes
linear-regression multiple-linear-regression regression robust-regression statistics
Last synced: 10 Feb 2026
https://github.com/stdlib-js/stats-strided-smin
Calculate the minimum value of a single-precision floating-point strided array.
array domain extent extremes float32 javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib strided strided-array typed
Last synced: 05 May 2026
https://github.com/stdlib-js/stats-strided-drange
Calculate the range of a double-precision floating-point strided array.
dispersion domain extent extremes javascript math mathematics max maximum min minimum node node-js nodejs range statistics stats stdlib strided strided-array
Last synced: 05 May 2026
https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle
Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9
classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml
Last synced: 11 Apr 2026
https://github.com/alan-y/blogdown-website
This is my personal website and blog built using the blogdown R package and deployed with Netlify.
Last synced: 27 May 2026
https://github.com/spikehd/worldstat
All-in-one CLI tool and Rust library for interfacing with Minecraft world information
Last synced: 11 Feb 2026
https://github.com/rodrigojunqueiradev/python-exercises
Repositório para armazenar exercícios realizados na linguagem Python / Repository to organize exercises with Python language
data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics
Last synced: 16 Apr 2026
https://github.com/stdlib-js/stats-strided-dstdevyc
Calculate the standard deviation of a double-precision floating-point strided array using a one-pass algorithm proposed by Youngs and Cramer.
deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance
Last synced: 05 May 2026
https://github.com/toro-nicolas/toro-nicolas
The readme of my profile.
curriculum-vitae profil readme statistics
Last synced: 12 Feb 2026
https://github.com/beliavsky/r_and_fortran
Examples of simple R and Fortran programs that calculate descriptive statistics and of equivalent R and Fortran syntax
fortran fortran-tutorial language-comparison r r-tutorial simulation statistics
Last synced: 25 Mar 2025
https://github.com/nabilshadman/multiprocessing-time-series-data-simulation
A simulation using multiprocessing to generate 10,000 time dependent samples from an initial dataset of 20 samples
multiprocessing numpy pandas scipy simulation statistics time-series
Last synced: 05 May 2026
https://github.com/marcpinet/mtsad-benchmarks-are-mostly-univariate
🔍 Evidence that current multivariate time series anomaly detection benchmarks don't actually test cross-channel modeling.
anomaly-detection benchmarks data-science datasets machine-learning statistics synthetic-data
Last synced: 28 Jun 2026
https://github.com/rici4kubicek/statistics
Implementation of simple statistics functionality for embedded systems.
embedded embedded-systems math max min statistics
Last synced: 27 Feb 2026
https://github.com/diegopinate/docker-stats-viewer
Simple docker stats viewer made with copilot
docker html javascript npm statistics
Last synced: 12 Apr 2026
https://github.com/m-dadej/pd_estimation
Estimation of probability of default on novel data from Orbis
data-science predictive-modeling statistics
Last synced: 02 Jul 2025
https://github.com/ahmedmmahrous/sales-data-analysis-e-commerce-case-study-
Sales Data Analysis E-Commerce Case Study
deployment e ecommerce pandas plotly python retail seaborn statistics streamlit
Last synced: 16 Apr 2026
https://github.com/aliramazanyildirim/aliramazanyildirim
My GitHub contribution graph is not statistics this time, but a space battle. As I code, the enemies grow stronger, and the game updates itself every day 🎮✨
contributions github graph space-battles statistics
Last synced: 13 Feb 2026
https://github.com/artginzburg/npmstalk
JS module · Total downloads of an NPM maintainer
badge github npm npm-package statistics
Last synced: 14 Feb 2026
https://github.com/a-herzog/multimodalfit
MultimodalFit is a Python package for fitting a combination of multiple distributions to one measurement series.
distribution-fitting jupyter-notebook python statistics
Last synced: 28 Feb 2026
https://github.com/cybcon/docker.dockerhubstats2mqtt
Collect repository statistics from Dockerhub and publish them to a MQTT topic.
docker-image dockerhub mosquitto-pub statistics
Last synced: 14 Apr 2026
https://github.com/stdlib-js/stats-array-variancepn
Calculate the variance of an array using a two-pass algorithm.
array deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib unbiased var variance
Last synced: 18 May 2026
https://github.com/dideler/course-level-3-certificate-data-science
Level 3 Certificate in Data - Spring 2025 - City of London (Guildhall)
Last synced: 24 Mar 2025
https://github.com/dav-idka-j/stash-plugins
A collection of plugins for stash
javascript stash stash-plugin stashapp statistics
Last synced: 28 Feb 2026
https://github.com/ptfonseca/inspector
inspector: Validation of arguments and objects in user-defined functions
input-validation r r-package statistics validation validations
Last synced: 21 Feb 2026
https://github.com/democritus-project/d8s-stats
Democritus functions for working with statistics.
democritus python statistics statistics-utility utility
Last synced: 01 Apr 2025
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/mncube/mxsrquick
Streamline workflows for Bayesian mixing model and MixSIAR projects
Last synced: 27 May 2026
https://github.com/aleksandrhovhannisyan/statisticalinferencesinr
Custom package for performing statistical inferences in the R programming language. Written for STA3032 Engineering Statistics to make my life easier.
Last synced: 18 Apr 2025
https://github.com/stefanocoretta/qdal
bayesian-statistics brms r-stats statistics statistics-course tidyverse
Last synced: 20 Jan 2026
https://github.com/mattsebastianh/AB-Testing-at-Nosh-Mish-Mosh-Project
Analyze Data with Python | Hypothesis Testing with Scipy | Sample Size Determination
ab-testing lift sample-size-determination statistics
Last synced: 18 Jun 2026
https://github.com/alexp11223/d3flightsdatavisualization
Some visualizations for Kaggle 2015 Flight Delays and Cancellations dataset using d3.js and Leaflet
d3 d3js dataset flight-map leaflet sql statistics visualization
Last synced: 31 Mar 2025
https://github.com/joeribreedveld/strong-stats
Strong workout insights, private, free and simple.
free gym nextjs originui recharts shadcn-ui statistics stats strong tailwindcss visualize workout
Last synced: 07 May 2026
https://github.com/kellsworks/data-science-python-scripts
Data science python scripts
beautifulsoup4 numpy pandas python python-csv selenium selenium-webdriver statistics
Last synced: 12 Apr 2026
https://github.com/sodascience/workshop_efficient_microdata
Workshop for efficiently doing projects with CBS microdata.
computing microdata project-organisation statistics
Last synced: 19 Mar 2026
https://github.com/joeloparco/laptop-analysis
Final Project for COSC 3570 intro to Data Science. Project aimed to find a relationship between laptop price and other laptop characteristics using linear regression.
juypter-notebook latex python statistical-analysis statistics
Last synced: 07 May 2026
https://github.com/zietzm/webgwas-analysis
Figures and analysis for the WebGWAS project paper
Last synced: 07 May 2026
https://github.com/juanfranciscocis/probability-and-statistics
University project, using a poll data base and python make a full Probability and Statistics data research.
googlecolab latex probability python3 statistics
Last synced: 08 May 2026
https://github.com/eric15342335/stat2602
STAT2602 Probability and statistics II [Section 1A, 2024]
Last synced: 02 Mar 2026
https://github.com/oncoray/power.transform
Repository for the power.transform R package
Last synced: 02 Mar 2026
https://github.com/alexander-ignition/stepic-statistics
Fundamentals of Statistics
python python3 statistics stepic
Last synced: 08 May 2026
https://github.com/qbarthelemy/stats-simple-cpp
Library for statistics in simple C++, for different sequence containers of different numeric data types.
c-plus-plus cplusplus machine-learning scientific-computing statistics
Last synced: 28 Mar 2025
https://github.com/fauzancodes/variance-and-standard-deviation-calculator
Variance & Standard Deviation Calculator
standard-deviation statistics variance
Last synced: 05 Jan 2026
https://github.com/bzubs/mlzero
Implementation of widely used ML algorithms in vanilla python
machine-learning machine-learning-algorithms machinelearning numpy python statistics
Last synced: 03 Mar 2026
https://github.com/justdvnsh/algorithms
A list of all major algorithms, their tests and explanations ranging all the way from basic to advanced.
advanced algorithms basic data-structures intermediate javascript machine-learning-algorithms statistics
Last synced: 12 Jul 2025
https://github.com/marcozanotti/tsforecasting-dashboard
Forecast App
deep-learning forecasting machine-learning statistics timeseries
Last synced: 21 Feb 2026
https://github.com/sahiltiwariiii/dssp
Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc
docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit
Last synced: 27 Mar 2026
https://github.com/intranda/goobi-plugin-statistics-sudan-memory
This Statistics plugin for Goobi workflow determines the activity of edits to translations within specific metadata fields.
digitisation goobi goobi-workflow plugin statistics
Last synced: 14 Mar 2026
https://github.com/thautwarm/bioinfoplus
A tool framework for bioinformatics written in multiple scientific languages.
bioinformatics scientific-computing statistics toolchain
Last synced: 08 May 2026
https://github.com/queelius/compositional.mle
Composable MLE solvers: a DSL for maximum likelihood estimation where solvers are first-class functions that combine via chaining, racing, and restarts
composable dsl estimation maximum-likelihood mle mle-estimation numerical-methods optimization r-package statistics
Last synced: 04 Mar 2026
https://github.com/saurabh274/aerofit-statistics-and-probability
The market research team at AeroFit wants to identify the characteristics of the target audience for each type of treadmill offered by the company, to provide a better recommendation of the treadmills to the new customers. The team decides to investigate whether there are differences across the product with respect to customer characteristics.
matplotlib-pyplot numpy pandas probability python seaborn statistics
Last synced: 08 May 2026
https://github.com/stdlib-js/stats-strided-dnanmin
Calculate the minimum value of a double-precision floating-point strided array, ignoring NaN values.
array domain extent extremes float64 javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib strided strided-array typed
Last synced: 08 May 2026
https://github.com/elgohr/github-action-analyzer
Analyzer for the usage of Github Actions
actions analyzer github statistics usage
Last synced: 09 May 2026
https://github.com/nikhilfuke1/hypothesis-testing-analysis-python-statistics
This project explores hypothesis testing techniques in Python, focusing on analyzing real-world data to draw meaningful conclusions. project also emphasizes presenting findings effectively through data storytelling and impactful visual presentations.
hypothesis-testing pandas python statistics
Last synced: 09 May 2026