Statistics
Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.
- GitHub: https://github.com/topics/statistics
- Wikipedia: https://en.wikipedia.org/wiki/Statistics
- Related Topics: data-science, machine-learning, deep-learning, neural-network,
- Last updated: 2026-06-27 00:25:54 UTC
- JSON Representation
https://github.com/gabboraron/biostatisztika_es_alkalmazasai
"A statisztika a matematika azon ága, melynek feladata, hogy eszközt adjon a politikusok kezébe, mellyel tetszőleges állítás és annak ellentéte is tudományos alapon igazolható"
biostatistics data-analysis data-visualization r statistics statistics-course
Last synced: 24 Oct 2025
https://github.com/rodrigojunqueiradev/rodrigojunqueiradev.github.io
Professional Portfolio - Rodrigo Junqueira
analytics artificial-intelligence data-analysis data-engineering data-science data-visualization machine-learning mathematics nosql powerbi python r sql statistics
Last synced: 15 May 2026
https://github.com/sharmas1ddharth/10_days_of_statistics_hackerrank
The code in this repository is the solution of HackerRank's 10 day of statistics challenge problems.
10daysofstatistics hackerrank hackerrank-solutions r statistics
Last synced: 07 Oct 2025
https://github.com/mrtkp9993/datascienceexamples
Data science examples with Python and Julia.
data-science datascience julia-language python3 statistics veri-bilimi
Last synced: 10 May 2026
https://github.com/kashyap-prabhat/sigma
A Scala library for probability and statistics formulas, including rules for probability calculations.
data formulas library mathematics probability scala statistics
Last synced: 06 Oct 2025
https://github.com/jo-tham/geosample
Generate representative sample locations from spatial data
gis spatial-analysis statistics
Last synced: 22 Jan 2026
https://github.com/gfyoung/statwrappers
Useful wrapper classes around Python stat library functionality
Last synced: 28 Aug 2025
https://github.com/stolsky/popular-baby-names
Interactive app listing popular baby names by year
baby-names birth-rates d3js destatis-data statistics
Last synced: 14 Oct 2025
https://github.com/eva-kaushik/probnetx
ProbNetX is a research-grade Bayesian network framework that learns conditional dependencies from discrete datasets and performs exact inference for predicting missing values.
algorithms-and-data-structures machne-learning naive-bayes-classifier statistics
Last synced: 14 Oct 2025
https://github.com/josepablodmg/python--linear-regression---housing-exercise
A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.
california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization
Last synced: 05 Oct 2025
https://github.com/palewire/ipsos-credibility-interval
A Python tool that calculates Bayesian credibility intervals for online polling using the Ipsos method
bayesian data-journalism journalism news polling python statistics
Last synced: 19 Feb 2026
https://github.com/ik5/tracepath
For those who trespass against us
golang graph plot plotting statistics
Last synced: 05 Oct 2025
https://github.com/steffin12-git/logistic-regression-social-network-ads-ml
Built an interpretable Logistic Regression model to predict whether a user will purchase a product from social network ads using demographic and behavioral features. The notebook demonstrates a complete ML workflow — data ingestion, preprocessing, scaling, modeling, evaluation, and visual diagnostics.
matplotlib-pyplot pandas python seaborn sklearn statistics
Last synced: 03 May 2026
https://github.com/ilke-kas/multivariate-data-analysis
A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.
classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics
Last synced: 05 Oct 2025
https://github.com/stdlib-js/stats-strided-smskrange
Calculate the range of a single-precision floating-point strided array according to a mask.
dispersion domain extent extremes javascript math mathematics max maximum min minimum node node-js nodejs range statistics stats stdlib strided strided-array
Last synced: 30 Apr 2026
https://github.com/ccrisc/metaanalysis
A meta-analysis whether an increase in minimum wage compress wage inequality.
data-analysis-r learn meta-analysis statistics
Last synced: 23 Aug 2025
https://github.com/mohdrasmil7/ml-notebooks
This repository contains Jupyter notebooks demonstrating machine learning exercises using both supervised classification and unsupervised algorithms. Each notebook offers a hands-on approach to understanding and applying ML techniques to real-world datasets, providing valuable insights and practical skills for data analysis and predictive modeling.
machine-learning-algorithms neural-networks statistics supervised-learning unsupervised-learning
Last synced: 15 Oct 2025
https://github.com/bmwant/spoor
Track invocations of methods and functions
ab-testing apm data-collection debugging library-tools metrics metrics-gathering performance profiling python statistics summary tools tracing tracking utility
Last synced: 15 Oct 2025
https://github.com/lvmalware/lsm-module
A simple statistics module, which provides 4 basic types of regressions using the Least Squares Method (LSM)
data-analysis least-square-regression regression regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/tyriek-cloud/statistical-work-sample
The purpose of this study is to observe if a sample of people that has siblings is independent of a sample of people that possess an opinion of whether patients with incurable diseases should be allowed to die.
analysis data spss statistics t-test
Last synced: 22 Jan 2026
https://github.com/connorodea/statistics-toolkit-cli
📊 A comprehensive command-line statistics learning tool with step-by-step explanations
cli edtech education learning mathematics python statistics
Last synced: 14 Jan 2026
https://github.com/adelmofilho/adelmofilho.github.io
Blog Pessoal
data-science education r statistics
Last synced: 31 Mar 2025
https://github.com/aayushwankhade/z
z is a versatile programming language known for its simplicity and ease of use in developing web applications. With a strong focus on clean, readable code and efficient performance, z is ideal for both beginner and experienced developers looking to create high-quality software.
apache chatgpt data-engineering data-science data-visualization fish free game immutability machine-learning pattern-matching python statistics zwave
Last synced: 07 Sep 2025
https://github.com/jhrcook/demeter2-stan
The DEMETER2 model of the impact of shRNA on cell line growth in Stan.
bayesian bioinformatics biostatistics demeter2 stan statistics
Last synced: 18 Oct 2025
https://github.com/shenxianpeng/gitstats-action
GitHub Action that generates insightful visual reports from Git repositories using GitStats
composite-action git git-stats github-actions report statistics
Last synced: 27 May 2026
https://github.com/kddubey/microarray-kaggle
Analyze a dataset with 72 observations and 7,129 features
Last synced: 08 Apr 2025
https://github.com/polymathorg/project-proposals
Project proposals and idea list for PolyMath community
data-science ideas math mathematics numerical-methods pharo pharo-smalltalk project-proposal smalltalk statistics
Last synced: 15 Mar 2025
https://github.com/gastonstat/rcompendium
Comprehensive collection of slides about R
data-science introduction-to-r r-language r-programming slides statistics
Last synced: 24 Feb 2026
https://github.com/johnkou97/numericalrecipes
bash differentiation fft fitting integration latex matrix-decompositions minimization neural-network numerical-analysis numerical-methods ode polynomial-regression python random-generation scipy shell sorting-algorithms statistics tex
Last synced: 11 Apr 2026
https://github.com/cesar312/python-data-science-toolbox
A collection of useful data science tools and techniques
data-science jupyter-notebook pandas python scikit-learn statistics visualization
Last synced: 13 Apr 2026
https://github.com/olekscode/statisticseconometrics
My solutions to the assignments from Elements of Statistics, Econometrics, and Time Series Analysis course at UCU
course econometrics homework r statistics time-series
Last synced: 06 Jul 2025
https://github.com/pordarman/ultimate-stat-bot
A comprehensive statistics bot for Discord servers that tracks user voice and message activity. Features detailed leaderboards, personal stat lookups (!me), channel analysis, a blacklist system, and persistent data storage with MongoDB.
analytics-bot discord-bot discord-js discordjs-v14 javascript mongodb nodejs statistics statistics-bot
Last synced: 03 May 2026
https://github.com/baschin1103/machine-learning-linear-regression-tsi
The model is the multiple linear regression loaded from scikit-learn. The aim is to create a model to forecast the so called Total-Skill-Index (TSI) of a player from hattrick.org. More information on: https://wiki.hattrick.org/wiki/Total_Skill_Index. In this repository is the programm and necessary data.
csv linear-regression machine-learning numpy pandas python statistics
Last synced: 11 May 2026
https://github.com/stdlib-js/stats-array-variancetk
Calculate the variance of an array using a one-pass textbook algorithm.
array deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib unbiased
Last synced: 30 Apr 2026
https://github.com/beliavsky/vector-error-correction
Simulate and fit from Vector Error Correction (VECM) models for cointegrated time series using the Johansen method
cointegration econometrics error-correction-model fortran multivariate-time-series multivariate-time-series-analysis simulation statistics time-series-analysis vecm vector-error-correction-model
Last synced: 27 May 2026
https://github.com/grctest/grc-magnitude-mapreduce-hadoop
GRC Magnitude MapReduce Hadoop
boinc gridcoin hadoop hdfs magnitude mapreduce no-team-req statistics
Last synced: 26 Mar 2025
https://github.com/jacekkala/statistics_hypothesis_testing
Statistics & Hypothesis Testing in Python
charts hypothesis-testing jupyter-notebook matplotlib numpy pandas python scipy-stats seaborn statistics
Last synced: 11 Feb 2026
https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle
Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9
classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml
Last synced: 11 Apr 2026
https://github.com/alan-y/blogdown-website
This is my personal website and blog built using the blogdown R package and deployed with Netlify.
Last synced: 27 May 2026
https://github.com/andrii-zapukhlyi/otomoto_visualization
Scraping, data visualization, and building a price prediction model with data from the car classifieds website otomoto.pl
data-analysis machine-learning r scraping statistics visualization
Last synced: 26 Jul 2025
https://github.com/lajuman/proportion-of-probability
calculator probability python statistics
Last synced: 17 Jun 2026
https://github.com/dmarks84/coursework_project_text-mining-spam-analysis
Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.
classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization
Last synced: 11 Apr 2026
https://github.com/nanodba/sp_statupdate
Priority-based statistics maintenance for SQL Server 2016+
database-maintenance dba-tools microsoft-sql-server ms-sql-server performance performance-analysis query-store sql-server statistics statistics-maintenance stored-procedure t-sql t-sql-scripts
Last synced: 10 Jun 2026
https://github.com/g4brielvs/etudes
:robot: My collection of études
data-science etudes mathematics pytudes statistics teaching
Last synced: 23 Mar 2025
https://gitlab.com/prebens-phd-adventures/universal-edit-distance
A small Python library containing some generic metrics implemented in Rust
automatic speech recognition (ASR) metrics python rust statistics
Last synced: 07 May 2026
https://github.com/kamicollo/blog-posts
Code behind aurimas.eu blog posts
ab-testing analytics bayesian blog data-science data-visualization statistics
Last synced: 05 Jan 2026
https://github.com/mituskillologies/krai-sppu-mca
The repository contains all the practicals of subject "Knowledge Representation in Artificial Intelligence" subject of MCA under Savitribai Phule Pune University, Pune. Programmed by Tushar B. Kute.
artificial-intelligence artificial-neural-networks convolutional-neural-network decision-trees deep-learning machine-learning neural-networks recurrent-neural-networks statistics support-vector-machine
Last synced: 06 Jul 2025
https://github.com/suvasish114/ml-models
Machine Learning models
jupyter-notebook linear-algebra machine-learning-algorithms mathematics probability-distribution statistics
Last synced: 27 Jul 2025
https://github.com/wattyven/opencovidca-dashboard
A Jupyter notebook for making quick and dirty visualizations of Canadian COVID statistics using data from the OpenCOVID API. (API unfortunately deprecated)
canada-covid-19 covid-19 data-visualization jupyter jupyter-notebook statistics
Last synced: 14 Mar 2025
https://github.com/maxbiostat/binarymarkovchains
Code to fit and explore two-state discrete-time Markov Chains (DTMCs)
Last synced: 23 Jan 2026
https://github.com/sofiia-chorna/estimation-et-identification-statistique-project
Estimation et identification statistique - Course Project (MA201)
Last synced: 23 Jun 2026
https://github.com/callmemaverick/deyields_mro
Interactive Streamlit app analyzing German 10-Year Bond Yields & ECB’s MRO Rate with visualizations, regression analysis, and correlation insights.
data-science python python3 statistics
Last synced: 16 May 2025
https://github.com/gyf9712/stat-theory-skills
A 4-skill pipeline for Claude Code: verify mathematical proofs → repair with literature support → sharpen the theory → write corrected proofs. Integrates Codex MCP for adversarial cross-review. Venue-audited reference library across statistics/econometrics/ML theory.
claude-code claude-skills econometrics latex machine-learning-theory mathematical-proof proof-verification research-tools statistics theoretical-statistics
Last synced: 27 May 2026
https://github.com/bt-88/deltasight-statistics
Provides efficient tracking of common statistical descriptors (mean, st. dev., sum, count) of a changing numeric sample
Last synced: 14 Jan 2026
https://github.com/murilobsd/capybara
:neckbeard: Capybara: C DataFrames for geeks who like numbers. :alien:
analytics bigdata c-plus-plus dataframe geeksforgeeks pandas statistics
Last synced: 16 Apr 2026
https://github.com/rakibhhridoy/differentprojects
Some of my learning projects that I practice to launch in data science. Not all, but some of few that was stored in my local repository. It can be useful for beginner data science enthusiast. Explore and learn!
data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics
Last synced: 11 Apr 2026
https://github.com/conjfrnk/statistics-projects
Some projects I worked on for AP statistics (2020-2021)
ap-statistics math probability statistics
Last synced: 26 Oct 2025
https://github.com/victoorv/analyse_biostatistique
Mémoire détaillé sur les tests multiples en biostatistique.
mathematics multiple-testing multiple-testing-correction probability r research research-project statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/thechibo/estim
Distribution Parameter Estimation - DEPRECATED - please use https://github.com/thechibo/joker
estimation maximum-likelihood-estimation moment-estimation probability-distribution r r-package statistics
Last synced: 27 May 2026
https://github.com/genietim/ache-analyzer
Principal component and other statistical analysis to detect correlations to aches
ache fitbit health statistics weather
Last synced: 08 Apr 2025
https://github.com/akimuddinshaikh/statistics-project-1
find the best combination of multiple linear regression models to estimate the predicted cancer-related mortality rate per county using the county's cancer- increased incidence and accessible macroeconomic data.
Last synced: 26 Oct 2025
https://github.com/bessarodrigo/hypothesis_test_healthy_program
Teste de Hipóteses da média de uma população para avaliar o nível de colesterol dos colaboradores de uma empresa.
hypothesis-test hypothesis-testing hypothesis-tests python statistics
Last synced: 02 May 2026
https://github.com/spacebakery/variance-in-weather-project
Statistics for Data Analysis | Variance and Standard Deviation
data-analysis python standard-deviation statistics variance
Last synced: 05 Jul 2025
https://github.com/nanotubing/statistics
Spatial Statistical analyses created using R and RStudio for an "Advanced Statistics for Urban Applications" at Temple University
autocorrelation geographically-weighted-regression r spatial-autocorrelation statistics
Last synced: 11 Mar 2026
https://github.com/pawel-slowik/git-stats
Display contributor statistics for a Git repository.
Last synced: 27 Apr 2026
https://github.com/stdlib-js/stats-strided-smaxabssorted
Calculate the maximum absolute value of a sorted single-precision floating-point strided array.
absolute absolute-value domain extent extremes javascript magnitude math mathematics max maxabs maximum node node-js nodejs range sorted statistics stats stdlib
Last synced: 27 Apr 2026
https://github.com/stdlib-js/random-tools
Pseudorandom number generator ndarray creation function tools.
javascript math mathematics multidimensional namespace ndarray node node-js nodejs prng pseudorandom rand random rng statistics stats stdlib tools utilities utils
Last synced: 27 Apr 2026
https://github.com/stdlib-js/stats-strided-snanmax
Calculate the maximum value of a single-precision floating-point strided array, ignoring NaN values.
array domain extent extremes float32 javascript math mathematics max maximum node node-js nodejs range statistics stats stdlib strided strided-array typed
Last synced: 27 Apr 2026
https://github.com/gyselle-marques/calculodemetricas-desafiodio
Machine Learning: Cálculo de Métricas de Avaliação de Aprendizado em Python.
accuracy brazilian-portuguese colab-notebook confusion-matrix dio-bootcamp dio-challenges f-measure f-score machine-learning matplotlib numpy precision python recall sklearn specificity statistics xgboost
Last synced: 27 Apr 2026
https://github.com/drkane/area-profiles
Produce UK area profiles based on various data sources
dash-plotly data flask statistics uk
Last synced: 27 Apr 2026
https://github.com/jurgenjacobsen/dsc.stats
A easy to use statistics package mainly for discord bots.
Last synced: 27 Apr 2026
https://github.com/stdlib-js/stats-strided-dsmean
Calculate the arithmetic mean of a single-precision floating-point strided array using extended accumulation and returning an extended precision result.
arithmetic-mean array average avg central-tendency float float32 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed
Last synced: 28 Apr 2026
https://github.com/vnery5/projetos
Repositório para abrigar alguns projetos pessoais feitos no intuito de aprender um pouco de Estatística e Machine Learning
machine-learning python statistics
Last synced: 28 Apr 2026
https://github.com/dadosdelaplace/dadosdelaplace.github.io
Web/blog de Javier Álvarez Liébana (@dadosdelaplace)
blog dataviz divulgacion programming r statistics website
Last synced: 28 Apr 2026
https://github.com/frankfont/rawloadtester
This javascript/PHP application calls the URL you select as many times as you choose and tells you how long it took the server to respond.
javascript php-log qatools statistics
Last synced: 28 Apr 2026
https://github.com/simranjeet97/data-science
Data Science Programs Based on Mathematics, Statistics and Machine Learning.
algrothm data-science machine-learning mathematics python3 statistics
Last synced: 08 Jun 2026
https://github.com/stdlib-js/stats-strided-dnanmean
Calculate the arithmetic mean of a double-precision floating-point strided array, ignoring NaN values.
arithmetic-mean array average avg central-tendency double float64 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed
Last synced: 28 Apr 2026
https://github.com/stdlib-js/stats-strided-nanmeanwd
Calculate the arithmetic mean of a strided array, ignoring NaN values and using Welford's algorithm.
arithmetic-mean array average avg central-tendency javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array welford
Last synced: 28 Apr 2026
https://github.com/johanneswiesner/statplot
A repository for creating publishable annotated plots
annotations plotting seaborn statistics
Last synced: 08 Jun 2026
https://github.com/antoniojcosta/probability-and-statistics-exercises
Probability and statistics exercises solved using python
excel jupyter-notebook matplotlib probability python statistics
Last synced: 28 Apr 2026
https://github.com/boldandbrad/py-elo-db
python tool for calculating and storing elo values and statistics for two sided match-based games
Last synced: 22 Jun 2026
https://github.com/codegeekr/test_datasciencestarter
test Data Science Starter
analytics data data-science data-visualization machine-learning python science starter-kit statistics test
Last synced: 28 Apr 2026
https://github.com/rpodcast/opensub-ness2025
R-based Submissions talk at NESS 2025
Last synced: 29 Apr 2026
https://github.com/ouhscbbmc/randomrpql
A variable selection method for generalize linear mixed-effect models
Last synced: 29 Apr 2026
https://github.com/misha-mayskiy/lootbox_analytics
Lootbox Analytics: Your personal dashboard for tracking and analyzing lootbox/gacha opening statistics from popular games. Currently supports Genshin Impact with detailed Pity/luck analysis. (Python, Flask, SQLAlchemy)
chartjs data-visualization flask gacha game-analytics genshin-impact pity-tracker python sqlalchemy statistics
Last synced: 29 Apr 2026
https://github.com/stdlib-js/stats-array-nanmeanwd
Calculate the arithmetic mean of an array, ignoring NaN values and using Welford's algorithm.
arithmetic-mean array average avg central-tendency domain extent javascript math mathematics mean node node-js nodejs statistics stats stdlib
Last synced: 29 Apr 2026
https://github.com/v-mayya/programming-statistics-and-econometrics-resources
Programming, statistics and econometrics resources
econometrics programming python r statistics
Last synced: 29 Apr 2026
https://github.com/bilgeswe/datascience
Using Statistics and Data Science Project in Python using Google Colab tools, CVS and XLSX
box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx
Last synced: 29 Apr 2026
https://github.com/stdlib-js/stats-array-variancepn
Calculate the variance of an array using a two-pass algorithm.
array deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib unbiased var variance
Last synced: 18 May 2026
https://github.com/stdlib-js/stats-strided-scumax
Calculate the cumulative maximum of single-precision floating-point strided array elements.
accumulate cumulative domain extent extremes javascript math mathematics max maximum node node-js nodejs range statistics stats stdlib strided strided-array typed
Last synced: 29 Apr 2026
https://github.com/stdlib-js/stats-base-ztest-one-sample-results-to-string
Serialize a one-sample Z-test results object as a formatted string.
javascript node node-js nodejs statistics stats stdlib string util utilities utility utils z-test ztest
Last synced: 29 Apr 2026
https://github.com/jxsl13/playerranking
A working draft of a ranking server for player statistics using either Redis or SQLite3 with automatic reconnect handling and backlogging of tasks, that were not saved during database outage times. This is mainly used in the zCatch repository.
cpp database player rank ranking redis sqlite sqlite3 statistics
Last synced: 29 Apr 2026
https://github.com/fanisgl/video-games-sales-data
Data Analysis of Sales Dataset using Python.
data-analysis data-science data-visualization dataset jupyter-notebooks matplotlib numpy pandas poisson-distribution python python3 sales statistics
Last synced: 29 Apr 2026
https://github.com/jose-jaen/amazon-nlp
Using NLP algorithms and Machine Learning modeling a Sentiment Classifier is built for Amazon Reviews
algorithms big-data machine-learning nlp python r statistics supervised-learning
Last synced: 29 Apr 2026