An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/olekscode/statisticseconometrics

My solutions to the assignments from Elements of Statistics, Econometrics, and Time Series Analysis course at UCU

course econometrics homework r statistics time-series

Last synced: 06 Jul 2025

https://github.com/srvanderplas/statistical_atlas

Framed Charts and the Statistical Atlas of 1870

census data ggplot2 graphics r statistics visualization

Last synced: 29 May 2026

https://github.com/cesar312/python-data-science-toolbox

A collection of useful data science tools and techniques

data-science jupyter-notebook pandas python scikit-learn statistics visualization

Last synced: 13 Apr 2026

https://github.com/imbryan/day-tracker

Daily statistics tracking

calendar python3 statistics tracker

Last synced: 31 May 2026

https://github.com/justdvnsh/algorithms

A list of all major algorithms, their tests and explanations ranging all the way from basic to advanced.

advanced algorithms basic data-structures intermediate javascript machine-learning-algorithms statistics

Last synced: 12 Jul 2025

https://github.com/rayraegah/mean-value-adjuster

A python class to combat troll votes in ranking systems

python ranking-methods statistics

Last synced: 24 Oct 2025

https://github.com/crhntr/visual_stats

Some visualizations for Math352 Undergradutate Probability and Statistics

education probability statistics vuejs2

Last synced: 07 Sep 2025

https://github.com/dmarks84/coursework_project_sentiment-analysis

Project for University of Michigan Python Programming Specialization -- Read in tweets and analyzed their content to perform basic sentiment analysis

classification programming python sentiment-analysis statistics web-scraping

Last synced: 09 Apr 2025

https://github.com/komangandika/stat

I like statistic (pls help me), yeah so basically this repo is about my journey to the world of statistics because why not right?

notes statistics

Last synced: 24 Feb 2026

https://github.com/lazernata/transport-problem

Bachelor's Thesis Work: Shiny app to solve the transport problem. Available in Spanish and English

operational-research rstudio shinyapps statistics

Last synced: 31 Mar 2025

https://github.com/dcs-training/introtostatistics

This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file

data-analysis r rmarkdown statistics

Last synced: 26 Mar 2025

https://github.com/alek-dr/math4ds

Примеры на Python и R из Сборника задач по математике т.4 под редакцией А.В.Ефимова и А.С.Попелова, Schaum's Outline of Theory and Problems of Statistics 3rd Edition by Murray R Spiegel, Larry J Stephens

education statistics

Last synced: 12 Mar 2026

https://github.com/simonbaese/nonrevmcmc

Non-reversible Monte Carlo simulations of spin models

examples monte-carlo-simulation random-walk spin-dynamics statistics

Last synced: 14 Feb 2026

https://github.com/redzic/statistics

Statistical library written in Rust

statistics

Last synced: 30 Mar 2025

https://github.com/vuevska/vaccine-statistics

Семинарска работа во R по предметот Бизнис статистика, ФИНКИ 2021

r r-programming r-studio statistical-analysis statistics statistics-course

Last synced: 24 Oct 2025

https://github.com/sixarm/sixarm_shell_statistics

SixArm.com » Shell » Statistics commands

awk numcommand scripts shell statistics unix

Last synced: 30 Mar 2025

https://github.com/alex1990/numstat

Output the measures of a list of numbers, such as sum, minimum, maximum and so on.

statistics

Last synced: 16 May 2025

https://github.com/ayakovlenko/scalai

Artificial Intelligence in Scala

linear-algebra machine-learning statistics

Last synced: 17 Mar 2025

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 16 Apr 2026

https://github.com/artainmo/machine-learning-bootcamp

42AI project, organization related to 42 school. In this python machine-learning bootcamp we go over statistics, linear-regression, logistic-regression and regularization.

42school artificial-intelligence bootcamp linear-regression logistic-regression python statistics

Last synced: 16 Mar 2025

https://github.com/shenxianpeng/gitstats-action

GitHub Action that generates insightful visual reports from Git repositories using GitStats

composite-action git git-stats github-actions report statistics

Last synced: 27 May 2026

https://github.com/windi-wulandari/sentiment-analysis-imdb

This project analyzes sentiment from 50,000 IMDb movie reviews, aiming to enhance time and cost efficiency. The Naive Bayes model with TF-IDF yielded the best results, achieving up to 99% savings in time and cost, surpassing the initial goals.

imdb-dataset machine-learning sentiment-analysis sentiment-classification statistics supervised-learning

Last synced: 17 Mar 2025

https://github.com/chmue/vam-bayes-intro

A brief introduction to Bayesian Statistics for Very Applied Methods

bayes introduction statistics

Last synced: 04 Jan 2026

https://github.com/pratycodes/simulated-option-pricer

Monte Carlo-based Option Pricing library in C++ with support for European and Asian options featuring parallelization.

cpp machine-learning quantitative-finance statistics

Last synced: 02 Apr 2026

https://github.com/rohankalbag/probability-random-processes

Probability and Random Processes - Autumn Semester 2021 - Indian Institute of Technology Bombay

probability python random-processes statistics

Last synced: 16 May 2026

https://github.com/lingumd/mechacar_statistical_analysis

Analysis on MechaCar data to help the manufacturing team, using R to run multiple linear regression, summary statistics, and t-tests.

multiple-linear-regression r-studio statistical-analysis statistics summary-statistics t-tests

Last synced: 16 Mar 2025

https://github.com/elkronos/bmber

Bayesian Model Building and Evaluation Repository

bayesian-inference bayesian-statistics r rstan sensitivity-analysis statistics

Last synced: 17 Mar 2025

https://github.com/kddubey/microarray-kaggle

Analyze a dataset with 72 observations and 7,129 features

machine-learning statistics

Last synced: 08 Apr 2025

https://github.com/shoaib1522/discrete-structures

This is the proper channel and a repository for all of you to learn Discrete Structures to understand the basic concepts of Mathematics in Computer Science.

algorithms combinations gates graph predicates probability set statistics theory

Last synced: 08 Apr 2025

https://github.com/mfansler/hmmcov

Migration of https://code.google.com/archive/p/hmmcov/

bioinformatics hmm statistics

Last synced: 08 Apr 2025

https://github.com/wilsoon7721/whatsapp-chatnalyzer

A tool designed to help users extract various statistics based on their WhatsApp chat exports, providing insights into messaging patterns, participant contributions, and more in a user-friendly interface.

chat statistics trends whatsapp

Last synced: 13 Apr 2026

https://github.com/minesja/poker-service

A monte-carlo simulation app for poker. Run thousands of rounds holding certain variables constant to calculate probabilities of the game.

monte-carlo poker statistics

Last synced: 20 Mar 2025

https://github.com/makcymal/silvera

My researches on ML and statistics, optimization methods, CS algoritms and numerical methods

algorithms data data-structures machine-learning numerical-methods statistics

Last synced: 01 Apr 2025

https://github.com/venkat-a/olympiananalytics

OlympianAnalytics uncovers Olympic trends, focusing on medal distribution by region, gender, and athlete attributes. It highlights dominance shifts and the role of inclusivity in sports, offering key insights through data visualization.

data-engineering data-visualization statistics tableau

Last synced: 10 Feb 2026

https://github.com/bilgehangecici/r-assignments

R projects that are useful for understanding the area of Statistics and Probability.

r statistics

Last synced: 30 Mar 2025

https://github.com/nanotubing/statistics

Spatial Statistical analyses created using R and RStudio for an "Advanced Statistics for Urban Applications" at Temple University

autocorrelation geographically-weighted-regression r spatial-autocorrelation statistics

Last synced: 11 Mar 2026

https://github.com/beliavsky/fit-student-t-distribution

Fit a Student t distribution to univariate data via maximum likelihood

maximum-likelihood-estimation nelder-mead probability-distribution statistics t-distribution

Last synced: 27 Feb 2026

https://github.com/taha-parsayan/brain-suvr-reference-region-measurements

Assessed the impact of various brain reference regions in the calculation of human brain glucose uptake values from [18]F-FDG-PET data. Data storage and analysis were based on Cloud, image processing were implemented using Python, and statistical analysis was implemented using R.

brain glucose reference statistics suv uptake

Last synced: 25 Mar 2025

https://github.com/aayushwankhade/z

z is a versatile programming language known for its simplicity and ease of use in developing web applications. With a strong focus on clean, readable code and efficient performance, z is ideal for both beginner and experienced developers looking to create high-quality software.

apache chatgpt data-engineering data-science data-visualization fish free game immutability machine-learning pattern-matching python statistics zwave

Last synced: 07 Sep 2025

https://github.com/connorodea/statistics-toolkit-cli

📊 A comprehensive command-line statistics learning tool with step-by-step explanations

cli edtech education learning mathematics python statistics

Last synced: 14 Jan 2026

https://github.com/govind-prakash/r

A collection of R scripts designed for basic bioinformatics and biostatistics projects. This repository includes scripts for data analysis, visualization, and statistical modeling, catering to researchers and students in life sciences

data-science data-visualization r r-base rstudio statistics

Last synced: 05 Sep 2025

https://github.com/raunaksingh9800/the-monty-hall-problem

Explore the fascinating Monty Hall Problem through this C++ simulation!

probability statistics

Last synced: 25 Mar 2025

https://github.com/DCS-training/IntroToStatistics

This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file

data-analysis r rmarkdown statistics

Last synced: 25 Apr 2025

https://github.com/noopur-zambare/nc_pruning

Dense network pruning algorithm inspired from Neural Collapse geometry

deep-learning robust-machine-learning statistics trustworthy-ai

Last synced: 08 May 2026

https://github.com/stdlib-js/stats-strided-dnanstdevyc

Calculate the standard deviation of a double-precision floating-point strided array ignoring NaN values and using a one-pass algorithm proposed by Youngs and Cramer.

array deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 29 Apr 2026

https://github.com/swharden/cleananalytics

Privacy-focused website and application telemetry

analytics google-analytics logging php statistics

Last synced: 17 Jul 2025

https://github.com/democritus-project/d8s-stats

Democritus functions for working with statistics.

democritus python statistics statistics-utility utility

Last synced: 01 Apr 2025

https://github.com/stdlib-js/stats-strided-sstdevch

Calculate the standard deviation of a single-precision floating-point strided array using a one-pass trial mean algorithm.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 16 May 2026

https://github.com/mine-cetinkaya-rundel/teach-ds-wsc-2021

Materials for the Teaching data science conference at WSC 2021

data-science rstats statistics

Last synced: 08 Apr 2025

https://github.com/ccrisc/metaanalysis

A meta-analysis whether an increase in minimum wage compress wage inequality.

data-analysis-r learn meta-analysis statistics

Last synced: 23 Aug 2025

https://github.com/mdequeljoe/statsday

understanding financial accounts display at 2018 statistics day

chord-diagram data-visualization financial-markets statistics

Last synced: 08 Apr 2025

https://github.com/hendersontrent/qldyjcost

Simple R package for costs and calculations of youth offending in Queensland, Australia

cost-benefit cost-benefit-analysis econometrics economics evaluation statistics

Last synced: 08 Apr 2025

https://github.com/ilke-kas/multivariate-data-analysis

A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.

classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics

Last synced: 05 Oct 2025

https://github.com/ndomah/mathematics

Materials for computer science mathematics prerequisites.

discrete-mathematics linear-algebra probability statistics

Last synced: 18 Jan 2026

https://github.com/RedInfinityPro/InvestorGenius

Rating: (5/10) InvestorGenius is a simulation project that simulates investment market dynamics using random decisions and statistical analysis, providing insights into investment strategies and market dynamics.

matplotlib random statistics

Last synced: 30 Sep 2025

https://github.com/steffin12-git/logistic-regression-social-network-ads-ml

Built an interpretable Logistic Regression model to predict whether a user will purchase a product from social network ads using demographic and behavioral features. The notebook demonstrates a complete ML workflow — data ingestion, preprocessing, scaling, modeling, evaluation, and visual diagnostics.

matplotlib-pyplot pandas python seaborn sklearn statistics

Last synced: 03 May 2026

https://github.com/drorata/z-vs-ch2-tests

Comparing z-test and chi square test

statistics

Last synced: 05 Oct 2025

https://github.com/almeidaleandro28/port-proba-estatistica

3 semestre - PROBABILIDADE E ESTATISTICA PARA ANALISE DE DADOS

r statistics

Last synced: 01 Apr 2025

https://github.com/lavakin/covid_positivity_statistics

Statistics exercise

statistics

Last synced: 11 Mar 2025

https://github.com/mksingh431/r-programming-language

Free R programming notes pdf are provided here for R programming students so that they can prepare and score high marks in their R programming exam

note notes r static static-site-generator statistics

Last synced: 01 Apr 2025

https://github.com/cybcon/docker.dockerhubstats2mqtt

Collect repository statistics from Dockerhub and publish them to a MQTT topic.

docker-image dockerhub mosquitto-pub statistics

Last synced: 14 Apr 2026

https://github.com/maxbiostat/binarymarkovchains

Code to fit and explore two-state discrete-time Markov Chains (DTMCs)

markov-chain statistics

Last synced: 23 Jan 2026

https://github.com/ik5/tracepath

For those who trespass against us

golang graph plot plotting statistics

Last synced: 05 Oct 2025

https://github.com/mightymetrika/gifistab

A Stability Assessment Framework

datascience r research statistics

Last synced: 05 Oct 2025

https://github.com/mkstratos/detectable_climate

Design and test improvements to MVK from evv4esm

climate climate-model-evaluation climate-modelling statistics

Last synced: 25 Oct 2025

https://github.com/andreypomortsev/statistical-list.pop-performance-evaluation

This repository contains a performance comparison of list.pop() versus list.pop(-2) in Python. The project involves measuring execution times of these list operations, performing statistical tests to evaluate the significance of differences, and visualizing the results using histograms and box plots.

jupyter-notebook scipy statistics visualization

Last synced: 06 Jan 2026

https://github.com/jedrzejszelc/my_projects

A collection of Jedrzej (Andrew) Szelc's projects in Python, Robotframework, SQL and R languages.

machine-learning python3 rlanguage robotframework sql statistics xml xml-parser

Last synced: 18 May 2026

https://github.com/eubrunoo/beer-consumption-predictor

An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.

data-analysis data-science data-visualization machine-learning r statistical-analysis statistics

Last synced: 02 Apr 2025

https://github.com/josepablodmg/python--linear-regression---housing-exercise

A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.

california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization

Last synced: 05 Oct 2025

https://github.com/thomasx-0/env223

Research project measuring the affects of an increase in beef consumption on degradation of the amazon rainforest

environment project r statistics

Last synced: 02 Apr 2025

https://github.com/m-clark/connections

connections among various statistical methods

graph statistical-methods statistical-models statistics

Last synced: 23 Mar 2025

https://github.com/jw3126/bernsteinvonmises.jl

Visualizations of the Bernstein–von Mises theorem

bayesian-statistics information-geometry julia statistics

Last synced: 12 Apr 2026

https://github.com/danpoynor/python-number-guessing-game-with-stats

A number guessing game written in Python 3 that presents median, mode, and mean statistics

console-game data-analysis number-guessing-game python3 statistics

Last synced: 26 May 2026

https://github.com/papposilene/podstats-lddm

Code-source du site de statistiques du podcast Les Démons du MIDI.

laravel7 podcast statistics

Last synced: 23 Feb 2026