An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/zajichek/zajichek

Source code for my personal/professional website

consulting data-science statistics website

Last synced: 21 Feb 2026

https://github.com/nakshjainsonigara/football-eda

This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.

dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics

Last synced: 07 Jan 2026

https://github.com/gnikolovski/projects_stats

Drupal 8/9 - Projects Stats provides a block, which displays a table or a list with project names and downloads count

drupal drupal-8 drupal-9 drupal-module statistics

Last synced: 05 Aug 2025

https://github.com/cleoold/linearly_varying_binomial_distribution_calcs_python

a "binomial" distribution with linearly increasing chance.

probability python-c-extension statistics

Last synced: 16 Mar 2025

https://github.com/bernhardangerer/gpx-stats-helper

The "GPX stats helper" library allows reading from GPX files (GPS data) and calculates a lot of helpful parameters

geocoding gps-data gpx gpx-library gpx-reader java java-11 openstreetmap reverse-geocoding statistics

Last synced: 03 Jan 2026

https://github.com/indianajaune/suicidator

Mini data science software for suicide statistics

c darknet data-science libcsv naivebayes statistics

Last synced: 09 Apr 2025

https://github.com/bozenne/lavareduce

Latent variable models with linear predictors

latent-variable-models lava-r-package r statistics

Last synced: 15 Jan 2026

https://github.com/lazernata/transport-problem

Bachelor's Thesis Work: Shiny app to solve the transport problem. Available in Spanish and English

operational-research rstudio shinyapps statistics

Last synced: 31 Mar 2025

https://github.com/0xwdg/appsterdam-events-history

previous Appsterdam events

appsterdam meetup meetups statistics

Last synced: 08 Sep 2025

https://github.com/dmarks84/coursework_project_sentiment-analysis

Project for University of Michigan Python Programming Specialization -- Read in tweets and analyzed their content to perform basic sentiment analysis

classification programming python sentiment-analysis statistics web-scraping

Last synced: 09 Apr 2025

https://github.com/genietim/ache-analyzer

Principal component and other statistical analysis to detect correlations to aches

ache fitbit health statistics weather

Last synced: 08 Apr 2025

https://github.com/smac-group/introds

The objective of this R package is to provide a support for the course entitled "Introduction to Data Science" given at University of Geneva. This course is intended to provide an introduction to statistical programming using the R language.

data-science programming r statistics

Last synced: 02 Apr 2025

https://github.com/bt-88/deltasight-statistics

Provides efficient tracking of common statistical descriptors (mean, st. dev., sum, count) of a changing numeric sample

statistics

Last synced: 14 Jan 2026

https://github.com/pabsan-0/vfs2

Vectorial Mutual-Information based feature selection

feature-selection mutual-information repos-ml statistics

Last synced: 17 Mar 2025

https://github.com/tsu2000/aqw_guides

Web application for certain statistics in the MMORPG AdventureQuest Worlds.

mmorpg modelling simulation statistics

Last synced: 10 Jun 2025

https://github.com/schw4b/titanic

A reanalysis of the Titanic data set in R and Quarto.

quarto r statistics

Last synced: 25 Mar 2025

https://github.com/olafhaag/c3d-statistics

Analyze conditional values of C3D data generated by Phasespace Impulse X2 motion capture system

c3d motion-capture phasespace statistics

Last synced: 14 Jun 2025

https://github.com/dmarks84/coursework_project_nlp-with-nltk

Project for University of Michigan Applied Data Science Specialization -- Utilized NLTK library to process natural language, and then built several spelling recommenders for a list of misspelled words.

data-modeling databases dataframes eda nlp numpy pandas python reporting statistics text-mining visualization

Last synced: 13 Apr 2026

https://github.com/joekakone/inferential-statistics-with-r

Statistique Inférentielle avec R

inferential-statistics r statistics

Last synced: 30 May 2026

https://github.com/xstupi00/Theoretical-Assignments

Elaborated projects with theoretical assignments during the master's degree.

automata complexity information-security markov-chain petri-nets statistics storm vut vut-fit

Last synced: 11 Mar 2025

https://github.com/domingosdeeulariadumba/ablisk

A Python module for design, analysis and decision-making of A/B tests.

ab-testing data-visualization statistics

Last synced: 14 Jan 2026

https://github.com/mightymetrika/holi

holi: Higher Order Likelihood Inference Web Applications

data data-science r statistics

Last synced: 10 Feb 2026

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 13 Apr 2026

https://github.com/windi-wulandari/sentiment-analysis-imdb

This project analyzes sentiment from 50,000 IMDb movie reviews, aiming to enhance time and cost efficiency. The Naive Bayes model with TF-IDF yielded the best results, achieving up to 99% savings in time and cost, surpassing the initial goals.

imdb-dataset machine-learning sentiment-analysis sentiment-classification statistics supervised-learning

Last synced: 17 Mar 2025

https://github.com/kddubey/microarray-kaggle

Analyze a dataset with 72 observations and 7,129 features

machine-learning statistics

Last synced: 08 Apr 2025

https://github.com/connorodea/statistics-toolkit-cli

📊 A comprehensive command-line statistics learning tool with step-by-step explanations

cli edtech education learning mathematics python statistics

Last synced: 14 Jan 2026

https://github.com/steffin12-git/logistic-regression-social-network-ads-ml

Built an interpretable Logistic Regression model to predict whether a user will purchase a product from social network ads using demographic and behavioral features. The notebook demonstrates a complete ML workflow — data ingestion, preprocessing, scaling, modeling, evaluation, and visual diagnostics.

matplotlib-pyplot pandas python seaborn sklearn statistics

Last synced: 03 May 2026

https://github.com/ik5/tracepath

For those who trespass against us

golang graph plot plotting statistics

Last synced: 05 Oct 2025

https://github.com/jedrzejszelc/my_projects

A collection of Jedrzej (Andrew) Szelc's projects in Python, Robotframework, SQL and R languages.

machine-learning python3 rlanguage robotframework sql statistics xml xml-parser

Last synced: 18 May 2026

https://github.com/josepablodmg/python--linear-regression---housing-exercise

A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.

california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization

Last synced: 05 Oct 2025

https://github.com/agbarnett/medianwatch

My blog "Median Watch"

blog metascience statistics

Last synced: 23 Feb 2026

https://github.com/rreece/statistics-notebooks

Ryan's statistics notebooks

hypothesis-testing statistics

Last synced: 06 Oct 2025

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/alexhallam/wayne

🤠Trade a formula for a model matrix🤠

design-matrix formula model-matrix polars statistics

Last synced: 07 Oct 2025

https://github.com/sharmas1ddharth/10_days_of_statistics_hackerrank

The code in this repository is the solution of HackerRank's 10 day of statistics challenge problems.

10daysofstatistics hackerrank hackerrank-solutions r statistics

Last synced: 07 Oct 2025

https://github.com/aosousa/movie-rating-analyzer

Golang script to analyze my movie ratings and provide statistical information such as the average and mean of ratings

average excel film films go golang movies script statistics stats

Last synced: 14 Jan 2026

https://github.com/josephmars/change_point_detection

App to run statistical test on Change Point Detection

statistics

Last synced: 21 Jan 2026

https://github.com/jmetrikat/github-stats

Better GitHub statistics images for your profile.

python statistics visualization

Last synced: 08 Oct 2025

https://github.com/ndomah1/learning-probability-and-statistics

This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.

correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics

Last synced: 18 Jan 2026

https://github.com/preritdas/large-numbers

Repository for an online deployed law of large numbers and standard distribution simulation.

math random simulation statistics

Last synced: 09 Oct 2025

https://github.com/dadosdelaplace/dadosdelaplace

About me: mathematician, PhD Stats, Assistant Professor and scicomm

biostatistics compositional-data data-science quarto r-packages statistics teaching-materials

Last synced: 19 Jan 2026

https://github.com/milad-rasouli/toker

Toker is a lightweight app that clones GitHub projects and analyzes their codebase with Tokei, offering insightful statistics and details about project structure.

code-analysis code-metrics statistics tokei

Last synced: 14 Jan 2026

https://github.com/quanticpony/clothespin-probability-distribution

A small problem of a probability distribution of clothes pins along a string.

challenges-solved python simulation statistics

Last synced: 10 Oct 2025

https://github.com/fauzancodes/covariance-calculator

Covariance Calculator

covariance statistics

Last synced: 23 Feb 2026

https://github.com/mariemekmr/inventaire-app

Application web d’inventaire développée avec Django et MariaDB/MySQL pour gérer les produits, ventes, dettes et statistiques d’un magasin, avec intégration Cloudinary et Remove.bg pour les images.

application bootstrap cloudinary django gestion-boutique inventaire mariadb mysql removebg sales statistics

Last synced: 09 Apr 2026

https://github.com/montanaz0r/testing-if-mma-math-deduction-works-using-ufc-fighters-data

The probabilistic reasoning about phenomenon called MMA math using UFC fighters data and Python.

bayesian-inference data-analysis data-science graphviz jupyter-notebook pandas python scipy statistics

Last synced: 14 Apr 2026

https://github.com/paragpvyas/projects

good, sometimes ugly, functional code. First program to mine frequent patterns, a smart pill organizer python app, finally testing global randomness presented by RNGs

embedded-systems frequent-pattern-mining guizero infrared-sensors java json mime mqtt-protocol normalization-score paho-mqtt python random-number-generators raspberry-pi sms-api state-machine statistics

Last synced: 11 Apr 2026

https://github.com/jo-tham/geosample

Generate representative sample locations from spatial data

gis spatial-analysis statistics

Last synced: 22 Jan 2026

https://github.com/eva-kaushik/probnetx

ProbNetX is a research-grade Bayesian network framework that learns conditional dependencies from discrete datasets and performs exact inference for predicting missing values.

algorithms-and-data-structures machne-learning naive-bayes-classifier statistics

Last synced: 14 Oct 2025

https://github.com/egjfour/dsti-course-notes

Notes for classes taken at DSTI stored in an Obsidian vault and backed up to Github. Includes notes for all courses taken during my Master's program

aws calculus cloud graph law linear-algebra mlops neo4j optimization-algorithms owl-ontology project-management rdf software-engineering sql statistics

Last synced: 19 Apr 2026

https://github.com/coatless-textbooks/timeseriesisgreat

Notes from my odyssey in Time Series

bookdown notes r statistics time-series

Last synced: 15 Oct 2025

https://github.com/hamburgj/survivor-stats

Interactive visualization of Survivor US contestant statistics and season data, as well as connection path finding.

data-visualization graph interactive-visualizations react reactjs statistics survivor

Last synced: 16 Apr 2026

https://github.com/crodriguez1a/ml-questions-daily

A collection of Machine Learning Q&A, ranging from fundamentals to bleeding-edge topics

deep-learning linear-algebra machine-learning machine-learning-algorithms python statistics

Last synced: 16 Oct 2025

https://github.com/nomeyho/skype-analyzer

Get insights on your Skype conversations

analyze backup chat conversation export insight skype statistics

Last synced: 17 Oct 2025

https://github.com/stdlib-js/stats-min-by

Compute the minimum value along one or more ndarray dimensions according to a callback function.

domain extent extremes javascript math mathematics min minimum ndarray node node-js nodejs range statistics stats stdlib

Last synced: 14 Apr 2026

https://github.com/samjuk/pubgstats

A webapp that allows you to view your stats for Player Unknown's Battlegrounds

php pubg pubgtracker statistics

Last synced: 19 Oct 2025

https://github.com/tomkyle/binning

Determine optimal number of bins 𝒌 for histogram creation and optimal bin width 𝒉 using various statistical methods.

binning data-analysis distributions doanes-rule freedman-diaconis histogram histogram-binning math php-math rice-rule scotts-rule square-root statistics sturges-rule terrell-scotts-rule

Last synced: 21 Oct 2025

https://github.com/dcs-training/spatial_dynamics

Use of QGIS and R to analyse first and second order geospatial effects. Go to the Readme file

data-analysis geographical-data gis qgis r statistics

Last synced: 23 Oct 2025

https://github.com/andrewrporter/text-stats

Displays various text statistics for the currently opened document in Visual Studio Code

extension nodejs statistics typescript visual-studio-code vscode vscode-extension

Last synced: 12 Apr 2026

https://github.com/indy2222/gitstats

Statistics from a Git repository

cli code git python python3 statistics stats

Last synced: 29 Apr 2026

https://github.com/mkstratos/detectable_climate

Design and test improvements to MVK from evv4esm

climate climate-model-evaluation climate-modelling statistics

Last synced: 25 Oct 2025

https://github.com/papposilene/podstats-lddm

Code-source du site de statistiques du podcast Les Démons du MIDI.

laravel7 podcast statistics

Last synced: 23 Feb 2026

https://github.com/agbarnett/decimal.places

How many decimal places are used in abstracts?

decimal-places journals statistics

Last synced: 25 Oct 2025

https://github.com/welpo/srm

Precise Sample Ratio Mismatch calculator.

ab-testing experimentation sample-ratio-mismatch srm statistics

Last synced: 25 Oct 2025

https://github.com/conjfrnk/statistics-projects

Some projects I worked on for AP statistics (2020-2021)

ap-statistics math probability statistics

Last synced: 26 Oct 2025

https://github.com/m-jahn/europe-by-numbers

The content of my blog, https://europebynumbers.wordpress.com/.

blog computational-biology europe r-markdown statistics

Last synced: 06 Feb 2026

https://github.com/iankitnegi/statisticalexcelence

Welcome to Statistify! This repository is dedicated to sharing my learning journey in statistics as it applies to data science. Here, you'll find notes, code snippets, and resources that I find useful. Let's dive into the world of data together and uncover statistical insights!

msexcel statistical-analysis statistics

Last synced: 28 Jan 2026

https://github.com/queelius/algebraic.dist

R package: Algebra over distributions (random elements) with automatic simplification to closed forms

data-science distributions monte-carlo probability r-package statistics

Last synced: 25 Feb 2026

https://github.com/hwahyeon/knou-statistics

A project to study both completed and ongoing courses in the Department of Statistics and Data Science

data-science review statistics

Last synced: 29 Jan 2026

https://github.com/digital-wellbeing/paradigm-comments

Commentary on proposed new paradigm(s) in social media effects research

psychology social-media statistics well-being

Last synced: 30 Jan 2026

https://github.com/beliavsky/fortran-stuff

Various Fortran modules

fortran statistics

Last synced: 31 Jan 2026

https://github.com/sschrs/goplotter

A Package For Plotting in GoLang

graphics plot plotting statistics visualization visualize-data

Last synced: 07 Feb 2026

https://github.com/imker25/gpsa

This is a simple command line tool that helps to extract data for statistical analysis out of track files like *.gpx

csv-export gps gpx-parser json-export statistics tcx-parser

Last synced: 07 Feb 2026

https://github.com/dadosdelaplace/docencia

Repository with teaching material at Complutense University

data-science quarto r-programming statistics supervised-learning teaching-materials

Last synced: 08 Feb 2026

https://github.com/josericodata/statisticsapp

Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.

alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test

Last synced: 26 Feb 2026

https://github.com/rahulbhadani/statistical-sauce

A curated list of definitions and concepts from statistics

statistics

Last synced: 09 Feb 2026

https://github.com/haroontrailblazer/machine_learning

About This Repository A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.

data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics

Last synced: 16 Apr 2026

https://github.com/venkat-a/olympiananalytics

OlympianAnalytics uncovers Olympic trends, focusing on medal distribution by region, gender, and athlete attributes. It highlights dominance shifts and the role of inclusivity in sports, offering key insights through data visualization.

data-engineering data-visualization statistics tableau

Last synced: 10 Feb 2026

https://github.com/antaldaniel/music-indicators-description

Indicators in Open Music Europe

indicators music statistics

Last synced: 12 Feb 2026

https://github.com/pottekkat/best-stats-you-have-ever-seen

These are the best stats you've ever seen. Website offline.

data-science data-visualization graphs open-source statistics

Last synced: 28 Feb 2026