An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/gabboraron/biostatisztika_es_alkalmazasai

"A statisztika a matematika azon ága, melynek feladata, hogy eszközt adjon a politikusok kezébe, mellyel tetszőleges állítás és annak ellentéte is tudományos alapon igazolható"

biostatistics data-analysis data-visualization r statistics statistics-course

Last synced: 24 Oct 2025

https://github.com/sharmas1ddharth/10_days_of_statistics_hackerrank

The code in this repository is the solution of HackerRank's 10 day of statistics challenge problems.

10daysofstatistics hackerrank hackerrank-solutions r statistics

Last synced: 07 Oct 2025

https://github.com/mrtkp9993/datascienceexamples

Data science examples with Python and Julia.

data-science datascience julia-language python3 statistics veri-bilimi

Last synced: 10 May 2026

https://github.com/kashyap-prabhat/sigma

A Scala library for probability and statistics formulas, including rules for probability calculations.

data formulas library mathematics probability scala statistics

Last synced: 06 Oct 2025

https://github.com/jo-tham/geosample

Generate representative sample locations from spatial data

gis spatial-analysis statistics

Last synced: 22 Jan 2026

https://github.com/gfyoung/statwrappers

Useful wrapper classes around Python stat library functionality

python statistics wrapper

Last synced: 28 Aug 2025

https://github.com/stolsky/popular-baby-names

Interactive app listing popular baby names by year

baby-names birth-rates d3js destatis-data statistics

Last synced: 14 Oct 2025

https://github.com/eva-kaushik/probnetx

ProbNetX is a research-grade Bayesian network framework that learns conditional dependencies from discrete datasets and performs exact inference for predicting missing values.

algorithms-and-data-structures machne-learning naive-bayes-classifier statistics

Last synced: 14 Oct 2025

https://github.com/josepablodmg/python--linear-regression---housing-exercise

A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.

california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization

Last synced: 05 Oct 2025

https://github.com/palewire/ipsos-credibility-interval

A Python tool that calculates Bayesian credibility intervals for online polling using the Ipsos method

bayesian data-journalism journalism news polling python statistics

Last synced: 19 Feb 2026

https://github.com/ik5/tracepath

For those who trespass against us

golang graph plot plotting statistics

Last synced: 05 Oct 2025

https://github.com/steffin12-git/logistic-regression-social-network-ads-ml

Built an interpretable Logistic Regression model to predict whether a user will purchase a product from social network ads using demographic and behavioral features. The notebook demonstrates a complete ML workflow — data ingestion, preprocessing, scaling, modeling, evaluation, and visual diagnostics.

matplotlib-pyplot pandas python seaborn sklearn statistics

Last synced: 03 May 2026

https://github.com/ilke-kas/multivariate-data-analysis

A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.

classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics

Last synced: 05 Oct 2025

https://github.com/stdlib-js/stats-strided-smskrange

Calculate the range of a single-precision floating-point strided array according to a mask.

dispersion domain extent extremes javascript math mathematics max maximum min minimum node node-js nodejs range statistics stats stdlib strided strided-array

Last synced: 30 Apr 2026

https://github.com/ccrisc/metaanalysis

A meta-analysis whether an increase in minimum wage compress wage inequality.

data-analysis-r learn meta-analysis statistics

Last synced: 23 Aug 2025

https://github.com/mohdrasmil7/ml-notebooks

This repository contains Jupyter notebooks demonstrating machine learning exercises using both supervised classification and unsupervised algorithms. Each notebook offers a hands-on approach to understanding and applying ML techniques to real-world datasets, providing valuable insights and practical skills for data analysis and predictive modeling.

machine-learning-algorithms neural-networks statistics supervised-learning unsupervised-learning

Last synced: 15 Oct 2025

https://github.com/lvmalware/lsm-module

A simple statistics module, which provides 4 basic types of regressions using the Least Squares Method (LSM)

data-analysis least-square-regression regression regression-analysis statistics

Last synced: 31 Mar 2025

https://github.com/tyriek-cloud/statistical-work-sample

The purpose of this study is to observe if a sample of people that has siblings is independent of a sample of people that possess an opinion of whether patients with incurable diseases should be allowed to die.

analysis data spss statistics t-test

Last synced: 22 Jan 2026

https://github.com/connorodea/statistics-toolkit-cli

📊 A comprehensive command-line statistics learning tool with step-by-step explanations

cli edtech education learning mathematics python statistics

Last synced: 14 Jan 2026

https://github.com/aayushwankhade/z

z is a versatile programming language known for its simplicity and ease of use in developing web applications. With a strong focus on clean, readable code and efficient performance, z is ideal for both beginner and experienced developers looking to create high-quality software.

apache chatgpt data-engineering data-science data-visualization fish free game immutability machine-learning pattern-matching python statistics zwave

Last synced: 07 Sep 2025

https://github.com/jhrcook/demeter2-stan

The DEMETER2 model of the impact of shRNA on cell line growth in Stan.

bayesian bioinformatics biostatistics demeter2 stan statistics

Last synced: 18 Oct 2025

https://github.com/shenxianpeng/gitstats-action

GitHub Action that generates insightful visual reports from Git repositories using GitStats

composite-action git git-stats github-actions report statistics

Last synced: 27 May 2026

https://github.com/kddubey/microarray-kaggle

Analyze a dataset with 72 observations and 7,129 features

machine-learning statistics

Last synced: 08 Apr 2025

https://github.com/gastonstat/rcompendium

Comprehensive collection of slides about R

data-science introduction-to-r r-language r-programming slides statistics

Last synced: 24 Feb 2026

https://github.com/cesar312/python-data-science-toolbox

A collection of useful data science tools and techniques

data-science jupyter-notebook pandas python scikit-learn statistics visualization

Last synced: 13 Apr 2026

https://github.com/olekscode/statisticseconometrics

My solutions to the assignments from Elements of Statistics, Econometrics, and Time Series Analysis course at UCU

course econometrics homework r statistics time-series

Last synced: 06 Jul 2025

https://github.com/pordarman/ultimate-stat-bot

A comprehensive statistics bot for Discord servers that tracks user voice and message activity. Features detailed leaderboards, personal stat lookups (!me), channel analysis, a blacklist system, and persistent data storage with MongoDB.

analytics-bot discord-bot discord-js discordjs-v14 javascript mongodb nodejs statistics statistics-bot

Last synced: 03 May 2026

https://github.com/baschin1103/machine-learning-linear-regression-tsi

The model is the multiple linear regression loaded from scikit-learn. The aim is to create a model to forecast the so called Total-Skill-Index (TSI) of a player from hattrick.org. More information on: https://wiki.hattrick.org/wiki/Total_Skill_Index. In this repository is the programm and necessary data.

csv linear-regression machine-learning numpy pandas python statistics

Last synced: 11 May 2026

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/alan-y/blogdown-website

This is my personal website and blog built using the blogdown R package and deployed with Netlify.

r statistics

Last synced: 27 May 2026

https://github.com/andrii-zapukhlyi/otomoto_visualization

Scraping, data visualization, and building a price prediction model with data from the car classifieds website otomoto.pl

data-analysis machine-learning r scraping statistics visualization

Last synced: 26 Jul 2025

https://github.com/dmarks84/coursework_project_text-mining-spam-analysis

Project for University of Michigan Applied Data Science Specialization -- Performed NLP in order to build features of email messages; trained various classification models to help predict if a message was spam.

classification databases eda nlp numpy pandas python scikit seaborn sentiment-analysis statistics supervised-ml text-mining unsupervised-ml visualization

Last synced: 11 Apr 2026

https://github.com/g4brielvs/etudes

:robot: My collection of études

data-science etudes mathematics pytudes statistics teaching

Last synced: 23 Mar 2025

https://gitlab.com/prebens-phd-adventures/universal-edit-distance

A small Python library containing some generic metrics implemented in Rust

automatic speech recognition (ASR) metrics python rust statistics

Last synced: 07 May 2026

https://github.com/mituskillologies/krai-sppu-mca

The repository contains all the practicals of subject "Knowledge Representation in Artificial Intelligence" subject of MCA under Savitribai Phule Pune University, Pune. Programmed by Tushar B. Kute.

artificial-intelligence artificial-neural-networks convolutional-neural-network decision-trees deep-learning machine-learning neural-networks recurrent-neural-networks statistics support-vector-machine

Last synced: 06 Jul 2025

https://github.com/wattyven/opencovidca-dashboard

A Jupyter notebook for making quick and dirty visualizations of Canadian COVID statistics using data from the OpenCOVID API. (API unfortunately deprecated)

canada-covid-19 covid-19 data-visualization jupyter jupyter-notebook statistics

Last synced: 14 Mar 2025

https://github.com/maxbiostat/binarymarkovchains

Code to fit and explore two-state discrete-time Markov Chains (DTMCs)

markov-chain statistics

Last synced: 23 Jan 2026

https://github.com/sofiia-chorna/estimation-et-identification-statistique-project

Estimation et identification statistique - Course Project (MA201)

matlab statistics

Last synced: 23 Jun 2026

https://github.com/callmemaverick/deyields_mro

Interactive Streamlit app analyzing German 10-Year Bond Yields & ECB’s MRO Rate with visualizations, regression analysis, and correlation insights.

data-science python python3 statistics

Last synced: 16 May 2025

https://github.com/gyf9712/stat-theory-skills

A 4-skill pipeline for Claude Code: verify mathematical proofs → repair with literature support → sharpen the theory → write corrected proofs. Integrates Codex MCP for adversarial cross-review. Venue-audited reference library across statistics/econometrics/ML theory.

claude-code claude-skills econometrics latex machine-learning-theory mathematical-proof proof-verification research-tools statistics theoretical-statistics

Last synced: 27 May 2026

https://github.com/bt-88/deltasight-statistics

Provides efficient tracking of common statistical descriptors (mean, st. dev., sum, count) of a changing numeric sample

statistics

Last synced: 14 Jan 2026

https://github.com/lavakin/covid_positivity_statistics

Statistics exercise

statistics

Last synced: 11 Mar 2025

https://github.com/murilobsd/capybara

:neckbeard: Capybara: C DataFrames for geeks who like numbers. :alien:

analytics bigdata c-plus-plus dataframe geeksforgeeks pandas statistics

Last synced: 16 Apr 2026

https://github.com/rakibhhridoy/differentprojects

Some of my learning projects that I practice to launch in data science. Not all, but some of few that was stored in my local repository. It can be useful for beginner data science enthusiast. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 11 Apr 2026

https://github.com/conjfrnk/statistics-projects

Some projects I worked on for AP statistics (2020-2021)

ap-statistics math probability statistics

Last synced: 26 Oct 2025

https://github.com/thechibo/estim

Distribution Parameter Estimation - DEPRECATED - please use https://github.com/thechibo/joker

estimation maximum-likelihood-estimation moment-estimation probability-distribution r r-package statistics

Last synced: 27 May 2026

https://github.com/genietim/ache-analyzer

Principal component and other statistical analysis to detect correlations to aches

ache fitbit health statistics weather

Last synced: 08 Apr 2025

https://github.com/akimuddinshaikh/statistics-project-1

find the best combination of multiple linear regression models to estimate the predicted cancer-related mortality rate per county using the county's cancer- increased incidence and accessible macroeconomic data.

linear-regression statistics

Last synced: 26 Oct 2025

https://github.com/bessarodrigo/hypothesis_test_healthy_program

Teste de Hipóteses da média de uma população para avaliar o nível de colesterol dos colaboradores de uma empresa.

hypothesis-test hypothesis-testing hypothesis-tests python statistics

Last synced: 02 May 2026

https://github.com/spacebakery/variance-in-weather-project

Statistics for Data Analysis | Variance and Standard Deviation

data-analysis python standard-deviation statistics variance

Last synced: 05 Jul 2025

https://github.com/nanotubing/statistics

Spatial Statistical analyses created using R and RStudio for an "Advanced Statistics for Urban Applications" at Temple University

autocorrelation geographically-weighted-regression r spatial-autocorrelation statistics

Last synced: 11 Mar 2026

https://github.com/pawel-slowik/git-stats

Display contributor statistics for a Git repository.

git statistics

Last synced: 27 Apr 2026

https://github.com/stdlib-js/stats-strided-smaxabssorted

Calculate the maximum absolute value of a sorted single-precision floating-point strided array.

absolute absolute-value domain extent extremes javascript magnitude math mathematics max maxabs maximum node node-js nodejs range sorted statistics stats stdlib

Last synced: 27 Apr 2026

https://github.com/stdlib-js/stats-strided-snanmax

Calculate the maximum value of a single-precision floating-point strided array, ignoring NaN values.

array domain extent extremes float32 javascript math mathematics max maximum node node-js nodejs range statistics stats stdlib strided strided-array typed

Last synced: 27 Apr 2026

https://github.com/drkane/area-profiles

Produce UK area profiles based on various data sources

dash-plotly data flask statistics uk

Last synced: 27 Apr 2026

https://github.com/jurgenjacobsen/dsc.stats

A easy to use statistics package mainly for discord bots.

bots discord statistics stats

Last synced: 27 Apr 2026

https://github.com/stdlib-js/stats-strided-dsmean

Calculate the arithmetic mean of a single-precision floating-point strided array using extended accumulation and returning an extended precision result.

arithmetic-mean array average avg central-tendency float float32 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed

Last synced: 28 Apr 2026

https://github.com/vnery5/projetos

Repositório para abrigar alguns projetos pessoais feitos no intuito de aprender um pouco de Estatística e Machine Learning

machine-learning python statistics

Last synced: 28 Apr 2026

https://github.com/dadosdelaplace/dadosdelaplace.github.io

Web/blog de Javier Álvarez Liébana (@dadosdelaplace)

blog dataviz divulgacion programming r statistics website

Last synced: 28 Apr 2026

https://github.com/frankfont/rawloadtester

This javascript/PHP application calls the URL you select as many times as you choose and tells you how long it took the server to respond.

javascript php-log qatools statistics

Last synced: 28 Apr 2026

https://github.com/simranjeet97/data-science

Data Science Programs Based on Mathematics, Statistics and Machine Learning.

algrothm data-science machine-learning mathematics python3 statistics

Last synced: 08 Jun 2026

https://github.com/stdlib-js/stats-strided-dnanmean

Calculate the arithmetic mean of a double-precision floating-point strided array, ignoring NaN values.

arithmetic-mean array average avg central-tendency double float64 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed

Last synced: 28 Apr 2026

https://github.com/stdlib-js/stats-strided-nanmeanwd

Calculate the arithmetic mean of a strided array, ignoring NaN values and using Welford's algorithm.

arithmetic-mean array average avg central-tendency javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array welford

Last synced: 28 Apr 2026

https://github.com/johanneswiesner/statplot

A repository for creating publishable annotated plots

annotations plotting seaborn statistics

Last synced: 08 Jun 2026

https://github.com/antoniojcosta/probability-and-statistics-exercises

Probability and statistics exercises solved using python

excel jupyter-notebook matplotlib probability python statistics

Last synced: 28 Apr 2026

https://github.com/codiepp/reflections

ᴚэflэctюns, in R

r-language statistics

Last synced: 28 Apr 2026

https://github.com/boldandbrad/py-elo-db

python tool for calculating and storing elo values and statistics for two sided match-based games

elo-rating statistics

Last synced: 22 Jun 2026

https://github.com/rpodcast/opensub-ness2025

R-based Submissions talk at NESS 2025

life-sciences r statistics

Last synced: 29 Apr 2026

https://github.com/ouhscbbmc/randomrpql

A variable selection method for generalize linear mixed-effect models

glmm r-package statistics

Last synced: 29 Apr 2026

https://github.com/misha-mayskiy/lootbox_analytics

Lootbox Analytics: Your personal dashboard for tracking and analyzing lootbox/gacha opening statistics from popular games. Currently supports Genshin Impact with detailed Pity/luck analysis. (Python, Flask, SQLAlchemy)

chartjs data-visualization flask gacha game-analytics genshin-impact pity-tracker python sqlalchemy statistics

Last synced: 29 Apr 2026

https://github.com/stdlib-js/stats-array-nanmeanwd

Calculate the arithmetic mean of an array, ignoring NaN values and using Welford's algorithm.

arithmetic-mean array average avg central-tendency domain extent javascript math mathematics mean node node-js nodejs statistics stats stdlib

Last synced: 29 Apr 2026

https://github.com/v-mayya/programming-statistics-and-econometrics-resources

Programming, statistics and econometrics resources

econometrics programming python r statistics

Last synced: 29 Apr 2026

https://github.com/bilgeswe/datascience

Using Statistics and Data Science Project in Python using Google Colab tools, CVS and XLSX

box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx

Last synced: 29 Apr 2026

https://github.com/stdlib-js/stats-strided-scumax

Calculate the cumulative maximum of single-precision floating-point strided array elements.

accumulate cumulative domain extent extremes javascript math mathematics max maximum node node-js nodejs range statistics stats stdlib strided strided-array typed

Last synced: 29 Apr 2026

https://github.com/jxsl13/playerranking

A working draft of a ranking server for player statistics using either Redis or SQLite3 with automatic reconnect handling and backlogging of tasks, that were not saved during database outage times. This is mainly used in the zCatch repository.

cpp database player rank ranking redis sqlite sqlite3 statistics

Last synced: 29 Apr 2026

https://github.com/jose-jaen/amazon-nlp

Using NLP algorithms and Machine Learning modeling a Sentiment Classifier is built for Amazon Reviews

algorithms big-data machine-learning nlp python r statistics supervised-learning

Last synced: 29 Apr 2026