An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/xionlightt/stats-strided-sdsmean

Calculate the arithmetic mean of a single-precision floating-point strided array using extended accumulation.

arithmetic-mean average avg central-tendency float float32 javascript mathematics nodejs statistics stdlib strided strided-array typed

Last synced: 01 May 2026

https://github.com/stdlib-js/stats-strided-sstdevyc

Calculate the standard deviation of a single-precision floating-point strided array using a one-pass algorithm proposed by Youngs and Cramer.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 01 May 2026

https://github.com/komangandika/time-series

Im really interested in the world of trading, so why not create a repo dedicated to Time-Series

quant statistics time-series

Last synced: 15 Mar 2025

https://github.com/vladstudennikov/embedding-geometry

Influence of UMAP on clustering of text embeddings. Calculating spherical disperancy of text embeddings.

clustering dispensary manifold-learning ml nlp statistics umap

Last synced: 26 Jun 2026

https://github.com/chriso345/golumn

In-Memory DataFrames for Go

data-science dataframe golang statistics

Last synced: 10 Oct 2025

https://github.com/fauzancodes/covariance-calculator

Covariance Calculator

covariance statistics

Last synced: 23 Feb 2026

https://github.com/quanticpony/clothespin-probability-distribution

A small problem of a probability distribution of clothes pins along a string.

challenges-solved python simulation statistics

Last synced: 10 Oct 2025

https://github.com/bhavnanahar/breast.cancer.detection

This project aims to build a machine learning model to predict breast cancer using a dataset containing various medical features

colab-notebook machine-learning python statistics

Last synced: 10 Oct 2025

https://github.com/lleonardus/analise-de-dados

Anotações da matéria de Análise de Dados do curso de Ciência da Computação da Estácio

jupyter-notebook statistics

Last synced: 09 Oct 2025

https://github.com/dkxce/umoneybynrate

Get YooMoney Currency Rate (BYN by default) with log Statistics

byn console cross-platform currency dkxce exchange grab html json rate statistics yoomoney

Last synced: 02 May 2026

https://github.com/montanaz0r/testing-if-mma-math-deduction-works-using-ufc-fighters-data

The probabilistic reasoning about phenomenon called MMA math using UFC fighters data and Python.

bayesian-inference data-analysis data-science graphviz jupyter-notebook pandas python scipy statistics

Last synced: 14 Apr 2026

https://github.com/dadosdelaplace/dadosdelaplace

About me: mathematician, PhD Stats, Assistant Professor and scicomm

biostatistics compositional-data data-science quarto r-packages statistics teaching-materials

Last synced: 19 Jan 2026

https://github.com/paragpvyas/projects

good, sometimes ugly, functional code. First program to mine frequent patterns, a smart pill organizer python app, finally testing global randomness presented by RNGs

embedded-systems frequent-pattern-mining guizero infrared-sensors java json mime mqtt-protocol normalization-score paho-mqtt python random-number-generators raspberry-pi sms-api state-machine statistics

Last synced: 11 Apr 2026

https://github.com/jpleitao/statistics-assignments

Assignements for the Statistics Course (Doctoral Program in Information Science and Technology)

r statistics

Last synced: 27 Jan 2026

https://github.com/exp98/statistical-information-processing-with-r

Практические задания по дисциплине "Методы статистической обработки информации" (1 курс магистратуры, Матмех, СПбГУ) + "Data Science: инструментарий и жизненный цикл проекта" (2 курс)

r statistics

Last synced: 09 Oct 2025

https://github.com/stdlib-js/stats-strided-snanmin

Calculate the minimum value of a single-precision floating-point strided array, ignoring NaN values.

array domain extent extremes float32 javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib strided strided-array typed

Last synced: 02 May 2026

https://github.com/sdttttt/huck

💾 Data collection server software. Configuration flexible, no external dependencies.

go http-server simple statistics

Last synced: 14 Jan 2026

https://github.com/preritdas/large-numbers

Repository for an online deployed law of large numbers and standard distribution simulation.

math random simulation statistics

Last synced: 09 Oct 2025

https://github.com/thertzlor/bandcampmonitor

A web interface that aggregates live Bandcamp sales for music discovery.

bandcamp music statistics vue3

Last synced: 19 Jan 2026

https://github.com/lgibson7/stat620_quizii_group_presentation

Group presentation for STAT 620 Probability and Statistics Theory, Cal State East Bay Fall 2021

binomial-distribution mgf probability statistics

Last synced: 19 Jan 2026

https://github.com/stdlib-js/stats-strided-dnanmeanpw

Calculate the arithmetic mean of a double-precision floating-point strided array, ignoring NaN values and using pairwise summation.

arithmetic-mean array average avg central-tendency javascript math mathematics mean node node-js nodejs pairwise pw statistics stats stdlib strided strided-array typed

Last synced: 02 May 2026

https://github.com/ndomah1/learning-probability-and-statistics

This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.

correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics

Last synced: 18 Jan 2026

https://github.com/josephmars/change_point_detection

App to run statistical test on Change Point Detection

statistics

Last synced: 21 Jan 2026

https://github.com/dcs-training/machinelearning

Introduction to Machine Learning with Python delivered by the centre in the year 2022-23. Go to the read me file

data-analysis data-wrangling machine-learning python statistics

Last synced: 08 Oct 2025

https://github.com/stdlib-js/stats-strided-smskmax

Calculate the maximum value of a single-precision floating-point strided array according to a mask.

domain extent extremes javascript mask masked masked-array math mathematics max maximum node node-js nodejs range statistics stats stdlib strided strided-array

Last synced: 02 May 2026

https://github.com/casaper/swiss_wastewater_covid_virus_load

R Notebook with Swiss covid waste water analysis cuves for different cities

covid-19 public-health statistics switzerland wastewater-surveillance

Last synced: 12 Oct 2025

https://github.com/jenspi/sustainabilitypal

University of New Orleans x Tulane Hackathon Spring 2025! Theme: Social Innovation or Sustainability

habit-tracking hackathon hackathon-project mobile-app mobile-development statistics sustainability tulaneuniversity universityofneworleans uno-hackathon-spring-2025

Last synced: 18 Jan 2026

https://github.com/saroshfarhan/islp

Solving exercises of Introduction to Statistical Learning with Python

data-science python regresssion statistics

Last synced: 02 May 2026

https://github.com/steviecurran/numerical-integration

Will plot and integrate a function over the desired range, in addition to giving the point estimates and credible intervals

credible-interval integration point-estimating poisson-gamma probability python statistics visualization

Last synced: 31 May 2026

https://github.com/wiso/countingworkspace

Create RooFit workspace for simple counting experiments

roofit roofit-workspace root-cern statistics

Last synced: 12 Mar 2026

https://github.com/miozilla/kmeansc

kmeansc :blossom::tulip::sunflower: : K-Means Clustering # scikit-learn framework # clustering model

centroids cluster k kmeans math matplotlib means sklearn statistics

Last synced: 18 May 2026

https://github.com/deliprofesor/kizbasina_odev4_trafik

Bu proje, 2005-2014 yılları arasında İngiltere’de gerçekleşen trafik kazalarına ait kapsamlı veri setlerini kullanarak trafik kazalarının sebeplerini, şiddetini ve zaman içindeki değişimini analiz etmektedir.

data-science data-visualization istatistik matplotlib pandas python statistics

Last synced: 14 Apr 2026

https://github.com/stdlib-js/blas-ext-base-ndarray-zsum

Compute the sum of all elements in a one-dimensional double-precision complex floating-point ndarray.

blas extended javascript math mathematics ndarray node node-js nodejs statistics stats stdlib sum summation total

Last synced: 02 May 2026

https://github.com/codenameyau/math

Fundamental formulas and theorems

algebra calculus geometry math probability statistics trigonometry

Last synced: 17 Feb 2026

https://github.com/rreece/statistics-notebooks

Ryan's statistics notebooks

hypothesis-testing statistics

Last synced: 06 Oct 2025

https://github.com/agbarnett/medianwatch

My blog "Median Watch"

blog metascience statistics

Last synced: 23 Feb 2026

https://github.com/josepablodmg/python--linear-regression---housing-exercise

A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.

california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization

Last synced: 05 Oct 2025

https://github.com/jedrzejszelc/my_projects

A collection of Jedrzej (Andrew) Szelc's projects in Python, Robotframework, SQL and R languages.

machine-learning python3 rlanguage robotframework sql statistics xml xml-parser

Last synced: 18 May 2026

https://github.com/mightymetrika/gifistab

A Stability Assessment Framework

datascience r research statistics

Last synced: 05 Oct 2025

https://github.com/steffin12-git/logistic-regression-social-network-ads-ml

Built an interpretable Logistic Regression model to predict whether a user will purchase a product from social network ads using demographic and behavioral features. The notebook demonstrates a complete ML workflow — data ingestion, preprocessing, scaling, modeling, evaluation, and visual diagnostics.

matplotlib-pyplot pandas python seaborn sklearn statistics

Last synced: 03 May 2026

https://github.com/ilke-kas/multivariate-data-analysis

A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.

classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics

Last synced: 05 Oct 2025

https://github.com/adisen99/msc_project

Code, functions and notebooks used for my Masters Project/Thesis.

master precipitation python statistics thesis

Last synced: 02 May 2026

https://github.com/stdlib-js/stats-strided-dsnanmeanpn

Calculate the arithmetic mean of a single-precision floating-point strided array, ignoring NaN values, using a two-pass error correction algorithm with extended accumulation, and returning an extended precision result.

arithmetic-mean array average avg central-tendency float32 javascript math mathematics mean node node-js nodejs single statistics stats stdlib strided strided-array typed

Last synced: 02 May 2026

https://github.com/elb4rto/craplog-cli

Parse Apache2 logs to create statistics

apache2 apache2-logs cli logs python3 statistics

Last synced: 13 Oct 2025

https://github.com/wilsoon7721/whatsapp-chatnalyzer

A tool designed to help users extract various statistics based on their WhatsApp chat exports, providing insights into messaging patterns, participant contributions, and more in a user-friendly interface.

chat statistics trends whatsapp

Last synced: 13 Apr 2026

https://github.com/elkronos/bmber

Bayesian Model Building and Evaluation Repository

bayesian-inference bayesian-statistics r rstan sensitivity-analysis statistics

Last synced: 17 Mar 2025

https://github.com/windi-wulandari/sentiment-analysis-imdb

This project analyzes sentiment from 50,000 IMDb movie reviews, aiming to enhance time and cost efficiency. The Naive Bayes model with TF-IDF yielded the best results, achieving up to 99% savings in time and cost, surpassing the initial goals.

imdb-dataset machine-learning sentiment-analysis sentiment-classification statistics supervised-learning

Last synced: 17 Mar 2025

https://github.com/akpoflash/covid_19_last_update_bot

🦠This bot has the last update of COVID-19 cases for each country

api covid javascipt node-js statistics telegram-bot

Last synced: 02 May 2026

https://github.com/ayakovlenko/scalai

Artificial Intelligence in Scala

linear-algebra machine-learning statistics

Last synced: 17 Mar 2025

https://github.com/stdlib-js/stats-strided-dcuminabs

Calculate the cumulative minimum absolute value of double-precision floating-point strided array elements.

abs absolute absolute-value accumulate cumulative domain extent extremes javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib

Last synced: 02 May 2026

https://github.com/stolsky/popular-baby-names

Interactive app listing popular baby names by year

baby-names birth-rates d3js destatis-data statistics

Last synced: 14 Oct 2025

https://github.com/eva-kaushik/probnetx

ProbNetX is a research-grade Bayesian network framework that learns conditional dependencies from discrete datasets and performs exact inference for predicting missing values.

algorithms-and-data-structures machne-learning naive-bayes-classifier statistics

Last synced: 14 Oct 2025

https://github.com/ayorick23/python-data-science-cheat-sheet

Guía rápida y práctica de sintaxis, comandos y funciones esenciales de Python para Ciencia de Datos. Perfecta para recordar cómo usar las librerías más comunes como NumPy, Pandas, Matplotlib y Scikit-learn en tus análisis diarios.

cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow

Last synced: 07 Apr 2026

https://github.com/alek-dr/math4ds

Примеры на Python и R из Сборника задач по математике т.4 под редакцией А.В.Ефимова и А.С.Попелова, Schaum's Outline of Theory and Problems of Statistics 3rd Edition by Murray R Spiegel, Larry J Stephens

education statistics

Last synced: 12 Mar 2026

https://github.com/egjfour/dsti-course-notes

Notes for classes taken at DSTI stored in an Obsidian vault and backed up to Github. Includes notes for all courses taken during my Master's program

aws calculus cloud graph law linear-algebra mlops neo4j optimization-algorithms owl-ontology project-management rdf software-engineering sql statistics

Last synced: 19 Apr 2026

https://github.com/dcs-training/introtostatistics

This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file

data-analysis r rmarkdown statistics

Last synced: 26 Mar 2025

https://github.com/intranda/goobi-plugin-statistics-sudan-memory

This Statistics plugin for Goobi workflow determines the activity of edits to translations within specific metadata fields.

digitisation goobi goobi-workflow plugin statistics

Last synced: 14 Mar 2026

https://github.com/crhntr/visual_stats

Some visualizations for Math352 Undergradutate Probability and Statistics

education probability statistics vuejs2

Last synced: 07 Sep 2025

https://github.com/coatless-textbooks/timeseriesisgreat

Notes from my odyssey in Time Series

bookdown notes r statistics time-series

Last synced: 15 Oct 2025

https://github.com/stdlib-js/stats-strided-dstdevpn

Calculate the standard deviation of a double-precision floating-point strided array using a two-pass algorithm.

deviation dispersion javascript math mathematics node node-js nodejs sample-standard-deviation spread standard-deviation statistics stats stdlib strided strided-array typed unbiased var variance

Last synced: 30 Apr 2026

https://github.com/gregoritsch3/ml_eda_classification_diabetes

An EDA and Machine Learning Classification exercise on the Diabetes dataset demonstrating the use of SQLAlchemy data import from an SQL database (PostgreSQL), Pre-processing Pipelines, ANOVA, 9 ScikitLearn ML models, Hyperparamter Tuning for the best performing one, and feature importance.

anova machine-learning matplotlib numpy pandas pipelines scikit-learn seaborn sql sqlalchemy statistics

Last synced: 14 Apr 2026

https://github.com/imbryan/day-tracker

Daily statistics tracking

calendar python3 statistics tracker

Last synced: 31 May 2026

https://github.com/stdlib-js/stats-strided-dsemwd

Calculate the standard error of the mean for a double-precision floating-point strided array using Welford's algorithm.

array estimate estimation javascript math mathematics mean node node-js nodejs standard-deviation standard-error standard-error-of-the-mean statistics stats stdlib strided strided-array typed welford

Last synced: 03 May 2026

https://github.com/crodriguez1a/ml-questions-daily

A collection of Machine Learning Q&A, ranging from fundamentals to bleeding-edge topics

deep-learning linear-algebra machine-learning machine-learning-algorithms python statistics

Last synced: 16 Oct 2025

https://github.com/tyriek-cloud/statistical-work-sample

The purpose of this study is to observe if a sample of people that has siblings is independent of a sample of people that possess an opinion of whether patients with incurable diseases should be allowed to die.

analysis data spss statistics t-test

Last synced: 22 Jan 2026

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 13 Apr 2026

https://github.com/mightymetrika/holi

holi: Higher Order Likelihood Inference Web Applications

data data-science r statistics

Last synced: 10 Feb 2026

https://github.com/psygo/monte-carlo-ts

Monte Carlo (Gaussian) with TypeScript and SolidJS

monte-carlo simulation statistics

Last synced: 07 Dec 2025

https://github.com/mikma03/mathematics_and_python

Advanced mathematical concepts. Application of the probability theory. Part of the theory of partial differential equations. The use of statistics in the field of DataScience.

data-science mathematics partial-differential-equations probability python statistics

Last synced: 03 May 2026

https://github.com/chrismainey/chief_analyst_presentation

My presentation for Chief Analyst interview with the Health Economics Unit

data-science interview nhs rstats statistics xaringan

Last synced: 14 Jul 2025

https://github.com/analyticbastard/statistical-independence-financial-forecasting

Statistical independence for (impossible) financial forecasting

cryptocurrency finance machine-learning statistics trading

Last synced: 27 May 2026

https://github.com/jhrcook/demeter2-stan

The DEMETER2 model of the impact of shRNA on cell line growth in Stan.

bayesian bioinformatics biostatistics demeter2 stan statistics

Last synced: 18 Oct 2025

https://github.com/deypadma2020/dataanalysis-mlalgo

Practice repository for data analysis, feature engineering, statistics, web scraping, and building ML model pipelines in Python.

data-analysis eda feature-engineering machine-learning-algorithms ml-pipeline statistics web-scraping

Last synced: 30 May 2026

https://github.com/xstupi00/Theoretical-Assignments

Elaborated projects with theoretical assignments during the master's degree.

automata complexity information-security markov-chain petri-nets statistics storm vut vut-fit

Last synced: 11 Mar 2025

https://github.com/samjuk/pubgstats

A webapp that allows you to view your stats for Player Unknown's Battlegrounds

php pubg pubgtracker statistics

Last synced: 19 Oct 2025

https://github.com/joekakone/inferential-statistics-with-r

Statistique Inférentielle avec R

inferential-statistics r statistics

Last synced: 30 May 2026

https://github.com/lightbluetitan/african_countries_indicators

African Countries: A Curated Dataset on Africa Indicators for Education and Data Science

africa african-countries analytics coding datascience datasets opensource programming python rstats statistics stats

Last synced: 03 May 2026

https://github.com/sivkri/hints-for-statistics-and-machine-learning-models

This file will give you an overall idea to choose appropriate statistical test

statistical-tests statistics

Last synced: 28 Jan 2026