An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/yoshoku/numo-random

Numo::Random provides random number generation with several distributions for Numo::NArray.

gem random ruby statistics

Last synced: 25 Apr 2025

https://github.com/giacomolaw/leaguestatistics

Statistics for a specific player on League of Legends

league-of-legends leagueoflegends statistics

Last synced: 06 Oct 2025

https://github.com/dudynets/instagram-direct-stats

An application that counts messages of various types from JSON.

instagram javascript statistics

Last synced: 15 Mar 2025

https://github.com/mine-cetinkaya-rundel/teach-data-public-good

Materials for the JSM 2020 session "Teaching with Data for the Public Good"

data-science education jsm2020 public-good statistics

Last synced: 21 Jan 2026

https://github.com/abdelrahmanbayoumi/age-tracker-app

Age Tracker app is a tool that helps users keep track of important birthdays, including those of their loved ones. It features real-time age display in Hijri and Georgian calendars.

age age-calculator angular birthday birthday-tracker hijri ngrx rxjs statistics typescript web-application

Last synced: 31 Jan 2026

https://github.com/maxbiostat/propriety_power_priors

On the propriety of the power prior for general models

prior statistics

Last synced: 13 Oct 2025

https://github.com/llnl/smallmoleval

Using machine learning to score potential drug candidates may offer an advantage over traditional imprecise scoring functions because the parameters and model structure can be learned from the data. However, models may lack interpretability, are often overfit to the data, and are not generalizable to drug targets and chemotypes not in the training data. Benchmark datasets are prone to artificial enrichment and analogue bias due to the overrepresentation of certain scaffolds in experimentally determined active sets. Datasets can be evaluated using spatial statistics to quantify the dataset topology and better understand potential biases. Dataset clumping comprises a combination of self-similarity of actives and separation from decoys in chemical space and is associated with overoptimistic virtual screening results. This code explores methods of quantifying potential biases and examines some common benchmark datasets.

machine-learning python statistics

Last synced: 26 May 2026

https://github.com/egarpor/rp.flm.test

Software companion for "Goodness-of-fit tests for the functional linear model based on randomly projected empirical processes"

functional-data-analysis goodness-of-fit r random-projections reproducible-research statistics

Last synced: 11 Jun 2025

https://github.com/dirkschumacher/tfjs-glm

Generalized linear models in tensorflow.js (WIP)

generalized-linear-models statistics tensorflow tensorflow-js

Last synced: 25 Apr 2026

https://github.com/levminer/netflix-statistics

If you want to know how much time you spent in your life watching Netflix!

netflix netflix-statistics statistics

Last synced: 27 Feb 2026

https://github.com/wlandau/rmedicine2021-slides

Slides for a possible talk at R/Medicine 2021 (submitted and under review, accepted talks not yet determined)

bayesian bayesian-statistics cmdstanr high-performance-computing markdown pipeline r r-markdown reproducibility stan stantargets statistics targets

Last synced: 01 May 2026

https://github.com/alsami/covid-19-statistics

Web application showing the data available from the Covid19Api.

angular covid-19 monorepo ngrx statistics

Last synced: 20 Jan 2026

https://github.com/omkarpattnaik8080/credit-card-fault-detection-system

"Developing a credit card fraud detection system using machine learning techniques to identify and prevent fraudulent transactions, ensuring the security and integrity of financial transactions for users and businesses."

aws data-science machine-learning matplotlib numpy pandas statistics

Last synced: 08 Jan 2026

https://github.com/jmsv/hypothesis-testing-calculator

A2 Computing project: Hypothesis Testing Calculator for the binomial distribution, written in WinForms C# and Xamarin C# for Android

android computing csharp hypothesis-testing mathematics statistics winforms xamarin

Last synced: 14 Apr 2026

https://github.com/jramkiss/jramkiss.github.io

Personal blog about statistics and machine learning

blog github-pages statistics

Last synced: 30 Apr 2025

https://github.com/mkearney/tidycor

🎓 Tidy correlation tools for academics

correlation quantitative-methods rstats statistics tidyversity

Last synced: 11 May 2026

https://github.com/snelsi/statistics-in-psychology

🌭 A collection of some data analytics methods

calculator dashboard math nextjs psychology react statistics typescript

Last synced: 29 Apr 2026

https://github.com/ruivieira/scala-gsl

GNU Scientific Library (GSL) bindings for Scala native

bindings gsl numerical scala scala-native statistics

Last synced: 13 Oct 2025

https://github.com/flyingworkshop/desmos-creations

Links to cool graphs I made on Desmos! Almost all of the graphs are interactive, so play around with them as much as you like!

calculus desmos desmos-art geometry graph-theory linear-algebra phase-portrait statistics

Last synced: 18 Mar 2026

https://github.com/psychbruce/dpi

🛸 The Directed Prediction Index (DPI): Quantifying Relative Endogeneity of Outcome Versus Predictor Variables.

causal-inference causality causality-analysis directed-acyclic-graph influence linear-models linear-regression prediction simulation statistics

Last synced: 22 Oct 2025

https://github.com/tecnickcom/rpistat

Web-Service to collect system usage statistics

raspberry-pi statistics system usage

Last synced: 06 Jul 2025

https://github.com/johnthecoolingfan/motorchik

Motorchik, discord bot written in python

discord-bot factorio mod python statistics

Last synced: 09 Jun 2026

https://github.com/pharo-ai/tf-idf

Implementation of TF-IDF in Pharo

pharo statistics term-frequency tf-idf

Last synced: 18 Mar 2025

https://github.com/valcol/git-activity-stats

Grab a Github contributions calendar and extact stats from it.

activity commit contributions-calendar github statistics stats streak

Last synced: 27 Jan 2026

https://github.com/dcs-training/summerschool2024-stream2

Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file

data-analysis data-visualisation data-wrangling geographical-data r sentiment-analysis statistics text-analysis web-scraping

Last synced: 25 Apr 2025

https://github.com/tmiddlet2666/ghstats

A command line tool to display download and repository metrics for GitHub repositories.

cli github statistics stats

Last synced: 19 Apr 2026

https://github.com/tomerfi/github-viewer-stats

Small NPM package for collecting your own GitHub statistics

github npm-package npm-script statistics

Last synced: 14 Feb 2026

https://github.com/slub/statsdelta

A commandline command (Python3 program) that compares two (CSV) statistics with each other and generates delta values from the (old and the new) values

cli command-line-tool csv delta python statistics

Last synced: 11 Apr 2025

https://github.com/stephane-martin/mailstats

Parse incoming emails for statistics

email golang milter parsing smtp statistics

Last synced: 24 Mar 2025

https://github.com/matkoniecz/streetcomplete_usage_changeset_analysis

Process metadata of all changesets ever made to make StreetComplete usage analysis

openstreetmap statistics streetcomplete

Last synced: 27 Oct 2025

https://github.com/shawnlaffan/statistics-descriptive-pdl

A close to drop-in replacement for Statistics::Descriptive, but using PDL as the back end.

pdl perl statistics

Last synced: 24 Feb 2026

https://github.com/marberts/rsmatrix

An R package for making repeat-sales matrices

cran economics housing r r-package rstats statistics

Last synced: 10 Oct 2025

https://github.com/hifly81/bikedump

Bike Dump is a Java GUI that can be used to manage and extract stats from GPX 1.0, GPX 1.1 and TCX 2 activities from your cycling/mountain biking workouts. It also offers graphs and history stats.

biking-applications bing cycling extract-stats gpx java map mountain-bike openstreetmap routes statistics workouts

Last synced: 15 Mar 2026

https://github.com/tomlav/snippets

Snippets from my research in geo-sciences (satellite, etc...), often in python.

cartopy notebook python satellite science science-research scientific-visualization snippets statistics

Last synced: 18 Oct 2025

https://github.com/sambhav228/data_structure_algorithm

A repository which consists of the collection of various Data Structures and Algorithms implemented in various Programming Languages.

c cpp java python statistics

Last synced: 04 May 2026

https://github.com/git-quick-stats/git-keyword-stats

Gather statistics on keywording in a repo.

descendents git git-swear-stats gitlog keyword statistics swear

Last synced: 06 Feb 2026

https://github.com/zenmate/stats

:bar_chart: Request statistics middleware

expressjs middleware node nodejs statistics stats

Last synced: 16 Oct 2025

https://github.com/csinva/trees-to-networks

Bridging random forests and deep neural networks. Partial implementation of "Neural Random Forests" https://arxiv.org/abs/1604.07143

artificial-intelligence classification decision-tree decision-tree-classifier deep-learning machine-learning machinelearning neural-network neural-networks paper-implementations python pytorch random-forest scikit-learn statistics

Last synced: 12 Apr 2026

https://github.com/kmedian/jackknife

Jackknife resampling, parameter estimation and stability test.

cross-validation estimation jackknife jackknife-resampling pypi stability-test statistics

Last synced: 10 Oct 2025

https://github.com/tathithienthanh/finaltest_database-sql-data-collection-for-ds

The final test of the "Database SQL and Data Collection for Data Science" course from The Ho Chi Minh City University of Science (19/09/2023)

chrome data-collection data-processing database final-test ipynb-jupyter-notebook mysql pymysql query scraping-websites selenium sql statistics visualization

Last synced: 17 Feb 2026

https://github.com/stdlib-js/blas-ext-base-dnannsumkbn

Calculate the sum of double-precision floating-point strided array elements, ignoring NaN values and using an improved Kahan–Babuška algorithm.

array blas compensated extended float64 javascript math mathematics node node-js nodejs statistics stats stdlib strided strided-array sum summation total typed

Last synced: 09 Apr 2025

https://github.com/fchristenson/sassy-stats

CLI for displaying usefull information about your sass

cli sass sassy-stats statistics

Last synced: 26 Feb 2026

https://github.com/abmantz/rgw

A lightweight R-language implementation of the affine-invariant sampling method of Goodman & Weare (2010)

markov-chain-monte-carlo statistics

Last synced: 22 Oct 2025

https://github.com/clok/sig

Statistics in Go - CLI tool for quick statistical analysis of data streams

cli go golang simple-statistics statistics stats

Last synced: 23 Mar 2025

https://github.com/yang-zhang/stat-tests

Easier-to-use statistical tests in Python

correlation hypothesis python statistics

Last synced: 16 Jan 2026

https://github.com/slub/solr-fstats

A Python3 program that extracts some statistics regarding field coverage from a Solr index.

cli command-line-tool python solr statistics

Last synced: 08 May 2025

https://github.com/stdlib-js/random-streams-frechet

Create a readable stream for generating pseudorandom numbers drawn from a Fréchet distribution.

continuous frechet javascript math mathematics node node-js nodejs prng pseudorandom rand random readable rng seed seedable statistics stats stdlib stream

Last synced: 27 Oct 2025

https://github.com/alexdawn/battle-cogitator

This is a WIP project to get a REST api using flask to calculate stats from a simulated 40k combat.

40k battlescribe simulation statistics

Last synced: 26 Apr 2025

https://github.com/nhh/cstat

Get machine readable statistics from your linux machine

csharp linux statistics

Last synced: 15 Apr 2026

https://github.com/stdlib-js/random-iter-logistic

Create an iterator for generating pseudorandom numbers drawn from a logistic distribution.

continuous generator iterator javascript logistic math mathematics node node-js nodejs prng pseudorandom rand random rng seed seedable statistics stats stdlib

Last synced: 08 Jan 2026

https://github.com/DCS-training/intromachinelearning

This course is aimed at providing an introduction to machine learning for those with some beginner level python/Rstudio skills. Go to the readme file

data-analysis data-wrangling machine-learning python statistics

Last synced: 25 Apr 2025

https://github.com/klangner/purescript-stats

Statistics in PureScript

statistics

Last synced: 22 Feb 2026

https://github.com/unsignedarduino/stats

A simple MakeCode Arcade extension to toggle stats on and off via blocks

arcade extension makecode makecode-arcade makecode-arcade-extensions makecode-extension statistics stats wrapper

Last synced: 14 Mar 2026

https://github.com/enselic/git-repo-language-trends

Analyze programming language usage over time in a git repository and produce a graphical or textual representation of the result.

git png repository-utilities statistics svg tabular-data trend-analysis

Last synced: 11 Apr 2025

https://github.com/vishrut-b/database-project-mysql-

It is an astrophysical data analytics project, which I did with MySQL. The project was a graded by Sebastien Derriere of the Strasbourg Astronomical Observatory.

astrophysical-data astrophysics mysql relational-databases sql statistics

Last synced: 06 Feb 2026

https://github.com/stdlib-js/stats-base-dsnanmeanwd

Calculate the arithmetic mean of a single-precision floating-point strided array, ignoring NaN values, using Welford's algorithm with extended accumulation, and returning an extended precision result.

arithmetic-mean array average avg central-tendency float32 javascript math mathematics mean node node-js nodejs statistics stats stdlib strided strided-array typed welford

Last synced: 09 Apr 2025

https://github.com/tbeason/santaslittlehelpers.jl

A package full of toys for me to use in my other programs.

julia julia-language statistics utliities

Last synced: 19 Jan 2026

https://github.com/joelpurra/masters-thesis

"Swedes Online: You Are More Tracked Than You Think" — Joel Purra's master's thesis for a Master of Science in Information Technology and Engineering at Linköping University, Sweden

linkoping-university lith liu master-thesis research statistics thesis tracking

Last synced: 05 Mar 2026

https://github.com/abusjahn/introslides

Slides for introduction to statistics with R code to make application to other data easy.

quarto r revealjs statistics statistics-course

Last synced: 24 Oct 2025

https://github.com/stdlib-js/random-iter-cauchy

Create an iterator for generating pseudorandom numbers drawn from a Cauchy distribution.

cauchy continuous generator iterator javascript math mathematics node node-js nodejs prng pseudorandom rand random rng seed seedable statistics stats stdlib

Last synced: 05 Mar 2026

https://github.com/stdlib-js/random-streams-cosine

Create a readable stream for generating pseudorandom numbers drawn from a raised cosine distribution.

continuous cosine javascript math mathematics node node-js nodejs prng pseudorandom raised rand random readable rng seed statistics stats stdlib stream

Last synced: 12 Apr 2025

https://github.com/mgorshkov/scipy

Scientific methods on top of NP library.

cplusplus cpp math mathematics scipy statistics

Last synced: 02 Apr 2026

https://github.com/narius2030/hive-datawarehouse-analysis

Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems

apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics

Last synced: 01 Apr 2025

https://github.com/mtpatter/kaiba

Bootlier implementation for anomaly detection

anomaly-detection jupyter notebook statistics

Last synced: 27 Oct 2025