An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/luna-devv/mellow-a7s

Wamellow boat analytics (a7s) engine

a7s analytics bot discord statistics

Last synced: 22 Jun 2025

https://github.com/keimeno/word-growth-rate-analyzer

Analyzing the growth rate of all words that are actively written by Reddit users.

data-science nodejs statistics typescript

Last synced: 17 May 2026

https://github.com/kubeservice-stack/node-metrics

FinOPS agent. node metrics is memory statistics exporter for crane-schedule

agent finops memory node-metrics statistics

Last synced: 16 Jan 2026

https://github.com/xnuinside/pypi_tools_bot

PyPi Tools Bot for Telegram (@pypi_tools_bot) - subscribe to get updates about new releases of your favorite packages, search packages, get downloads statistics

downloads pypi pypi-packages python3 releases releases-digest statistics telegram telegram-bot

Last synced: 25 Jun 2025

https://github.com/mohammadkarbalaee/java-stats

The final project of the introductory statistics and probabilities course taken at SBU on fall 2021

java jfreechart object-oriented-programming probability shahid-beheshti-university statistics swing

Last synced: 27 Mar 2025

https://github.com/jasonjfoster/rolltalk

Presentation for rolling and expanding statistics of time-series data.

algorithms package presentation quarto r statistics

Last synced: 11 Apr 2025

https://github.com/thiyangt/dsjobtracker

What skills and qualifications are required for a data scientist?

dataset qualifications skills statistics tidy

Last synced: 26 Mar 2025

https://github.com/xavikortes/atp-probability

Probability engine to predict tennis matches result

atp probability statistics stats tennis

Last synced: 27 Oct 2025

https://github.com/louis-heraut/aeag_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 08 Apr 2026

https://github.com/csarven/fao-linked-data

FAO (Food and Agriculture Organization of the United Nations) Linked Data

dcv linked-data linked-sdmx prov-o rdf rdf-data-cube sdmx semstats statistical-data statistics void

Last synced: 26 Mar 2025

https://github.com/juangesino/behaviouraleconomics

All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam

behavioral-economics behavioural-economics data-analysis economics game-theory statistics

Last synced: 28 Oct 2025

https://github.com/wdhdev/my-github-profile

A website which provides a ton of information about your GitHub account.

github github-api github-oauth github-profile info nodejs oauth rest-api statistics stats ts typescript

Last synced: 18 Mar 2025

https://github.com/pharo-ai/TF-IDF

Implementation of TF-IDF in Pharo

pharo statistics term-frequency tf-idf

Last synced: 11 May 2025

https://github.com/aramshiva/babies

👶 A parser for every name listed on a Social Security Card between 1880-2023

babies data datagov db graphs mysql names social-security social-security-data sql statistics stats

Last synced: 22 Aug 2025

https://github.com/marcolugo/cansim2r

Extract CANSIM (Statistics Canada) tables and transform them into readily usable data in panel (wide) format. It can also extract more than one table at a time and produce the resulting merge by time period and geographical region.

canada r r-package statistics statistics-canada

Last synced: 20 Jun 2025

https://github.com/akai01/intermittentdemand.jl

IntermittentDemand.jl: Intermittent demand forecasting in Julia Language, |forecasting|Julia|

forecasting intermittent-demand inventory-management julia julia-language lumpy-time-series operations-research statistical-models statistics time-series

Last synced: 03 Feb 2026

https://github.com/mbjoseph/biometry-linear-models

Slides and scripts for teaching linear models in R

linear-models linear-regression r rstats statistics

Last synced: 15 Jun 2026

https://github.com/avoss84/seasonal

R code for the paper 'Forecasting seasonal time series data: a Bayesian model averaging approach'

bayesian-inference forecasting mcmc-sampler monte-carlo-methods seasonality statistics timeseries-analysis

Last synced: 10 Apr 2025

https://github.com/clockvapor/reddit-analyzer

Analyzes reddit comments to run various statistics

kotlin reddit reddit-api statistics subreddit

Last synced: 13 Jun 2026

https://github.com/muhammedhasan/betabinomial

Beta-Binomial for testing count data

count inference python statistics

Last synced: 11 Apr 2025

https://github.com/peplxx/probability-explorer

Interactive web application for exploring and visualizing probability distributions

data-visualization distribution educational-tool matplotlib-pyplot probability statistics streamlit-app

Last synced: 22 Aug 2025

https://github.com/csarven/bfs-linked-data

BFS (Bundesamt für Statistik - Swiss Federal Statistics Office) Linked Data

dcv linked-data linked-sdmx prov-o rdf rdf-data-cube sdmx semstats statistical-data statistics

Last synced: 26 Mar 2025

https://github.com/jialuechen/riskcore

Python Machine Learning Library for Risk Management

machine-learning neural-network risk-management statistics

Last synced: 27 Jul 2025

https://github.com/leen15/haproxy-stats-visualizer

A php app for visualize the status of multiple HaProxy instances.

docker haproxy metrics statistics visualizer

Last synced: 13 Apr 2025

https://github.com/3f/ghrmeter.user.js

📊📈 Displays statistics for attachments on GitHub Releases page

attachment download-counts ghr ghrmeter github github-releases github-statistics statistics stats userjs userscript

Last synced: 18 Jan 2026

https://github.com/dataknit/tidymine

Tidy interface for calculating Maximal Information-based Nonparametric Exploration (MINE) statistics

big-data exploratory-data-analysis mine mine-statistics package r r-package statistics

Last synced: 04 Apr 2025

https://github.com/maxbarsukov/log4meal

🍏 A simple application that allows you to log your meals statistics

calories-tracker healthy-eating meal-planner rails-application rails6 statistics

Last synced: 20 Jan 2026

https://github.com/yuhuihui2011/vasp

Quantification and Visualization of Variations of Splicing in Population

3s-scores alternative-splicing ballgown rna-seq splicing sqtl statistics visualization

Last synced: 19 Feb 2026

https://github.com/simpleidserver/statisticallearning

Statitical learning library in C#

linear-regression statistics

Last synced: 06 Apr 2025

https://github.com/camille-004/my-interview-prep

🗄️ All of the docs I am creating to cram for the ML interview.

interview ml probability statistics techniques

Last synced: 18 Jan 2026

https://github.com/super-lou/card

🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with EXstat

aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly

Last synced: 13 Apr 2025

https://github.com/philipperemy/neural-probability-dist-sampler

Training a network to sample from any probability distributions such as exponential distribution.

deep-learning deep-neural-networks probability-distribution sampler statistics tensorflow

Last synced: 16 Aug 2025

https://github.com/brews/burnr_2018_manuscript_figures

Figures and code for the 2018 paper "burnr: Fire history analysis and graphics in R"

dendrochronology ecology fire forest paper r rstats statistics visualization

Last synced: 11 Apr 2025

https://github.com/super-lou/explore2_toolbox

💧 R toolbox to provide a simple way of interacting with the code necessary to carry out diagnostic of the hydrological models used in Explore2

climate climate-change climate-data climate-model climate-science diagnostic environment explore2 hydrological-model hydrology inrae model r statistics

Last synced: 13 Apr 2025

https://github.com/abhiksark/udacity-dataanalyst-nanodegree

All my codes that were submitted during Udacity Nanodegree - Data Analyst Course.

data-analysis data-visualization matplotlib pandas python seaborn statistics udacity udacity-nanodegree

Last synced: 07 Apr 2026

https://github.com/crafterkolyan/applied-statistical-data-analysis

Курс прикладного статистического анализа данных. ВМК МГУ. Весна 2020

autoexec-scripts autotest data-analysis github-actions statistics university

Last synced: 17 Jun 2025

https://github.com/nikolas-virionis/polynomial-regression

Python package that analyses the given datasets and comes up with the best regression representation with either the smallest polynomial degree possible, to be the most reliable without overfitting or other models such as exponentials and logarithms

data-analysis exponential-regression flexibility logarithmic-regression logistic-regression polynomial-regression python sinusoisdal-regression statistics

Last synced: 06 Apr 2026

https://github.com/luizcalaca/pandas-statistics

Implementações no contexto de Data Science (Estatística Descritiva) utilizando o Python e a biblioteca Pandas

matplotlib pandas-library python3 statistics

Last synced: 15 May 2026

https://github.com/migueltc13/project-li3

Data-parsing program for reading and interpreting csv files using efficient modular design within Computer Labs III environment.

c-programming-language c-project csv-parser data-encapsulation input-output input-validation modularity statistics test-automation testing

Last synced: 04 Oct 2025

https://github.com/statsim/profile

StatSim Profile. Generate data profiles in the browser

data-profile data-profiling online-algorithms statistics streaming-algorithms

Last synced: 22 Jun 2026

https://github.com/dantesc03/time-spent

Statistics project in R about time spent, relating data to current and past issues. Our data source is the OWID website where we collected data from the data tables.

collaborate datacamp dplyr ggplot2 github-copilot github-pages gitlens mapproject pupillometry rprogramming rprogrammingassignment statistics student-vscode tidyr time

Last synced: 22 Jun 2026

https://github.com/grand-27-master/data-science-course

One-stop repo for learning data science along with roadmap!

data-analysis data-science machine-learning python statistics

Last synced: 12 May 2026

https://github.com/moeeinaali/eps-project

Project of Dr. Jafari's CE40181: Engineering Probability and Statistics (Sharif University of Technology - Spring 2023)

eps jupyter-notebook probability python r statistics

Last synced: 08 May 2026

https://github.com/cnuahs/hermans-rasson

The Hermans-Rasson test for non-uniformity of circular data.

circular-statistics hermans-rasson matlab rayleigh-test statistics

Last synced: 12 Jun 2026

https://github.com/jramkiss/jramkiss.github.io

Personal blog about statistics and machine learning

blog github-pages statistics

Last synced: 30 Apr 2025

https://github.com/willie-conway/meta-data-analyst-portfolio

A comprehensive 📚portfolio showcasing projects and skills developed during the Meta Data Analyst Professional Certificate 🎓course, featuring 📈data analysis, 📊visualization, and 👨🏿‍💻management using various ⚙️tools.

big-data business-intelligence data-analysis data-cleaning data-driven-decisions data-management data-mining data-visualization exploratory-data-analysis jupyter-notebook machine-learning pandas porfolio predictive-modeling python spreadsheet-analysis sql statistics tableau visualization-tools

Last synced: 11 Apr 2026

https://github.com/dcs-training/summerschool2024-stream2

Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file

data-analysis data-visualisation data-wrangling geographical-data r sentiment-analysis statistics text-analysis web-scraping

Last synced: 25 Apr 2025

https://github.com/thibaudcolas/statistical-inference

A little exploration of R's power for statistical inference

psychology statistical-inference statistics

Last synced: 15 Mar 2025

https://github.com/dcousin3/measurementprecision

Measurement Precision toolkit for R

measurement-precision statistics

Last synced: 16 Oct 2025

https://github.com/flazefy/gudangku-laravel

GudangKu helps you manage your belongings, from home supplies and food stock to furniture. Set reminders to remind you to cleaning or maybe time to restocking some of your home supplies. In this apps also have generate reports to create shopping or maintenance list. Start organizing your inventory with GudangKu’s features. Created using Laravel

api-testing cronjob csv-export firebase firebase-storage integration-testing laravel mailer migrations mysql pdf pdf-parser php rest-api seeding statistics swagger task-scheduler telegram-bot unit-testing

Last synced: 01 Jul 2025

https://github.com/chaoticsomeone/ooen_mining

Statistische Auswertung der Online-Ausgabe der OÖN

austria newspaper statistics

Last synced: 04 May 2025

https://github.com/dudynets/instagram-direct-stats

An application that counts messages of various types from JSON.

instagram javascript statistics

Last synced: 15 Mar 2025

https://github.com/matackett/modernize-regression

Supplemental materials for the article "Three principles for modernizing an undergraduate regression analysis course"

data-science education r regression statistics

Last synced: 28 Jan 2026

https://github.com/mkearney/tidycor

🎓 Tidy correlation tools for academics

correlation quantitative-methods rstats statistics tidyversity

Last synced: 11 May 2026

https://github.com/alsami/covid-19-statistics

Web application showing the data available from the Covid19Api.

angular covid-19 monorepo ngrx statistics

Last synced: 20 Jan 2026

https://github.com/llnl/smallmoleval

Using machine learning to score potential drug candidates may offer an advantage over traditional imprecise scoring functions because the parameters and model structure can be learned from the data. However, models may lack interpretability, are often overfit to the data, and are not generalizable to drug targets and chemotypes not in the training data. Benchmark datasets are prone to artificial enrichment and analogue bias due to the overrepresentation of certain scaffolds in experimentally determined active sets. Datasets can be evaluated using spatial statistics to quantify the dataset topology and better understand potential biases. Dataset clumping comprises a combination of self-similarity of actives and separation from decoys in chemical space and is associated with overoptimistic virtual screening results. This code explores methods of quantifying potential biases and examines some common benchmark datasets.

machine-learning python statistics

Last synced: 26 May 2026

https://github.com/lukem512/mann-whitney-utest

An NPM module for computing the Mann-Whitney U test (a nonparametric statistical test)

analysis mann-whitney statistics

Last synced: 08 Jul 2025

https://github.com/wdbm/shijian

change, time, file, list, statistics, language and other utilities

clock statistics time

Last synced: 03 Aug 2025

https://github.com/dirkschumacher/tfjs-glm

Generalized linear models in tensorflow.js (WIP)

generalized-linear-models statistics tensorflow tensorflow-js

Last synced: 25 Apr 2026

https://github.com/pharo-ai/tf-idf

Implementation of TF-IDF in Pharo

pharo statistics term-frequency tf-idf

Last synced: 18 Mar 2025

https://github.com/hifly81/bikedump

Bike Dump is a Java GUI that can be used to manage and extract stats from GPX 1.0, GPX 1.1 and TCX 2 activities from your cycling/mountain biking workouts. It also offers graphs and history stats.

biking-applications bing cycling extract-stats gpx java map mountain-bike openstreetmap routes statistics workouts

Last synced: 15 Mar 2026

https://github.com/mccarthy-m-g/psyc-615-lab

PSYC 615: Analysis of Variance

r statistics teaching

Last synced: 08 Apr 2025

https://github.com/stephane-martin/mailstats

Parse incoming emails for statistics

email golang milter parsing smtp statistics

Last synced: 24 Mar 2025

https://github.com/slub/statsdelta

A commandline command (Python3 program) that compares two (CSV) statistics with each other and generates delta values from the (old and the new) values

cli command-line-tool csv delta python statistics

Last synced: 11 Apr 2025

https://github.com/fboucher/azyoutubestats

Serverless functions returning YouTube Statistics

azure-functions hacktoberfest statistics youtube-api-v3

Last synced: 20 Jan 2026

https://github.com/csinva/trees-to-networks

Bridging random forests and deep neural networks. Partial implementation of "Neural Random Forests" https://arxiv.org/abs/1604.07143

artificial-intelligence classification decision-tree decision-tree-classifier deep-learning machine-learning machinelearning neural-network neural-networks paper-implementations python pytorch random-forest scikit-learn statistics

Last synced: 12 Apr 2026

https://github.com/simphotonics/sample_statistics

Sample statistics, histograms, probability distributions, and random sample generators for Dart.

error-function probability-distribution random random-number-generators sample statistics

Last synced: 02 Apr 2026

https://github.com/coatless-rpkg/msos

msos: Data Sets and Functions Used in Multivariate Statistics: Old School by John Marden

multivariate r statistics

Last synced: 11 Mar 2026

https://github.com/yoshoku/numo-random

Numo::Random provides random number generation with several distributions for Numo::NArray.

gem random ruby statistics

Last synced: 25 Apr 2025

https://github.com/omkarpattnaik8080/credit-card-fault-detection-system

"Developing a credit card fraud detection system using machine learning techniques to identify and prevent fraudulent transactions, ensuring the security and integrity of financial transactions for users and businesses."

aws data-science machine-learning matplotlib numpy pandas statistics

Last synced: 08 Jan 2026

https://github.com/egarpor/rp.flm.test

Software companion for "Goodness-of-fit tests for the functional linear model based on randomly projected empirical processes"

functional-data-analysis goodness-of-fit r random-projections reproducible-research statistics

Last synced: 11 Jun 2025

https://github.com/bitcoin-data/bitcoin-stats-archive

Archive of Bitcoin stats from public sources

archive bitcoin statistics

Last synced: 17 Jan 2026