An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/spacebakery/variance-in-weather-project

Statistics for Data Analysis | Variance and Standard Deviation

data-analysis python standard-deviation statistics variance

Last synced: 05 Jul 2025

https://github.com/kaschenkkko/fitnesstrackermodule

A Python module designed for tracking and managing fitness activities and progress. It provides functionality for logging workouts, monitoring goals, and analyzing performance metrics.

oop python statistics

Last synced: 13 Jun 2025

https://github.com/dmarks84/coursework_project_sentiment-analysis

Project for University of Michigan Python Programming Specialization -- Read in tweets and analyzed their content to perform basic sentiment analysis

classification programming python sentiment-analysis statistics web-scraping

Last synced: 09 Apr 2025

https://github.com/genietim/ache-analyzer

Principal component and other statistical analysis to detect correlations to aches

ache fitbit health statistics weather

Last synced: 08 Apr 2025

https://github.com/fauzanfebrian/spotify

Spotify statistics for a personal account

spotify-api statistics

Last synced: 17 Jan 2026

https://github.com/stdlib-js/stats-array-min-by

Calculate the minimum value of an array via a callback function.

array domain extent extremes javascript math mathematics min minimum node node-js nodejs range statistics stats stdlib

Last synced: 26 Apr 2026

https://github.com/lavakin/covid_positivity_statistics

Statistics exercise

statistics

Last synced: 11 Mar 2025

https://github.com/puravparab/claude-stats

heatmaps for your claude.ai conversation history

claude heatmap nextjs python statistics

Last synced: 13 Apr 2026

https://github.com/samorso/ibpaper

This repository contains the code to replicate the numerical studies presented in the paper "A Flexible Bias Correction Method based on Inconsistent Estimators".

bias-reduction finite-sample-bias indirect-inference regression resampling statistics

Last synced: 02 Apr 2025

https://github.com/harmim/vut-msp-project

Statistics and Probability - Project

excel msp regression-analysis statistics vut vut-fit

Last synced: 08 Jan 2026

https://github.com/bt-88/deltasight-statistics

Provides efficient tracking of common statistical descriptors (mean, st. dev., sum, count) of a changing numeric sample

statistics

Last synced: 14 Jan 2026

https://github.com/pabsan-0/vfs2

Vectorial Mutual-Information based feature selection

feature-selection mutual-information repos-ml statistics

Last synced: 17 Mar 2025

https://github.com/tharun-mahendra/statistics

A repository for statistics and data analysis projects.

statistics

Last synced: 07 May 2025

https://github.com/schw4b/titanic

A reanalysis of the Titanic data set in R and Quarto.

quarto r statistics

Last synced: 25 Mar 2025

https://github.com/mituskillologies/krai-sppu-mca

The repository contains all the practicals for the subject "Knowledge Representation in Artificial Intelligence" of the MCA program at Savitribai Phule Pune University, Pune. Programmed by Tushar B. Kute.

artificial-intelligence artificial-neural-networks convolutional-neural-network decision-trees deep-learning machine-learning neural-networks recurrent-neural-networks statistics support-vector-machine

Last synced: 06 Jul 2025

https://github.com/olafhaag/c3d-statistics

Analyze conditional values of C3D data generated by Phasespace Impulse X2 motion capture system

c3d motion-capture phasespace statistics

Last synced: 14 Jun 2025

https://github.com/elb4rto/craplog

Creates statistics from Apache2 log files

apache2 apache2-logs apache2-statistics cli gui logs statistics

Last synced: 16 Oct 2025

https://github.com/dmarks84/coursework_project_nlp-with-nltk

Project for University of Michigan Applied Data Science Specialization -- Utilized NLTK library to process natural language, and then built several spelling recommenders for a list of misspelled words.

data-modeling databases dataframes eda nlp numpy pandas python reporting statistics text-mining visualization

Last synced: 13 Apr 2026

https://github.com/mightymetrika/mmibain

Bayesian Informative Hypotheses Evaluation Web Applications

bayes-factor bayesian hypothesis informative r statistics

Last synced: 06 Jul 2025

https://github.com/xstupi00/Theoretical-Assignments

Elaborated projects with theoretical assignments during the master's degree.

automata complexity information-security markov-chain petri-nets statistics storm vut vut-fit

Last synced: 11 Mar 2025

https://github.com/domingosdeeulariadumba/ablisk

A Python module for design, analysis and decision-making of A/B tests.

ab-testing data-visualization statistics

Last synced: 14 Jan 2026

https://github.com/deypadma2020/dataanalysis-mlalgo

Practice repository for data analysis, feature engineering, statistics, web scraping, and building ML model pipelines in Python.

data-analysis eda feature-engineering machine-learning-algorithms ml-pipeline statistics web-scraping

Last synced: 30 May 2026

https://github.com/mightymetrika/holi

holi: Higher Order Likelihood Inference Web Applications

data data-science r statistics

Last synced: 10 Feb 2026

https://github.com/dmarks84/coursework_project_network-analysis-node-link-prediction

Project for University of Michigan Applied Data Science Specialization -- Analyzed network nodes and edges, developing custom features based on various scoring metrics; used features to train classifier model to predict node attribute (employee salary type) and future edges (employee connections)

classification cross-validation data-reporting databases eda grid-search matplotlib network-analysis numpy pandas python scikit-learn statistics supervised-ml visualization

Last synced: 13 Apr 2026

https://github.com/pordarman/ultimate-stat-bot

A comprehensive statistics bot for Discord servers that tracks user voice and message activity. Features detailed leaderboards, personal stat lookups (!me), channel analysis, a blacklist system, and persistent data storage with MongoDB.

analytics-bot discord-bot discord-js discordjs-v14 javascript mongodb nodejs statistics statistics-bot

Last synced: 03 May 2026

https://github.com/imbryan/day-tracker

Daily statistics tracking

calendar python3 statistics tracker

Last synced: 31 May 2026

https://github.com/crhntr/visual_stats

Some visualizations for Math352 Undergraduate Probability and Statistics

education probability statistics vuejs2

Last synced: 07 Sep 2025

https://github.com/dcs-training/introtostatistics

This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file

data-analysis r rmarkdown statistics

Last synced: 26 Mar 2025

https://github.com/alek-dr/math4ds

Examples in Python and R from the Collection of Problems in Mathematics, vol. 4, edited by A.V. Efimov and A.S. Popelov, and from Schaum's Outline of Theory and Problems of Statistics, 3rd Edition, by Murray R. Spiegel and Larry J. Stephens

education statistics

Last synced: 12 Mar 2026

https://github.com/ayakovlenko/scalai

Artificial Intelligence in Scala

linear-algebra machine-learning statistics

Last synced: 17 Mar 2025

https://github.com/windi-wulandari/sentiment-analysis-imdb

This project analyzes sentiment from 50,000 IMDb movie reviews, aiming to enhance time and cost efficiency. The Naive Bayes model with TF-IDF yielded the best results, achieving up to 99% savings in time and cost, surpassing the initial goals.

imdb-dataset machine-learning sentiment-analysis sentiment-classification statistics supervised-learning

Last synced: 17 Mar 2025

https://github.com/elkronos/bmber

Bayesian Model Building and Evaluation Repository

bayesian-inference bayesian-statistics r rstan sensitivity-analysis statistics

Last synced: 17 Mar 2025

https://github.com/kddubey/microarray-kaggle

Analyze a dataset with 72 observations and 7,129 features

machine-learning statistics

Last synced: 08 Apr 2025

https://github.com/wilsoon7721/whatsapp-chatnalyzer

A tool designed to help users extract various statistics based on their WhatsApp chat exports, providing insights into messaging patterns, participant contributions, and more in a user-friendly interface.

chat statistics trends whatsapp

Last synced: 13 Apr 2026

https://github.com/aayushwankhade/z

z is a versatile programming language known for its simplicity and ease of use in developing web applications. With a strong focus on clean, readable code and efficient performance, z is ideal for both beginner and experienced developers looking to create high-quality software.

apache chatgpt data-engineering data-science data-visualization fish free game immutability machine-learning pattern-matching python statistics zwave

Last synced: 07 Sep 2025

https://github.com/connorodea/statistics-toolkit-cli

📊 A comprehensive command-line statistics learning tool with step-by-step explanations

cli edtech education learning mathematics python statistics

Last synced: 14 Jan 2026

https://github.com/ccrisc/metaanalysis

A meta-analysis of whether an increase in the minimum wage compresses wage inequality.

data-analysis-r learn meta-analysis statistics

Last synced: 23 Aug 2025

https://github.com/ilke-kas/multivariate-data-analysis

A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.

classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics

Last synced: 05 Oct 2025

https://github.com/steffin12-git/logistic-regression-social-network-ads-ml

Built an interpretable Logistic Regression model to predict whether a user will purchase a product from social network ads using demographic and behavioral features. The notebook demonstrates a complete ML workflow — data ingestion, preprocessing, scaling, modeling, evaluation, and visual diagnostics.

matplotlib-pyplot pandas python seaborn sklearn statistics

Last synced: 03 May 2026

https://github.com/ik5/tracepath

For those who trespass against us

golang graph plot plotting statistics

Last synced: 05 Oct 2025

https://github.com/mightymetrika/gifistab

A Stability Assessment Framework

datascience r research statistics

Last synced: 05 Oct 2025

https://github.com/jedrzejszelc/my_projects

A collection of Jedrzej (Andrew) Szelc's projects in Python, Robotframework, SQL and R languages.

machine-learning python3 rlanguage robotframework sql statistics xml xml-parser

Last synced: 18 May 2026

https://github.com/josepablodmg/python--linear-regression---housing-exercise

A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.

california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization

Last synced: 05 Oct 2025

https://github.com/rreece/statistics-notebooks

Ryan's statistics notebooks

hypothesis-testing statistics

Last synced: 06 Oct 2025

https://github.com/vic-cheung/rpy-bridge

Python-R interoperability layer for environment management, type-safe conversions, data normalization, safe function execution, and supporting recursive R object unpacking.

bioinformatics bridge interoperability interoperability-layer python python-bridge r rpy2 statistics

Last synced: 17 Jan 2026

https://github.com/codenameyau/math

Fundamental formulas and theorems

algebra calculus geometry math probability statistics trigonometry

Last synced: 17 Feb 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/sharmas1ddharth/10_days_of_statistics_hackerrank

This repository contains solutions to HackerRank's 10 Days of Statistics challenge problems.

10daysofstatistics hackerrank hackerrank-solutions r statistics

Last synced: 07 Oct 2025

https://github.com/miozilla/kmeansc

kmeansc :blossom::tulip::sunflower: : K-Means Clustering # scikit-learn framework # clustering model

centroids cluster k kmeans math matplotlib means sklearn statistics

Last synced: 18 May 2026

https://github.com/steviecurran/numerical-integration

Will plot and integrate a function over the desired range, in addition to giving the point estimates and credible intervals

credible-interval integration point-estimating poisson-gamma probability python statistics visualization

Last synced: 31 May 2026

https://github.com/jenspi/sustainabilitypal

University of New Orleans x Tulane Hackathon Spring 2025! Theme: Social Innovation or Sustainability

habit-tracking hackathon hackathon-project mobile-app mobile-development statistics sustainability tulaneuniversity universityofneworleans uno-hackathon-spring-2025

Last synced: 18 Jan 2026

https://github.com/arunpersaud/cube

Timing analytics for Rubik's cube

cstimer cube rubrik statistics

Last synced: 07 Oct 2025

https://github.com/dcs-training/machinelearning

Introduction to Machine Learning with Python delivered by the centre in the year 2022-23. Go to the read me file

data-analysis data-wrangling machine-learning python statistics

Last synced: 08 Oct 2025

https://github.com/lixx21/introduction-to-statistics

Learn statistics from Coursera and Stanford

statistical-analysis statistics

Last synced: 18 Jan 2026

https://github.com/jmetrikat/github-stats

Better GitHub statistics images for your profile.

python statistics visualization

Last synced: 08 Oct 2025

https://github.com/preritdas/large-numbers

Repository for an online deployed law of large numbers and standard distribution simulation.

math random simulation statistics

Last synced: 09 Oct 2025

https://github.com/sdttttt/huck

💾 Data collection server software. Configuration flexible, no external dependencies.

go http-server simple statistics

Last synced: 14 Jan 2026

https://github.com/spenczar/bootstraps

Nonparametric bootstraps in Go

bootstrapping-statistics golang statistics

Last synced: 09 Oct 2025

https://github.com/mgckaled/rs_data-analytics

Aggregator repository for the content of the Data Analytics training program developed by Rocketseat.

data-analytics data-visualization python sql statistics

Last synced: 09 Oct 2025

https://github.com/milad-rasouli/toker

Toker is a lightweight app that clones GitHub projects and analyzes their codebase with Tokei, offering insightful statistics and details about project structure.

code-analysis code-metrics statistics tokei

Last synced: 14 Jan 2026

https://github.com/bhavnanahar/breast.cancer.detection

This project aims to build a machine learning model to predict breast cancer using a dataset containing various medical features

colab-notebook machine-learning python statistics

Last synced: 10 Oct 2025

https://github.com/fauzancodes/covariance-calculator

Covariance Calculator

covariance statistics

Last synced: 23 Feb 2026

https://github.com/chriso345/golumn

In-Memory DataFrames for Go

data-science dataframe golang statistics

Last synced: 10 Oct 2025

https://github.com/paragpvyas/projects

good, sometimes ugly, functional code. First program to mine frequent patterns, a smart pill organizer python app, finally testing global randomness presented by RNGs

embedded-systems frequent-pattern-mining guizero infrared-sensors java json mime mqtt-protocol normalization-score paho-mqtt python random-number-generators raspberry-pi sms-api state-machine statistics

Last synced: 11 Apr 2026

https://github.com/jpleitao/statistics-assignments

Assignments for the Statistics Course (Doctoral Program in Information Science and Technology)

r statistics

Last synced: 27 Jan 2026

https://github.com/thertzlor/bandcampmonitor

A web interface that aggregates live Bandcamp sales for music discovery.

bandcamp music statistics vue3

Last synced: 19 Jan 2026

https://github.com/stdlib-js/stats-strided-dvarianceyc

Calculate the variance of a double-precision floating-point strided array using a one-pass algorithm proposed by Youngs and Cramer.

cramer deviation dispersion javascript math mathematics node node-js nodejs sample-variance standard-deviation statistics stats stdlib strided strided-array unbiased var variance youngs

Last synced: 15 May 2026

https://github.com/best-doctor/we_are_venom

Checks which modules a developer has contributed to, using git history.

developer-tools git onboarding statistics

Last synced: 12 Oct 2025

https://github.com/sebastian-gregoricchio/rseb

An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.

atac-seq bedtools chip-seq cutandtag daily-tasks data-visualisation data-visualization datamining deeptools genomics ngs qpcr qpcr-analysis r rna-seq statistics

Last synced: 31 May 2026

https://github.com/stdlib-js/stats-base-ndarray-dztest

Compute a one-sample Z-test for a one-dimensional double-precision floating-point ndarray.

hypothesis javascript math mathematics ndarray node node-js nodejs normality statistics stats stdlib z-test ztest

Last synced: 17 May 2026

https://github.com/elb4rto/craplog-cli

Parse Apache2 logs to create statistics

apache2 apache2-logs cli logs python3 statistics

Last synced: 13 Oct 2025

https://github.com/jo-tham/geosample

Generate representative sample locations from spatial data

gis spatial-analysis statistics

Last synced: 22 Jan 2026

https://github.com/stolsky/popular-baby-names

Interactive app listing popular baby names by year

baby-names birth-rates d3js destatis-data statistics

Last synced: 14 Oct 2025

https://github.com/eva-kaushik/probnetx

ProbNetX is a research-grade Bayesian network framework that learns conditional dependencies from discrete datasets and performs exact inference for predicting missing values.

algorithms-and-data-structures machne-learning naive-bayes-classifier statistics

Last synced: 14 Oct 2025

https://github.com/ayorick23/python-data-science-cheat-sheet

A quick, practical guide to essential Python syntax, commands, and functions for data science. Ideal for remembering how to use the most common libraries, such as NumPy, Pandas, Matplotlib, and Scikit-learn, in your day-to-day analyses.

cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow

Last synced: 07 Apr 2026

https://github.com/egjfour/dsti-course-notes

Notes for classes taken at DSTI, stored in an Obsidian vault and backed up to GitHub. Includes notes for all courses taken during my Master's program.

aws calculus cloud graph law linear-algebra mlops neo4j optimization-algorithms owl-ontology project-management rdf software-engineering sql statistics

Last synced: 19 Apr 2026

https://github.com/palewire/ipsos-credibility-interval

A Python tool that calculates Bayesian credibility intervals for online polling using the Ipsos method

bayesian data-journalism journalism news polling python statistics

Last synced: 19 Feb 2026

https://github.com/coatless-textbooks/timeseriesisgreat

Notes from my odyssey in Time Series

bookdown notes r statistics time-series

Last synced: 15 Oct 2025

https://github.com/mohdrasmil7/ml-notebooks

This repository contains Jupyter notebooks demonstrating machine learning exercises using both supervised classification and unsupervised algorithms. Each notebook offers a hands-on approach to understanding and applying ML techniques to real-world datasets, providing valuable insights and practical skills for data analysis and predictive modeling.

machine-learning-algorithms neural-networks statistics supervised-learning unsupervised-learning

Last synced: 15 Oct 2025

https://github.com/crodriguez1a/ml-questions-daily

A collection of Machine Learning Q&A, ranging from fundamentals to bleeding-edge topics

deep-learning linear-algebra machine-learning machine-learning-algorithms python statistics

Last synced: 16 Oct 2025

https://github.com/nomeyho/skype-analyzer

Get insights on your Skype conversations

analyze backup chat conversation export insight skype statistics

Last synced: 17 Oct 2025