Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/bblodfon/med-stat-solutions

Solutions to homework from the Fundamentals of Biostatistics, B. Rosner, Edition 8

fundamentals-of-biostatistics jupyter-notebook r-notebook rosner solutions statistics

Last synced: 16 Oct 2024

https://github.com/mihaiconstantin/sample-size-workshop

Workshop on Sample Size Planning for Intensive Longitudinal Studies

data-science power-analysis sample-size statistics time-series workshop

Last synced: 06 Nov 2024

https://github.com/educationaltestingservice/schoolgrowth

R package for more accurate school-level aggregate growth measures using Empirical Best Linear Prediction (EBLP)

blp education r statistics

Last synced: 06 Nov 2024

https://github.com/joseguilherme96/mini-projetos

Artificial Intelligence, daily activies tic tac toe, stopwatch, Statistical Calculator , Web Scraping and ChatGPT.

artificial-intelligence oriented-object-programming statistics stopwatch tic-tac-toe web-scraping

Last synced: 06 Nov 2024

https://github.com/justinribeiro/mplus-data-sets-and-templates

A collection of MPlus datasets and INP templates for study purposes.

mplus quantitive statistics

Last synced: 16 Oct 2024

https://github.com/rakibhhridoy/exploratorydataanalysis-python

Exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.

ab-testing chitest data-science eda exploratory-data-analysis ftest hypotheses hypothesis-testing inferential-statistics numpy pandas python statistical-analysis statistics statsmodels ttest

Last synced: 06 Nov 2024

https://github.com/rakibhhridoy/differentprojects

Some of my learning projects that I practice to launch in data science. Not all, but some of few that was stored in my local repository. It can be useful for beginner data science enthusiast. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 06 Nov 2024

https://github.com/rakibhhridoy/covid19analysisindashboard-tableau

Covid19 dashboard analysis of world,north america,south east Asia and their characteristics upon pandemic. Some interesting statistics is shown by the data. The increase rate make effect on death and recover rate quite periodic. Simulating those changes make more interactive.

covid-19 dashboard data-processing dataviz numpy pandas python statistics tableau tableau-dashboards

Last synced: 06 Nov 2024

https://github.com/rakibhhridoy/bioinformatics-geneticdatascience

This project is based on starting Bioinformatics as a life science student. Initializing a career as a Genetic Data Scientist and Bioinformatician.

bioinformatics biology biopython computer-science data-science genetic-data-science genetics genome-assembly genome-sequencing statistics

Last synced: 06 Nov 2024

https://github.com/hasnocool/hasnocool

An innovative project that updates a README file with dynamic content from GitHub repositories and WakaTime API.

api automation developer dynamic generator github integration language productivity programming python readme statistics wakatime

Last synced: 06 Nov 2024

https://github.com/marc7666/statistical-analysis-of-emotions-in-social-media

Statistical analysis of which factors influence dominant emotions in social media.

mathematics r random-forest rmd statistics

Last synced: 28 Oct 2024

https://github.com/barabasz/primes

Python class that calculates primes, their statistics and other related numbers in specified range

prime-numbers primes statistics

Last synced: 06 Nov 2024

https://github.com/bakkdoor/svl

Latin Library Word Statistics

analysis graph-algorithms graphdb graphs language latin statistics

Last synced: 05 Nov 2024

https://github.com/aglebov/sdafe-utils

Utilities for Statistics and Data Analysis for Financial Engineering by Ruppert and Matteson

statistics

Last synced: 06 Nov 2024

https://github.com/thomasx-0/env223

Research project measuring the affects of an increase in beef consumption on degradation of the amazon rainforest

environment project r statistics

Last synced: 28 Oct 2024

https://github.com/avoss84/seasonal

R code for the paper 'Forecasting seasonal time series data: a Bayesian model averaging approach'

bayesian-inference forecasting mcmc-sampler monte-carlo-methods seasonality statistics timeseries-analysis

Last synced: 07 Nov 2024

https://github.com/sahiltiwariiii/dssp

Predicting student math scores ! This project utilizes advanced machine learning techniques and MLOps tools like DVC and MLflow to predict a student's math score based on various factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing etc

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 08 Nov 2024

https://github.com/mdequeljoe/statsday

understanding financial accounts display at 2018 statistics day

chord-diagram data-visualization financial-markets statistics

Last synced: 05 Nov 2024

https://github.com/vi/wilson

Simple Rust library to calculate Wilson confidence interval using the formula from Wikipedia.

confidence-intervals mathematics probability rust statistics wilson

Last synced: 16 Oct 2024

https://github.com/mattip/presentations

Different presentations I have made

python statistics

Last synced: 28 Oct 2024

https://github.com/papposilene/podstats-lddm

Code-source du site de statistiques du podcast Les Démons du MIDI.

laravel7 podcast statistics

Last synced: 11 Oct 2024

https://github.com/humburg/pvalue-distribution

A shiny app to visualise p-value distributions. Intended to facilitate the discussion of how to interpret p-values.

p-values shiny-apps statistics

Last synced: 05 Nov 2024

https://github.com/mysftz/statistical-analysis

A in-depth review of statistical analysis in Python from datasets.

data-analysis python python3 statistics university university-project

Last synced: 06 Nov 2024

https://github.com/leeway64/lwrfilestatisticsvisualizer

Visualizes the popularity of files in a given directory

data-visualization directory-traversal files filesystem r-language statistics

Last synced: 08 Nov 2024

https://github.com/ccrisc/matomo-task-scheduler

Web app designed in Flask built to perform API calls through scheduled tasks and display the fetched data through statistics

api flask-application github-actions matomo-analytics statistics vercel web-development

Last synced: 01 Nov 2024

https://github.com/tupui/hdr-boxplot

Functional highest density region boxplot

python statistics uncertainty-analysis visualization

Last synced: 15 Oct 2024

https://github.com/trovster/stats.trovster.com

Yearly statistics powered by my API.

11ty api eleventy maps statistics tailwind tailwindcss

Last synced: 16 Oct 2024

https://github.com/vincentlaucsb/statistical-models

A collection of notes detailing statistical models, including both their theoretical aspects and applications (in R).

logistic-regression statistics

Last synced: 06 Nov 2024

https://github.com/laszlokorte/random-variable

An interactive example of a function applied to a random variable showing the resulting distribution.

random-variables statistics

Last synced: 06 Nov 2024

https://github.com/laszlokorte/hypothesis-test

An interactive visusalization that shows how an optimal binary classifier can be derived from two given hypothesis.

hypothesis-testing statistics

Last synced: 06 Nov 2024

https://github.com/laszlokorte/gaussian-estimator

An interactive visusalization that shows how the parameters of a bivariate Gaußian Distribution can be estimated based on a given set of samples.

estimator gaussian statistics

Last synced: 06 Nov 2024

https://github.com/kddubey/microarray-kaggle

Analyze a dataset with 72 observations and 7,129 features

machine-learning statistics

Last synced: 05 Nov 2024

https://github.com/amyanchen/computational-statistics

Computational statistics projects for statistical inference using R programming

computational-statistics r statistical-computing statistics

Last synced: 07 Nov 2024

https://github.com/amyanchen/sf-airbnb

Exploratory Data Analysis of San Francisco Airbnb's

data-analysis data-science data-visualization r rmarkdown statistics

Last synced: 07 Nov 2024

https://github.com/joacosnchz/covid-air-traffic-argentina

A comparison between the number of Covid-19 cases in Argentina and other countries with similar air traffic

air-traffic comparison covid-19 data-science emerging-markets statistics

Last synced: 05 Nov 2024

https://github.com/sakshamarora07/whatsapp-chat-analyser

This repository contains code for a WhatsApp Chat Analyzer that uses Python libraries to extract insights from chat messages.

chat data dataanalytics datascience matplotlib pandas python seaborn statistics streamlit whatsapp

Last synced: 14 Oct 2024

https://github.com/netesf13d/conversations-analysis

Load and analyze Facebook Messenger and Whatsapp conversations

analytics conversations-analysis facebook messenger statistics whatsapp

Last synced: 14 Oct 2024

https://github.com/yandexdataschool/ml-sweights-experiments

Experiments for the "Machine Learning on data with sPlot background subtraction" paper

data-analysis high-energy-physics machine-learning statistics

Last synced: 06 Nov 2024

https://github.com/psygo/monte-carlo-ts

Monte Carlo (Gaussian) with TypeScript and SolidJS

monte-carlo simulation statistics

Last synced: 07 Nov 2024

https://github.com/mrousavy/piechart

A rich PieChart control for WPF which supports easy MVVM bindings and data access

analytics chart control library pie-chart statistics wpf xaml

Last synced: 05 Nov 2024

https://github.com/bernhardangerer/gpx-stats-helper

The "GPX stats helper" library allows reading from GPX files (GPS data) and calculates a lot of helpful parameters

geocoding gps-data gpx gpx-library gpx-reader java java-11 openstreetmap reverse-geocoding statistics

Last synced: 13 Oct 2024

https://github.com/sgaunet/gitlab-stats

tool to register stats of gitlab projects/groups and make a barchart of activity

gitlab statistics

Last synced: 23 Oct 2024

https://github.com/messente/messente-api-python

Messente API library: https://pypi.org/project/messente-api

number-lookup omnichannel phonebook statistics

Last synced: 23 Oct 2024

https://github.com/sigpwned/delta4j

Elements for building concurrent and distributed data processing applications

concurrent-programming distributed-computing java probabilistic-data-structures statistics text

Last synced: 12 Oct 2024

https://github.com/galal-pic/the-data-analysis-workshop

“A very special book on data analysis, containing 10 different projects using statistics, probability, Python, visualization, and various machine learning and deep learning models.” With a detailed explanation of them on my YouTube channel

matplotlib probability python seaborn statistics storytelling time-series visualization

Last synced: 05 Nov 2024

https://github.com/maybethee/statchasing

Site that provides interesting analytic data for Rocket League players with replays on ballchasing.com

react rocket-league ruby-on-rails statistics

Last synced: 27 Oct 2024

https://github.com/codenameyau/math

Fundamental formulas and theorems

algebra calculus geometry math probability statistics trigonometry

Last synced: 14 Oct 2024

https://github.com/josuerhea/math-julia

Statistical analyses implemented in Julia language

julia julia-language julialang math statistics

Last synced: 12 Oct 2024

https://github.com/trentpark8800/ud120-projects

My fork of the Udacity ud120 course repo, to store and share my work on the course projects.

data-science machine-learning python27 sklearn statistics

Last synced: 28 Sep 2024

https://github.com/dcorking/school-perf-rank

Demo of using Ruby standard library to rank government school tables

csv-reading open-data opendata schools statistics

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/tankibaj/php-visitor-statistics

php-visitor-statistics

mysql php statistics

Last synced: 07 Nov 2024

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 05 Nov 2024

https://github.com/matackett/modernize-regression

Supplemental materials for the article "Three principles for modernizing an undergraduate regression analysis course"

data-science education r regression statistics

Last synced: 05 Nov 2024

https://github.com/rayraegah/mean-value-adjuster

A python class to combat troll votes in ranking systems

python ranking-methods statistics

Last synced: 27 Oct 2024

https://github.com/rajikaimal/ferreter

:telescope: Collect anonymous usage for NPM package usage

javascript nodejs statistics

Last synced: 27 Oct 2024

https://github.com/mine-cetinkaya-rundel/enar-2021

2021 ENAR Fostering Diversity in Biostatistics Computing Workshop

data-science rstats statistics

Last synced: 05 Nov 2024

https://github.com/mine-cetinkaya-rundel/teach-ds-wsc-2021

Materials for the Teaching data science conference at WSC 2021

data-science rstats statistics

Last synced: 05 Nov 2024

https://github.com/hendersontrent/qldyjcost

Simple R package for costs and calculations of youth offending in Queensland, Australia

cost-benefit cost-benefit-analysis econometrics economics evaluation statistics

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 05 Nov 2024

https://github.com/andreghl/stats

Implementing statistical methods in Julia.

econometrics julia statistics

Last synced: 27 Oct 2024

https://github.com/meain/gimpact

Git stat per author

authorship commits git statistics

Last synced: 28 Oct 2024

https://github.com/maxmekiska/statsload

Basic statistics library/header-file for C.

distributions probability statistics

Last synced: 20 Oct 2024

https://github.com/akramarenkov/stat

Library that allows you to collect and display the quantity of occurrences of values ​​in given spans

go golang statistics

Last synced: 02 Nov 2024

https://github.com/junpenglao/spafv

SPAFV - Surface Profile Analysis for Free Viewing eye movement experiment in 2AFC task

data-analysis statistics temporal-logic

Last synced: 25 Oct 2024

https://github.com/mindstorm38/sc-stat

Little console utility that shows lot of statistics about a project.

cpp statistics

Last synced: 20 Oct 2024

https://github.com/fabianschmick/statistik

C Projekt zur Berechnung von Median, Arithmetischen Mittel und der Standardabweichung

arithmetisches-mittel median standardabweichung statistics statistik

Last synced: 27 Oct 2024

https://github.com/matthewfeickert/pydistcore

The interface library for probabilistic modeling in HEP

high-energy-physics probabilistic-modeling python statistical-modeling statistics

Last synced: 15 Oct 2024

https://github.com/warrenweckesser/yanova

Functions for one-way and two-way ANOVA.

anova python statistics

Last synced: 15 Oct 2024

https://github.com/tbouron/ha-agur

Home Assistant integration for Agur https://ael.agur.fr

custom-integration hacs hacs-integration history home-assistant integration sensors statistics water

Last synced: 15 Oct 2024

https://github.com/pblischak/zig-ndarray

N-Dimensional Arrays in Zig

data-science ndarray statistics ziglang

Last synced: 15 Oct 2024

https://github.com/sourceduty/power-input_log

💻 Log the total time a computer is powered on, and the time spent inputting or idling while powered on.

c-plus-plus computer-usage computer-user idea linux log logger logging macos power-input power-input-log power-log power-logger programming statistics usage-log user-log windows

Last synced: 02 Nov 2024

https://github.com/nakshjainsonigara/football-eda

This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.

dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics

Last synced: 15 Oct 2024

https://github.com/andreypomortsev/statistical-list.pop-performance-evaluation

This repository contains a performance comparison of list.pop() versus list.pop(-2) in Python. The project involves measuring execution times of these list operations, performing statistical tests to evaluate the significance of differences, and visualizing the results using histograms and box plots.

jupyter-notebook scipy statistics visualization

Last synced: 15 Oct 2024

https://github.com/adijo/ucsc-bayesian-stats-2-project

Bayesian Statistics: Techniques and Tools

bayesian bayesian-inference machine-learning statistics

Last synced: 15 Oct 2024

https://github.com/adijo/ph525-statistics-and-r

Exercises from the Statistics and R course on edX.

inference r statistics

Last synced: 15 Oct 2024

https://github.com/adijo/bayesian-inference-hello-world

A minimalistic example of bayesian inference. We infer the probability of heads in a series of coin flips.

bayesian-inference hello-world machine-learning minimal statistics

Last synced: 15 Oct 2024

https://github.com/takuizum/parallelanalysis.jl

Heuristic methods for assessing approximate unidimensionality of data matrix.

julia-language julia-package psychology psychometrics statistics

Last synced: 19 Oct 2024

https://github.com/ngiann/fastparzenwindows.jl

Fast Parzen Windows: a kernel-based method for non-parametric probability density function.

julia kernel-density-estimation statistics

Last synced: 19 Oct 2024

https://github.com/saketkc/usc-math-505a-screening-solutions

Solutions to USC's MATH-505A Screening Exams

bookdown math rstudio statistics

Last synced: 15 Oct 2024