Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Statistics

Statistics is a mathematical discipline concerned with developing and studying mathematical methods for collecting, analyzing, interpreting, and presenting large quantities of numerical data. Statistics is a highly interdisciplinary field of study with applications in fields such as physics, chemistry, life sciences, political science, and economics.

https://github.com/abidshafee/google.colaboratory_projects

This repository contains a collection of interactive Python notebooks (ipynb) from my projects on Data Science, Machine Learning (ML), and Natural Language Processing (NLP).

colaboratory data-analysis data-science lstm machine-learning nlp statistics time-series

Last synced: 06 Nov 2024

https://github.com/mattip/presentations

Different presentations I have made

python statistics

Last synced: 28 Oct 2024

https://github.com/pedropizarro/learn-statistics-with-python

Personal code repository for studying statistics in Python

python statistical-learning statistics

Last synced: 06 Nov 2024

https://github.com/cpmachado/a2edr

Solutions to "Introdução à Estatística: Estatística Descritiva em R" (Introduction to Statistics: Descriptive Statistics in R) by Teresa A. Oliveira, Amílcar Oliveira, and Nuno Sousa.

dataset learning r statistics

Last synced: 06 Nov 2024

https://github.com/casaper/swiss_wastewater_covid_virus_load

R Notebook with Swiss COVID wastewater analysis curves for different cities

covid-19 public-health statistics switzerland wastewater-surveillance

Last synced: 02 Oct 2024

https://github.com/bilgehangecici/r-assignments

R projects that are useful for understanding the area of Statistics and Probability.

r statistics

Last synced: 23 Oct 2024

https://github.com/darkdk123/house-valuation-model

A boot camp challenge project to create an ML model that predicts house prices in Boston, Massachusetts, from multiple parameters using multivariable regression.

data-analysis data-science data-visualization matplotlib-pyplot multivariate-regression predictive-modeling statistics

Last synced: 07 Nov 2024

https://github.com/murilobsd/capybara

:neckbeard: Capybara: C DataFrames for geeks who like numbers. :alien:

analytics bigdata c-plus-plus dataframe geeksforgeeks pandas statistics

Last synced: 07 Nov 2024

https://github.com/danilofreire/pols1600

Course repository for POLS 1600 - Political Research Methods (Brown University)

brown-university course-materials political-science pols1600 r statistics

Last synced: 06 Nov 2024

https://github.com/mihaiconstantin/sample-size-workshop

Workshop on Sample Size Planning for Intensive Longitudinal Studies

data-science power-analysis sample-size statistics time-series workshop

Last synced: 06 Nov 2024

https://github.com/educationaltestingservice/schoolgrowth

R package for more accurate school-level aggregate growth measures using Empirical Best Linear Prediction (EBLP)

blp education r statistics

Last synced: 06 Nov 2024

https://github.com/joseguilherme96/mini-projetos

Artificial Intelligence, daily activities, tic tac toe, stopwatch, Statistical Calculator, Web Scraping and ChatGPT.

artificial-intelligence oriented-object-programming statistics stopwatch tic-tac-toe web-scraping

Last synced: 06 Nov 2024

https://github.com/rakibhhridoy/exploratorydataanalysis-python

Exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.

ab-testing chitest data-science eda exploratory-data-analysis ftest hypotheses hypothesis-testing inferential-statistics numpy pandas python statistical-analysis statistics statsmodels ttest

Last synced: 06 Nov 2024

https://github.com/rakibhhridoy/differentprojects

Some of the learning projects I practiced on to get started in data science. Not all of them, only the few that were stored in my local repository. They may be useful for beginner data science enthusiasts. Explore and learn!

data-science deep-learning machine-learning mathematics matplotlib numpy pandas python scikit-learn seaborn statistics

Last synced: 06 Nov 2024

https://github.com/rakibhhridoy/covid19analysisindashboard-tableau

COVID-19 dashboard analysis of the world, North America, and Southeast Asia and their characteristics during the pandemic. The data show some interesting statistics. The rate of increase appears to have a fairly periodic effect on the death and recovery rates. Simulating those changes makes the dashboard more interactive.

covid-19 dashboard data-processing dataviz numpy pandas python statistics tableau tableau-dashboards

Last synced: 06 Nov 2024

https://github.com/rakibhhridoy/bioinformatics-geneticdatascience

This project is based on starting Bioinformatics as a life science student. Initializing a career as a Genetic Data Scientist and Bioinformatician.

bioinformatics biology biopython computer-science data-science genetic-data-science genetics genome-assembly genome-sequencing statistics

Last synced: 06 Nov 2024

https://github.com/hasnocool/hasnocool

An innovative project that updates a README file with dynamic content from GitHub repositories and WakaTime API.

api automation developer dynamic generator github integration language productivity programming python readme statistics wakatime

Last synced: 06 Nov 2024

https://github.com/vi/wilson

Simple Rust library to calculate Wilson confidence interval using the formula from Wikipedia.

confidence-intervals mathematics probability rust statistics wilson

Last synced: 16 Oct 2024
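
For readers unfamiliar with the Wilson confidence interval mentioned in the entry above, here is a minimal Python sketch of the standard Wilson score formula. It is not taken from the listed Rust library; the function name `wilson_interval` and the default `z = 1.96` (roughly a 95% interval) are assumptions chosen for illustration.

```python
import math


def wilson_interval(successes: int, trials: int, z: float = 1.96) -> tuple[float, float]:
    """Return the (lower, upper) Wilson score interval for a binomial proportion.

    `z` is the standard normal quantile; 1.96 is an assumed default that
    corresponds to roughly 95% confidence.
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    if not 0 <= successes <= trials:
        raise ValueError("successes must be between 0 and trials")

    p_hat = successes / trials            # observed proportion
    z2 = z * z
    denom = 1 + z2 / trials               # shared denominator
    center = (p_hat + z2 / (2 * trials)) / denom
    half_width = (
        z * math.sqrt(p_hat * (1 - p_hat) / trials + z2 / (4 * trials * trials)) / denom
    )
    return center - half_width, center + half_width


if __name__ == "__main__":
    low, high = wilson_interval(5, 10)
    print(f"Wilson interval for 5/10: [{low:.4f}, {high:.4f}]")
```

Unlike the simple normal-approximation interval, the Wilson interval stays within [0, 1] and behaves sensibly for small samples or proportions near 0 or 1.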

https://github.com/barabasz/primes

Python class that calculates primes, their statistics, and other related numbers in a specified range

prime-numbers primes statistics

Last synced: 06 Nov 2024

https://github.com/aglebov/sdafe-utils

Utilities for Statistics and Data Analysis for Financial Engineering by Ruppert and Matteson

statistics

Last synced: 06 Nov 2024

https://github.com/mathieucarbou/megatron

Megatron library sends Terracotta statistics and notifications to a management backend through REST, StatsD and other means

cluster clustering ehcache event management monitoring notification statistics statsd terracotta

Last synced: 13 Oct 2024

https://github.com/avoss84/seasonal

R code for the paper 'Forecasting seasonal time series data: a Bayesian model averaging approach'

bayesian-inference forecasting mcmc-sampler monte-carlo-methods seasonality statistics timeseries-analysis

Last synced: 07 Nov 2024

https://github.com/bblodfon/med-stat-solutions

Solutions to homework from the Fundamentals of Biostatistics, B. Rosner, Edition 8

fundamentals-of-biostatistics jupyter-notebook r-notebook rosner solutions statistics

Last synced: 16 Oct 2024

https://github.com/imnotannamaria/ia_statistics_for_devs

Repository focused on learning statistics for working with AI, using pandas.

ia pandas python statistics

Last synced: 07 Nov 2024

https://github.com/mysftz/statistical-analysis

An in-depth review of statistical analysis in Python from datasets.

data-analysis python python3 statistics university university-project

Last synced: 06 Nov 2024

https://github.com/mdequeljoe/statsday

understanding financial accounts display at 2018 statistics day

chord-diagram data-visualization financial-markets statistics

Last synced: 05 Nov 2024

https://github.com/zelosleone/true-random-file-picker

PowerShell utility combining Random.org true randomness, geometric calculations, and mystical algorithms for unconventional file selection

automation background-jobs file-selection-dialog-function file-system geometry logging mystical-computing powershell random-org statistics symbolic-processing windows-scripts

Last synced: 31 Oct 2024

https://github.com/vincentlaucsb/statistical-models

A collection of notes detailing statistical models, including both their theoretical aspects and applications (in R).

logistic-regression statistics

Last synced: 06 Nov 2024

https://github.com/laszlokorte/random-variable

An interactive example of a function applied to a random variable showing the resulting distribution.

random-variables statistics

Last synced: 06 Nov 2024

https://github.com/laszlokorte/hypothesis-test

An interactive visualization that shows how an optimal binary classifier can be derived from two given hypotheses.

hypothesis-testing statistics

Last synced: 06 Nov 2024

https://github.com/laszlokorte/gaussian-estimator

An interactive visualization that shows how the parameters of a bivariate Gaussian distribution can be estimated based on a given set of samples.

estimator gaussian statistics

Last synced: 06 Nov 2024

https://github.com/amyanchen/computational-statistics

Computational statistics projects for statistical inference using R programming

computational-statistics r statistical-computing statistics

Last synced: 07 Nov 2024

https://github.com/amyanchen/sf-airbnb

Exploratory Data Analysis of San Francisco Airbnb listings

data-analysis data-science data-visualization r rmarkdown statistics

Last synced: 07 Nov 2024

https://github.com/eubrunoo/beer-consumption-predictor

An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.

data-analysis data-science data-visualization machine-learning r statistical-analysis statistics

Last synced: 28 Oct 2024

https://github.com/yandexdataschool/ml-sweights-experiments

Experiments for the "Machine Learning on data with sPlot background subtraction" paper

data-analysis high-energy-physics machine-learning statistics

Last synced: 06 Nov 2024

https://github.com/sukuasoft/stat-js

It is a lightweight and easy-to-use library for Node.js that offers basic functions essential for simple statistical analysis.

basic nodejs statistics

Last synced: 14 Oct 2024

https://github.com/sam81/rguide

A guide for the R programming language

bookdown r statistics

Last synced: 14 Oct 2024

https://github.com/vsivsi/bigbinomial

Implements binomial distribution PMF and CDF functions with math/big support

bignumber binomial-distribution cdf golang math pmf statistics

Last synced: 18 Oct 2024

https://github.com/kashyap-prabhat/sigma

A Scala library for probability and statistics formulas, including rules for probability calculations.

data formulas library mathematics probability scala statistics

Last synced: 14 Oct 2024

https://github.com/ivandeex/rlib

helper functions for R

econometrics rlang statistics

Last synced: 28 Oct 2024

https://github.com/psygo/monte-carlo-ts

Monte Carlo (Gaussian) with TypeScript and SolidJS

monte-carlo simulation statistics

Last synced: 07 Nov 2024

https://github.com/cpmachado/r-programming-for-statistics-and-data-science

My source code for the exercises of a Udemy course on statistics

r rlang statistical-learning statistics statistics-course

Last synced: 28 Oct 2024

https://github.com/saketkc/usc-math-505a-screening-solutions

Solutions to USC's MATH-505A Screening Exams

bookdown math rstudio statistics

Last synced: 15 Oct 2024

https://github.com/g4brielvs/etudes

:robot: My collection of études

data-science etudes mathematics pytudes statistics teaching

Last synced: 14 Oct 2024

https://github.com/hokagem/damagedlogginganalyzer

A project analyzing statistics on damaged logging (wood) in Germany using Python.

analysis csv csv-parser k-fold-cross-validation numpy pandas pandas-dataframe pandas-python polynomial-regression scikit-learn statistics wood

Last synced: 20 Oct 2024

https://github.com/erdogant/pypiplot

Python package to count and plot the number of downloads from PyPI.

plot pypi pypi-downloads python statistics

Last synced: 18 Oct 2024

https://github.com/antoinesoetewey/lespo2102

Website dedicated to the LESPO2102 course taught at UCLouvain.

academic course statistics

Last synced: 01 Nov 2024

https://github.com/maybethee/statchasing

Site that provides interesting analytic data for Rocket League players with replays on ballchasing.com

react rocket-league ruby-on-rails statistics

Last synced: 27 Oct 2024

https://github.com/antoinesoetewey/correlogram

Shiny app with an example of correlogram. See https://antoinesoetewey.shinyapps.io/correlogram/

correlogram r shiny statistics

Last synced: 01 Nov 2024

https://github.com/rhagenson/rscripts

Random R scripts and utilities I have created. May be useful for others, but no guarantee

helpers r statistics utilities

Last synced: 14 Oct 2024

https://github.com/foxted/basic-trend

Basic algorithm to detect an increasing, decreasing, or stable trend in a sequence of numbers

arrays sequence statistics trends

Last synced: 17 Oct 2024

https://github.com/dfandrich/testclutch

Test Clutch is a system for tracking and analyzing automated regression test results over multiple continuous integration services.

developer-tools statistics testing

Last synced: 16 Oct 2024

https://github.com/aligusnet/fsstats

Basic Statistics and Some Probability Library

fsharp functional-programming mathematics statistics statistics-library

Last synced: 12 Oct 2024

https://github.com/ngiann/fastparzenwindows.jl

Fast Parzen Windows: a kernel-based method for non-parametric probability density function estimation.

julia kernel-density-estimation statistics

Last synced: 19 Oct 2024

https://github.com/defgsus/bm-wahl-18-jena

Mayoral election in Jena/Thuringia 2018 - data collection and analysis

correlation-coefficient mayor politics statistics vote

Last synced: 23 Oct 2024

https://github.com/takuizum/parallelanalysis.jl

Heuristic methods for assessing approximate unidimensionality of a data matrix.

julia-language julia-package psychology psychometrics statistics

Last synced: 19 Oct 2024

https://github.com/palewire/ipsos-credibility-interval

A Python tool that calculates Bayesian credibility intervals for online polling using the Ipsos method

bayesian data-journalism journalism news polling python statistics

Last synced: 18 Oct 2024

https://github.com/remi-gau/talk_non-sphericity_correction

Scripts and functions to illustrate sphericity correction in fMRI

autocorrelation fmri statistics

Last synced: 01 Nov 2024

https://github.com/adijo/bayesian-inference-hello-world

A minimalistic example of Bayesian inference. We infer the probability of heads in a series of coin flips.

bayesian-inference hello-world machine-learning minimal statistics

Last synced: 15 Oct 2024
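
As an illustration of the kind of inference the entry above describes, here is a minimal Python sketch using a conjugate Beta prior on the probability of heads. This is not the listed repository's code and may differ from its method; the function names, the `"H"`/`"T"` string encoding of flips, and the uniform Beta(1, 1) prior are all assumptions made for the example.

```python
def beta_posterior(flips: str, prior_a: float = 1.0, prior_b: float = 1.0) -> tuple[float, float]:
    """Update a Beta(prior_a, prior_b) prior with observed coin flips.

    `flips` is a string of 'H' (heads) and 'T' (tails); this encoding and
    the uniform Beta(1, 1) default prior are assumptions for illustration.
    Returns the parameters (a, b) of the Beta posterior.
    """
    heads = flips.count("H")
    tails = flips.count("T")
    if heads + tails != len(flips):
        raise ValueError("flips may only contain 'H' and 'T'")
    # Conjugacy: a Beta prior with a binomial likelihood gives a Beta posterior.
    return prior_a + heads, prior_b + tails


def posterior_mean(a: float, b: float) -> float:
    """Mean of a Beta(a, b) distribution: the point estimate of P(heads)."""
    return a / (a + b)


if __name__ == "__main__":
    a, b = beta_posterior("HHTH")
    print(f"Posterior: Beta({a}, {b}), mean P(heads) = {posterior_mean(a, b):.3f}")
```

With few flips the prior pulls the estimate toward 0.5; as more flips are observed, the posterior mean approaches the observed fraction of heads.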

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent Project - Kaggle Dataset: I worked on the California Housing dataset, performing data cleaning and preparation, exploratory data analysis, feature engineering, regression model building, and model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 05 Nov 2024

https://github.com/adijo/ph525-statistics-and-r

Exercises from the Statistics and R course on edX.

inference r statistics

Last synced: 15 Oct 2024

https://github.com/adijo/ucsc-bayesian-stats-2-project

Bayesian Statistics: Techniques and Tools

bayesian bayesian-inference machine-learning statistics

Last synced: 15 Oct 2024

https://github.com/jefedigital/data-science-resources

Open-source books and papers on foundational concepts for AI, ML, Stats etc.

ai machine-learning statistics

Last synced: 14 Oct 2024

https://github.com/olekscode/statisticseconometrics

My solutions to the assignments from Elements of Statistics, Econometrics, and Time Series Analysis course at UCU

course econometrics homework r statistics time-series

Last synced: 31 Oct 2024

https://github.com/erikras/war

Will this game of War ever end??

card-game cardgames probability statistics war

Last synced: 13 Oct 2024

https://github.com/pdwaggoner/mlbstats

Vector-based MLB Player Statistics Calculator

mlb player-level-metrics sabermetrics statistics

Last synced: 14 Oct 2024

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 05 Nov 2024

https://github.com/sahiltiwariiii/dssp

Predicting student math scores! This project uses machine learning techniques and MLOps tools such as DVC and MLflow to predict a student's math score based on factors such as gender, race/ethnicity, parental level of education, lunch type, test preparation course, writing, etc.

docker dotenv dvc flask machine-learning mlflow mlops mysql mysql-connector-python numpy pandas pymysql python python-dotenv scikit-learn seaborn sklearn-library statistics streamlit

Last synced: 08 Nov 2024

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/barbaracalderon/ine5649-tecnicas-estatistica-predicao

My repository for the contents developed in R for class INE5649 - "Técnicas de Estatística e Predição"

r rstudio statistics

Last synced: 14 Oct 2024

https://github.com/drkane/area-profiles

Produce UK area profiles based on various data sources

dash-plotly data flask statistics uk

Last synced: 24 Oct 2024

https://github.com/mwoss/parlamentstatistics

Small app providing statistics about the Polish parliament.

gson java json statistics

Last synced: 13 Oct 2024

https://github.com/esirotkin/statistics-subsystem

Subsystem for monitoring and collecting statistics of event-driven Java applications

java monitoring statistics zabbix

Last synced: 15 Oct 2024

https://github.com/xstupi00/Theoretical-Assignments

Projects completed for theoretical assignments during the master's degree.

automata complexity information-security markov-chain petri-nets statistics storm vut vut-fit

Last synced: 23 Oct 2024

https://github.com/chmue/vam-bayes-intro

A brief introduction to Bayesian Statistics for Very Applied Methods

bayes introduction statistics

Last synced: 14 Oct 2024

https://github.com/andreypomortsev/statistical-list.pop-performance-evaluation

This repository contains a performance comparison of list.pop() versus list.pop(-2) in Python. The project involves measuring execution times of these list operations, performing statistical tests to evaluate the significance of differences, and visualizing the results using histograms and box plots.

jupyter-notebook scipy statistics visualization

Last synced: 15 Oct 2024

https://github.com/nakshjainsonigara/football-eda

This EDA offers a comprehensive exploration of football analytics, leveraging Python libraries to analyze players, club games, and various other aspects of the sport. Through statistical analysis and visualization, we uncover insights into player performance, club strategies, and league dynamics providing valuable insights for football enthusiasts.

dash exploratory-data-analysis matplotlib numpy pandas plotly python scipy seaborn statistics

Last synced: 15 Oct 2024

https://github.com/pblischak/zig-ndarray

N-Dimensional Arrays in Zig

data-science ndarray statistics ziglang

Last synced: 15 Oct 2024

https://github.com/thomasx-0/env223

Research project measuring the effects of an increase in beef consumption on degradation of the Amazon rainforest

environment project r statistics

Last synced: 28 Oct 2024

https://github.com/tbouron/ha-agur

Home Assistant integration for Agur https://ael.agur.fr

custom-integration hacs hacs-integration history home-assistant integration sensors statistics water

Last synced: 15 Oct 2024