https://github.com/crsh/jab

R package to automagically compute Jeffrey's Approximate Bayes factors
https://github.com/crsh/jab

bayes-factor bayesian-inference r statistics

Last synced: over 1 year ago
JSON representation

R package to automagically compute Jeffrey's Approximate Bayes factors

Host: GitHub
URL: https://github.com/crsh/jab
Owner: crsh
License: other
Created: 2023-04-25T12:40:10.000Z (about 3 years ago)
Default Branch: main
Last Pushed: 2025-02-14T08:33:56.000Z (over 1 year ago)
Last Synced: 2025-02-14T09:37:02.391Z (over 1 year ago)
Topics: bayes-factor, bayesian-inference, r, statistics
Language: R
Homepage:
Size: 896 KB
Stars: 2
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.Rmd
- Changelog: NEWS.md
- License: LICENSE
- Code of conduct: .github/CODE_OF_CONDUCT.md

Awesome Lists containing this project

README

          ---

output: github_document

references: "inst/references.bib"

---

```{r, include = FALSE}

knitr::opts_chunk$set(

  collapse = TRUE,

  comment = "#>",

  fig.path = "man/figures/README-",

  out.width = "100%"

)

```

# jab: Automagic computation of Jeffrey's approxiamte Bayes factors

[![CRAN status](https://www.r-pkg.org/badges/version/jab)](https://CRAN.R-project.org/package=jab)

[![Lifecycle: experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://lifecycle.r-lib.org/articles/stages.html#experimental)

The goal of **jab** is to conveniently calculate Jeffrey's approximate Bayes factor (JAB; [Wagenmakers, 2022](https://psyarxiv.com/egydq)) for a wide variety of statistical analyses.

## Installation

You can install the development version of **jab** like so:

``` r

remotes::install_github("crsh/jab")

```

## Example

**jab** automatically supports calculation of JAB for any analysis that outputs a [Wald test](https://en.wikipedia.org/wiki/Wald_test) and for which [**broom**](https://github.com/tidymodels/broom/) returns an estimate and a standard error.

The user additionally needs to specify a prior distribution for estimate in the scale used to calculate the Wald statistic.

Take the example of standard linear regression.

JAB can be easily calculated for all regression coefficients.

We simply submit the results from the orthodox frequentist analysis to `jab()` and specify a prior distribution---let's use a scaled central Cauchy distribution.

Note that JAB gives evidence for the null hypothesis relative to the alternative.

```{r lm-example}

library("jab")

library("ggplot2")

# Fit regression model

data(attitude)

attitude_z <- data.frame(scale(attitude))

attitude_lm <- lm(rating ~ 0 + ., data = attitude_z)

attitude_tidy_lm <- broom::tidy(attitude_lm)

attitude_tidy_lm

# Specify prior distribution and approximate Bayes factor

attitude_jab <- jab(

  attitude_lm

  , prior = dcauchy

  , location = 0

  , scale = sqrt(2) / 4

)

attitude_jab

```

Now compare this with the Jeffreys-Zellner-Siow (JZS) Bayes factor from `BayesFactor::regressionBF()` with the same prior distribution.

```{r lm-example-jzs}

# Calculate JZS-Bayes factor

attitude_jzs <- BayesFactor::regressionBF(

  rating ~ .

  , data = attitude

  , rscaleCont = sqrt(2) / 4

  , whichModels = "top"

  , progress = FALSE

)

# Compare results

tibble::tibble(

  predictor = attitude_tidy_lm$term

  

  # Frequentist p-values

  , p = attitude_tidy_lm$p.value

  

  # Bayes factors in favor of the null hypothesis

  , jab = attitude_jab

  , jzs = rev(as.vector(attitude_jzs))

  

  # Naive posterior probabilities

  , jab_pp = jab / (jab + 1)

  , jzs_pp = jzs / (jzs + 1)

)

```

Pretty close!

### Varying prior distributions

To vary the scale of the prior distribution, simply pass a vector of scaling parameters, one scale for each coefficient.

```{r lm-example-vary-priors}

jab(

  attitude_lm

  , prior = dcauchy

  , location = 0

  , scale = c(rep(0.5, 3), rep(sqrt(2) / 4, 3))

)

```

### Prior sensitivity

Similarly, performing a prior sensitivity analysis is straight forward and fast.

```{r lm-example-prior-sensitivity}

# Specify design

jab_sensitivity <- expand.grid(

  coef = names(coef(attitude_lm))

  , r = seq(0.2, 1.5, length.out = 50)

) |>

  # Calculate Bayes factors for each prior setting

  dplyr::group_by(r) |>

  dplyr::mutate(

    jab = jab(

      attitude_lm

      , prior = dcauchy

      , location = 0

      , scale = r

    )

  )

# Plot results

ggplot(jab_sensitivity) +

  aes(x = r, y = jab / (1 + jab), color = coef) +

  geom_hline(

    yintercept = 0.5

    , linetype = "22"

    , color = grey(0.7)

  ) +

  geom_line(linewidth = 1.5) +

  scale_color_viridis_d() +

  lims(y = c(0, 1)) +

  labs(

    x = bquote(italic(r))

    , y = "Naive posterior probability"

    , color = "Coefficient"

  ) +

  papaja::theme_apa(box = TRUE)

```

### Sequential analyses

Sequential analyses are also a breeze.

```{r lm-example-sequential-analysis}

# Specify design

sequential_jab <- expand.grid(

  coef = names(coef(attitude_lm))

  , n = 10:nrow(attitude_z)

) |>

  # Calculate Bayes factors for each subsample

  dplyr::group_by(n) |>

  dplyr::mutate(

    jab = jab(

      update(attitude_lm, data = attitude_z[1:unique(n), ])

      , dcauchy

      , location = 0

      , scale = sqrt(2) / 4

    )

    , jab_pp = jab / (jab + 1)

  )

# Plot results

ggplot(sequential_jab) +

  aes(x = n, y = jab_pp, color = coef) +

  geom_line(linewidth = 1.5) +

  scale_color_viridis_d() +

  lims(y = c(0, 1)) +

  labs(

    x = bquote(italic(n))

    , y = "Naive posterior probability"

    , color = "Coefficient"

  ) +

  papaja::theme_apa(box = TRUE)

```

## What's in a p-value?

By calculating JAB from p-values, we can explore approximately how much evidence a p-value provides for the alternative (or null) hypothesis for a given sample size.

Here I use the precise piecewise approximation suggested by [Wagenmakers (2022)](https://psyarxiv.com/egydq), Eq. 9.

Note that both axes are on a log-scale.

```{r evidence-in-p}

library("geomtextpath")

p_boundaries <- c(0.0001, 0.001, 0.01, 0.05, 0.1, 1)

dat <- expand.grid(

  p = exp(seq(log(0.00005), log(1), length.out = 100))

  , n = exp(seq(log(3), log(10000), length.out = 100))

) |>

  transform(jab_p = 1 / jab::jab_p(p, n))

evidence_labels <- data.frame(

  n = c(17, 50, 75, 150, 350, 800, 2000, 4800)

  , p = c(0.0002, 0.0009, 0.00225, 0.005, 0.019, 0.065, 0.175, 0.45)

  , label = c("Extreme", "Very strong", "Strong", "Moderate", "Anecdotal", "Moderate", "Strong", "Very strong")

  , angle = -c(17, 17, 17, 17, 17, 18, 21, 24) + 3

)

plot_settings <- list(

  scale_x_continuous(

    expand = expansion(0, 0)

    , breaks = c(5, 10, 20, 50, 100, 250, 500, 1000, 2500, 5000, 10000)

    , trans = "log"

    , name = bquote("Sample size" ~ italic(n))

  )

  , scale_y_continuous(

    expand = expansion(0, 0)

    , breaks = p_boundaries

    , labels = format(p_boundaries, scientific = FALSE, drop0trailing = TRUE)

    , trans = "log"

    , name = bquote(italic(p)*"-value")

  )

  , scale_fill_viridis_c(guide = "none")

  , theme_minimal(base_size = 16)

  , theme(

    axis.ticks.length = unit(5, "pt")

    , axis.ticks.x = element_line()

    , axis.ticks.y = element_line()

    , plot.margin = margin(0.5, 0.5, 0.5, 0.5, "cm")

    , axis.title.x = element_text(margin = margin(t = 0.1, unit = "cm"))

    , axis.title.y = element_text(margin = margin(r = -0, unit = "cm"))

    , axis.text.x = element_text(angle = 30, hjust = 1)

  )

)

to_reciprocal <- function(x) {

  ifelse(

    x > 1

    , as.character(round(x))

    , paste0("1/", round(1/x))

  )

}

 

ggplot(dat) +

  aes(x = n, y = p) +

  geom_raster(aes(fill = log(jab_p)), interpolate = TRUE) +

  # geom_hline(yintercept = p_boundaries, color = "white", alpha = 0.2) +

  geom_textcontour(

    aes(z = jab_p, label = to_reciprocal(after_stat(level)))

    , color = "white"

    , breaks = c(1/30, 1/10, 1/3, 3, 10, 30, 100)

  ) +

  geom_text(

    aes(x = n, y = p, label = label, angle = angle)

    , data = evidence_labels

    , color = "white"

    , fontface = "bold"

    , size = 5

  ) +

  plot_settings

```

```{r dep-plot, echo = FALSE, fig.width = 7, fig.height = 7, message = FALSE, warning = FALSE, eval = FALSE}

depgraph::plot_dependency_graph()

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/crsh/jab

Awesome Lists containing this project

README