https://github.com/bam-tools/rbam

An R user interface to BaM
https://github.com/bam-tools/rbam
bayesian-inference modeling r statistics uncertainty
Last synced: 8 months ago
JSON representation
An R user interface to BaM
Host: GitHub
URL: https://github.com/bam-tools/rbam
Owner: BaM-tools
Created: 2021-05-18T09:54:38.000Z (about 5 years ago)
Default Branch: main
Last Pushed: 2025-09-30T18:15:37.000Z (9 months ago)
Last Synced: 2025-10-22T03:47:54.566Z (8 months ago)
Topics: bayesian-inference, modeling, r, statistics, uncertainty
Language: R
Homepage:
Size: 9.55 MB
Stars: 1
Watchers: 1
Forks: 2
Open Issues: 2
Metadata Files:
- Readme: README.Rmd
- Changelog: NEWS.md
Awesome Lists containing this project

README

          ---

output: github_document

---

```{r setup, include=FALSE}

knitr::opts_chunk$set(echo=TRUE,results='hide',

                      fig.path = "man/readme/README-")

```

# RBaM - An R user interface to BaM 

[![DOI](https://zenodo.org/badge/368480115.svg)](https://doi.org/10.5281/zenodo.15856646)

## Introduction

BaM (Bayesian Modeling) is a framework to estimate a model using Bayesian inference. The R package `RBaM` is built as an R User Interface to the [computational BaM engine](https://github.com/BaM-tools/BaM). It defines classes (objects' properties and methods) for the various building blocks of a BaM case study. Its typical usage is as follows:

1. Assemble the dataset of input/ouput variables.

2. Define the model and set prior distributions for its parameters.

3. Perform Bayesian-MCMC inference.

4. Perform predictions.

## Installation

You can install the latest stable version from [CRAN](https://CRAN.R-project.org/package=RBaM) [recommended] with:

```{r message=FALSE, eval=FALSE}

install.packages('RBaM')

```

or the development version from [Github](https://github.com/BaM-tools/RBaM) [may be unstable] with:

```{r message=FALSE, eval=FALSE}

devtools::install_github('BaM-tools/RBaM')

```

The package can then be loaded with:

```{r message=FALSE}

library(RBaM) 

```

If this is the first time you're using `RBaM`, you need to link it with the `BaM` executable. There are two ways of doing this:

1. [recommended] Download the latest stable version of `BaM` executable as follows:

```{r message=FALSE, eval=FALSE}

downloadBaM(destination_folder_on_your_computer)

```

2. If you already have a version of `BaM` executable on your computer, you may link it with `RBaM` as follows:

```{r message=FALSE, eval=FALSE}

setPathToBaM(folder_containing_BaM_on_your_computer)

```

All set! The catalog of distributions and models that are available in RBaM can be accessed as follows:

```{r results='markup'}

getCatalogue()

```

## An example using the BaRatin rating curve model

A [rating curve](https://en.wikipedia.org/wiki/Rating_curve) is a model linking the water stage $H$ measured at a given point of a river and the discharge $Q$ flowing through it. The elementary rating curve equation has the power-law form Q=a(H-b)^c, where a, b and c are parameters. Since different hydraulic controls may succeed to each other as the water stage increases, it is frequent to use a piecewise combination of these elementary equations, for instance: Q=a₁(H-b₁)^c₁ for k₁ < H < k₂; Q=a₂(H-b₂)^c₂ for H > k₂, with k₁ and k₂ the activation stages of the first and the second control.

We refer to [this article](https://hal.science/hal-00934237) for additional details.

This example shows how to estimate a rating curve using a set of (H,Q) calibration data (called 'gaugings') from the 

[Ardèche river](https://en.wikipedia.org/wiki/Ardèche_(river)) at the Sauze-Saint-Martin hydrometric station. The first thing to do is to define the workspace, i.e. the folder where configuration and result files will be written.

```{r}

workspace=file.path(tempdir(),'BaM_workspace')

```

The second step is to define the calibration data. The dataset `SauzeGaugings` is provided with this package, and the code below let RBaM know what to use as inputs/outputs. 

```{r}

# Define the calibration dataset by specifying 

# inputs (X), outputs (Y) and uncertainty on the outputs (Yu).

# A copy of this dataset will be written in data.dir.

D=dataset(X=SauzeGaugings['H'],Y=SauzeGaugings['Q'],Yu=SauzeGaugings['uQ'],data.dir=workspace)

```

The third step is to define the rating curve model. The code below specifies the priors on parameters, the control matrix and creates the model object.

```{r}

# Parameters of the low flow section control: activation stage k, coefficient a and exponent c

k1=parameter(name='k1',init=-0.5,prior.dist='Uniform',prior.par=c(-1.5,0))

a1=parameter(name='a1',init=50,prior.dist='LogNormal',prior.par=c(log(50),1))

c1=parameter(name='c1',init=1.5,prior.dist='Gaussian',prior.par=c(1.5,0.05))

# Parameters of the high flow channel control: activation stage k, coefficient a and exponent c

k2=parameter(name='k2',init=1,prior.dist='Gaussian',prior.par=c(1,1))

a2=parameter(name='a2',init=100,prior.dist='LogNormal',prior.par=c(log(100),1))

c2=parameter(name='c2',init=1.67,prior.dist='Gaussian',prior.par=c(1.67,0.05))

# Define control matrix: columns are controls, rows are stage ranges.

# Here the matrix means that the first control only is active for the first stage range,

# and the second control only is active for the second stage range.

controlMatrix=rbind(c(1,0),c(0,1))

# Stitch it all together into a model object

M=model(ID='BaRatin',

        nX=1,nY=1, # number of input/output variables

        par=list(k1,a1,c1,k2,a2,c2), # list of model parameters

        xtra=xtraModelInfo(object=controlMatrix)) # use xtraModelInfo() to pass the control matrix

```

All set! The function BaM can now be called to estimate parameters. This will run a MCMC sampler and save the result into the workspace.

```{r results='markup'}

BaM(workspace=workspace,mod=M,data=D)

```

MCMC samples can now be read. There are 2 MCMC files: 'Results_MCMC.txt' contains raw MCMC simulations and is hence quite big. 'Results_Cooking.txt' contains 'cooked' simulations, i.e. after 'burning' the first part of iterations and 'slicing' what remains, and may be favored in most cases. Functions `mcmcOptions` and `mcmcCooking` allow controlling MCMC properties.

```{r results='markup'}

# Read 'cooked' MCMC file in the workspace

MCMC=readMCMC(file.path(workspace,'Results_Cooking.txt'))

head(MCMC)

```

A few functions are provided with the package to explore MCMC samples.

```{r tracePlot, fig.height=6, fig.width=9}

# Trace plot for each parameter, useful to assess convergence.

plots=tracePlot(MCMC)

gridExtra::grid.arrange(grobs=plots,ncol=3)

```

```{r densityPlot, fig.height=6, fig.width=9}

# Density plot for each parameter

plots=densityPlot(MCMC)

gridExtra::grid.arrange(grobs=plots,ncol=3)

```

```{r violinPlot, fig.height=3, fig.width=6}

# Violon plot, useful to compare 'comparable' parameters.

violinPlot(MCMC[c('c1','c2')])

```

Finally, the estimated rating curve model can be used to make predictions as shown below.

```{r}

# Define the grid of stage values on which the rating curve will be computed

hgrid=data.frame(H=seq(-1,7,0.1))

# Define a 'prediction' object for total predictive uncertainty

totalU=prediction(X=hgrid, # stage values

                  spagFiles='totalU.spag', # file where predictions are saved

                  data.dir=workspace, # a copy of data files will be saved here

                  doParametric=TRUE, # propagate parametric uncertainty, i.e. MCMC samples?

                  doStructural=TRUE) # propagate structural uncertainty ?

# Define a 'prediction' object for parametric uncertainty only - not the doStructural=FALSE

paramU=prediction(X=hgrid,spagFiles='paramU.spag',data.dir=workspace,

                  doParametric=TRUE,doStructural=FALSE)

# Define a 'prediction' object with no uncertainty - this corresponds to the 'maxpost' rating curve maximizing the posterior

maxpost=prediction(X=hgrid,spagFiles='maxpost.spag',data.dir=workspace,

                  doParametric=FALSE,doStructural=FALSE)

# Re-run BaM, but in prediction mode

BaM(workspace=workspace,mod=M,data=D, # model and data

    pred=list(totalU,paramU,maxpost), # list of predictions

    doCalib=FALSE,doPred=TRUE) # Do not re-calibrate but do predictions

```

The resulting 'spaghetti files' can be read into the workspace and plotted. 

```{r RC_spag, fig.height=6, fig.width=9}

# Plot spaghetti representing total uncertainty in red

Q=read.table(file.path(workspace,'totalU.spag'))

matplot(hgrid$H,Q,col='red',type='l',lty=1)

# Add spaghetti representing parametric uncertainty in pink

Q=read.table(file.path(workspace,'paramU.spag'))

matplot(hgrid$H,Q,col='pink',type='l',lty=1,add=TRUE)

# Add maxpost rating curve

Q=read.table(file.path(workspace,'maxpost.spag'))

matplot(hgrid$H,Q,col='black',type='l',lty=1,add=TRUE)

```

Spaghetti are the raw outputs of the predictions, but it is often more convenient to plot probability intervals only. These have been computed automatically by BaM, and are saved in files with an '.env' (like 'envelop') extension.

```{r RC_env, fig.height=6, fig.width=9}

Q=read.table(file.path(workspace,'totalU.env'),header=T)

matplot(hgrid$H,Q[,2:3],col='red',type='l',lty=1)

Q=read.table(file.path(workspace,'paramU.env'),header=T)

matplot(hgrid$H,Q[,2:3],col='pink',type='l',lty=1,add=TRUE)

Q=read.table(file.path(workspace,'maxpost.spag'))

matplot(hgrid$H,Q,col='black',type='l',lty=1,add=TRUE)

```
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/bam-tools/rbam

Awesome Lists containing this project

README