https://github.com/bgreenwell/investr

Inverse estimation in R
https://github.com/bgreenwell/investr

calibration inverse-estimation inverse-prediction r regression

Last synced: about 1 month ago
JSON representation

Inverse estimation in R

Host: GitHub
URL: https://github.com/bgreenwell/investr
Owner: bgreenwell
Created: 2013-08-08T01:22:43.000Z (almost 12 years ago)
Default Branch: master
Last Pushed: 2022-12-27T08:39:46.000Z (over 2 years ago)
Last Synced: 2025-04-14T16:12:42.199Z (about 1 month ago)
Topics: calibration, inverse-estimation, inverse-prediction, r, regression
Language: R
Size: 2.92 MB
Stars: 23
Watchers: 4
Forks: 6
Open Issues: 18
Metadata Files:
- Readme: README.Rmd

Awesome Lists containing this project

README

        ---

output:

  md_document:

    variant: markdown_github

---

```{r, echo = FALSE}

knitr::opts_chunk$set(

  collapse = TRUE,

  comment = "#>",

  fig.align = "center",

  fig.path = "man/figures/README-"

)

```

[![CRAN status](https://www.r-pkg.org/badges/version/investr)](https://CRAN.R-project.org/package=investr)

[![Total Downloads](https://cranlogs.r-pkg.org/badges/grand-total/investr)](https://cranlogs.r-pkg.org/badges/grand-total/investr)

# investr: Inverse Estimation in R

Inverse estimation, also referred to as the calibration problem, is a classical and well-known problem in regression. In simple terms, it involves the use of an observed value of the response (or specified value of the mean response) to make inference on the corresponding unknown value of an explanatory variable. 

A detailed introduction to investr has been published in The R Journal: ["investr: An R Package for Inverse Estimation"](https://journal.r-project.org/archive/2014/RJ-2014-009/index.html). You can track development at https://github.com/bgreenwell/investr. To report bugs or issues, contact the main author directly or submit them to https://github.com/bgreenwell/investr/issues. 

As of right now, `investr` supports (univariate) inverse estimation with objects of class:

* `"lm"` - linear models (multiple predictor variables allowed)

* `"glm"` - generalized linear models (multiple predictor variables allowed)

* `"nls"` - nonlinear least-squares models

* `"lme"` - linear mixed-effects models (fit using the `nlme` package)

## Installation

The package is [currently listed on CRAN](https://cran.r-project.org/package=investr) and can easily be installed:

```{r, eval=FALSE}

# Install from CRAN

install.packages("investr", dep = TRUE)

  

# Alternatively, install the development version from GitHub

devtools::install_github("bgreenwell/investr")

```

The package is also part of the [ChemPhys task view](https://cran.r-project.org/view=ChemPhys), a collection of R packages useful for analyzing data from chemistry and physics experiments. These packages can all be installed at once (including `investr`) using the `ctv` package (Zeileis, 2005):

```{r, eval=FALSE}

# Install the ChemPhys task view

install.packages("ctv")

ctv::install.views("ChemPhys")

```

## Examples

### Dobson's Beetle Data

In binomial regression, the estimated lethal dose corresponding to a specific probability _p_ of death is often referred to as _LDp_. `invest` obtains an estimate of _LDp_ by inverting the fitted mean response on the link scale. Similarly, a confidence interval for _LDp_ can be obtained by inverting a confidence interval for the mean response on the link scale.

```{r glm-beetle-invest}

# Load required packages

library(investr)

# Binomial regression

beetle.glm <- glm(cbind(y, n-y) ~ ldose, data = beetle, 

                  family = binomial(link = "cloglog"))

plotFit(beetle.glm, lwd.fit = 2, cex = 1.2, pch = 21, bg = "lightskyblue", 

        lwd = 2, xlab = "Log dose", ylab = "Probability")

# Median lethal dose

invest(beetle.glm, y0 = 0.5)   

# 90% lethal dose

invest(beetle.glm, y0 = 0.9)   

# 99% lethal dose

invest(beetle.glm, y0 = 0.99)  

```

To obtain an estimate of the standard error, we can use the Wald method:

```{r glm-beetle-invest-se}

invest(beetle.glm, y0 = 0.5, interval = "Wald")

# The MASS package function dose.p can be used too 

MASS::dose.p(beetle.glm, p = 0.5)

```

### Including a factor variable

Multiple predictor variables are allowed for objects of class `lm` and `glm`. 

For instance, the example from `?MASS::dose.p` can be re-created as follows:

```{r glm-beetle-investr-vs-MASS}

# Load required packages

library(MASS)

# Data

ldose <- rep(0:5, 2)

numdead <- c(1, 4, 9, 13, 18, 20, 0, 2, 6, 10, 12, 16)

sex <- factor(rep(c("M", "F"), c(6, 6)))

SF <- cbind(numdead, numalive = 20 - numdead)

budworm <- data.frame(ldose, numdead, sex, SF)

# Logistic regression

budworm.glm <- glm(SF ~ sex + ldose - 1, family = binomial, data = budworm)

# Using dose.p function from package MASS

dose.p(budworm.glm, cf = c(1, 3), p = 1/4)

# Using invest function from package investr

invest(budworm.glm, y0 = 1/4, 

       interval = "Wald",

       x0.name = "ldose", 

       newdata = data.frame(sex = "F"))

```

### Bioassay on Nasturtium

The data here contain the actual concentrations of an agrochemical present in soil samples versus the weight of the plant after three weeks of growth. These data are stored in the data frame `nasturtium` and are loaded with the package. A simple

log-logistic model describes the data well:

```{r nls-nasturtium-fit}

# Log-logistic model for the nasturtium data

nas.nls <- nls(weight ~ theta1/(1 + exp(theta2 + theta3 * log(conc))),

               start = list(theta1 = 1000, theta2 = -1, theta3 = 1),

               data = nasturtium)

               

# Plot the fitted model

plotFit(nas.nls, lwd.fit = 2)

```

Three new replicates of the response (309, 296, 419) at an unknown concentration of interest ($x_0$) are measured. It is desired to estimate $x_0$.

```{r nls-nasturtium-invest}  

# Inversion method

invest(nas.nls, y0 = c(309, 296, 419), interval = "inversion")

# Wald method

invest(nas.nls, y0 = c(309, 296, 419), interval = "Wald")  

```

The intervals both rely on large sample results and normality. In practice, the bootstrap may be more reliable:

```{r nls-nasturtium-bootstrap}

# Bootstrap calibration intervals (may take a few seconds)

boo <- invest(nas.nls, y0 = c(309, 296, 419), interval = "percentile", 

              nsim = 9999, seed = 101)

boo  # print bootstrap summary

plot(boo)  # plot results

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/bgreenwell/investr

Awesome Lists containing this project

README