Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/GSK-Biostatistics/tfrmt

r package for formatting tables
https://github.com/GSK-Biostatistics/tfrmt

Last synced: 8 days ago
JSON representation

r package for formatting tables

Awesome Lists containing this project

README

        

---
output: github_document
---

```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>"
)
```

```{r setup, echo=FALSE, message = FALSE, warning = FALSE}
library(tfrmt)
library(gt)
```

# tfrmt tfrmt logo

[![R-CMD-check](https://github.com/GSK-Biostatistics/tfrmt/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/GSK-Biostatistics/tfrmt/actions/workflows/R-CMD-check.yaml)
[![Codecov test coverage](https://codecov.io/gh/GSK-Biostatistics/tfrmt/branch/main/graph/badge.svg)](https://app.codecov.io/gh/GSK-Biostatistics/tfrmt?branch=main)
[![status: experimental](https://github.com/GIScience/badges/raw/master/status/experimental.svg)](https://github.com/GIScience/badges#experimental)
[![CRAN status](https://www.r-pkg.org/badges/version/tfrmt)](https://CRAN.R-project.org/package=tfrmt)

The tfrmt package provides a language for defining display-related metadata, which can then be used to automate and easily update output formats.

In clinical trials, displays are generally quite standard, but frequent, highly specific formatting tweaks (e.g., rounding, footnotes, headers) are very common. Prior to data analysis, study teams often generate mock displays to represent the desired end product for sponsors to approve or programmers to replicate. This process is typically highly manual and separate from the programming itself. There is also a high importance placed on verifying the accuracy of the results via a QC (Quality Control) process such as double programming. Finally, there is a movement toward an industry standard data structure for Analysis Results Data "ARD", which means analysis results datasets will have consistent structures and column names. Specifically, the ARD is long, with 1 record per computed value. For more information about ARDs click [here](https://pharmasug.org/download/sde/rtp2021/PharmaSUG-NCSDE_2021-08.pdf).

tfrmt supports a vision where:

- Mock displays are integrated with the programming workflow
- Results are QC'ed prior to formatting to reduce rework
- Standard formatting styles can be applied in as little as one line of code
- The ARD structure can be leveraged to accommodate a variety of tables

By reducing the amount of repetitive tasks, study teams can focus on the quality and interpretation of the results themselves.

# Why tfrmt?

While there are many existing table-making packages in the R ecosystem, they typically fall into one of two categories:

- Table packages that perform analyses and format the results
- Table packages that format and output existing data

By design, tfrmt is more of the latter, as it is intended to be used after the results have been computed. What makes tfrmt unique, however, is that it offers an intuitive interface for defining and layering standard or custom formats that are often specific to clinical trials. It also offers the novel ability to easily generate mock displays using metadata that will be used for the actual displays. tfrmt is built on top of the powerful gt package, which is intended to support a variety of output formats in the future.

# Installation

The development version of tfrmt can be installed with:

```{r eval =FALSE}
devtools::install_github("GSK-Biostatistics/tfrmt")
```

# Input data structure

We expect an input dataset that is long, with 1 record per computed value. Required columns include:

- [Optional] 1 or more **group** columns, containing grouping values
- A single **label** column, containing row label values
- 1 or more **column** columns, containing column values
- A single **param** column, which provides a label for distinct types of values
- A single **value** column, containing the computed, raw data values
- [Optional] 1 or more **sorting_cols** columns, containing numeric values to be used in the row ordering

# Functionality

Here is an overview of what is possible with tfrmt:

- Create a "tfrmt" metadata object containing all formatting and labelling for the display
- Create mock displays based on existing sample data or no prior data
- ARD-standard compliant facilitates reuse and automation

Other benefits of tfrmt:

- Provides a tidyverse-friendly, pipeable interface
- Leverages gt as output engine, which allows for further customizations within gt itself

# More Info

For more information about how to build your own tfrmt mocks/tables (like the one below!), please explore the [vignettes](https://gsk-biostatistics.github.io/tfrmt/articles/examples.html).

```{r echo = FALSE, message = FALSE, warning = FALSE}
library(dplyr)
data_demog <- data_demog %>%
select(-value)
tfrmt <- tfrmt(
# specify columns in the data
group = vars(rowlbl1,grp),
label = rowlbl2,
column = column,
param = param,
value = value,
sorting_cols = vars(ord1, ord2),

# Specify body plan
body_plan = body_plan(
frmt_structure(group_val = ".default", label_val = ".default", frmt_combine("{n} {pct}",
n = frmt("xxx"),
pct = frmt_when("==100" ~ "",
"==0" ~ "",
TRUE ~ frmt("(xx.x %)")))),
frmt_structure(group_val = ".default", label_val = "n", frmt("xxx")),
frmt_structure(group_val = ".default", label_val = c("Mean", "Median", "Min","Max"), frmt("xxx.x")),
frmt_structure(group_val = ".default", label_val = "SD", frmt("xxx.xx")),
frmt_structure(group_val = ".default", label_val = ".default", p = frmt("")),
frmt_structure(group_val = ".default", label_val = c("n","<65 yrs","<12 months","<25"), p = frmt_when(">0.99" ~ ">0.99",
"<0.001" ~ "<0.001",
TRUE ~ frmt("x.xxx", missing = "")))
),

# Specify row group plan
row_grp_plan = row_grp_plan(
row_grp_structure(group_val = ".default", element_block(post_space = " ")),
label_loc = element_row_grp_loc(location = "column")
),

# Specify column style plan
col_style = col_style_plan(
col_style_structure(align = c(".",","," "), col = vars(everything()))
),

# remove extra cols
col_plan = col_plan(Placebo,
starts_with("Xanom"),
Total,
-grp,
-starts_with("ord")
)
)

gt_out <- print_mock_gt(tfrmt, data_demog)
invisible(gtsave(gt_out, "man/figures/gt_readme.png", vwidth = 999))
```

![Example GT Demography Table](man/figures/gt_readme.png)

# Recorded Presentations and Shared Materials
[R/Pharma 2022 Day 1: Christina Fillmore - Why do I spend all my life formatting tables?](https://www.youtube.com/watch?v=00lGhuANUJw)

[R/Pharma 2023: Becca Krouse - Everyone's Invited: A Case Study on Bridging the Usability Gap](https://www.youtube.com/watch?v=Zg1LPJSO0kQ)

[R/Pharma 2022 Workshop: Christina Fillmore, Ellis Hughes, Thomas Neitmann - Clinical Reporting in R (Day 2)](https://www.youtube.com/watch?v=rYflZhFDSZQ)

[R/Pharma 2023 Workshop: Thomas Neitmann, Pawel Rucki, Ellis Hughes - Leveraging and contributing to the the pharmaverse for clinical trial reporting in R](https://github.com/posit-conf-2023/r-pharma)