An open API service indexing awesome lists of open source software.

https://github.com/uncoast-unconf/steward

Helps you manage metadata for published datasets
https://github.com/uncoast-unconf/steward

Last synced: 3 months ago
JSON representation

Helps you manage metadata for published datasets

Awesome Lists containing this project

README

        

---
output: github_document
---

```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = FALSE,
comment = "",
fig.path = "man/figures/README-",
out.width = "100%"
)

# https://community.rstudio.com/t/showing-only-the-first-few-lines-of-the-results-of-a-code-chunk/6963/2
#
hook_output <- knitr::knit_hooks$get("output")
knitr::knit_hooks$set(
output = function(x, options) {
lines <- options$output.lines
if (is.null(lines)) {
return(hook_output(x, options)) # pass to default hook
}
x <- unlist(strsplit(x, "\n"))
more <- "..."
if (length(lines)==1) { # first n lines
if (length(x) > lines) {
# truncate the output, but add ....
x <- c(head(x, lines), more)
}
} else {
x <- c(more, x[lines], more)
}
# paste these lines together
x <- paste(c(x, ""), collapse = "\n")
hook_output(x, options)
}
)
```

# steward

[![Lifecycle: maturing](https://img.shields.io/badge/lifecycle-maturing-blue.svg)](https://www.tidyverse.org/lifecycle/#maturing)
[![R build status](https://github.com/uncoast-unconf/steward/workflows/R-CMD-check/badge.svg)](https://github.com/uncoast-unconf/steward)

The goal of **steward** is to make it easier to import, manage, and publish the metadata associated with data frames. This might be useful for:

- [documenting a dataset](https://r-pkgs.org/data.html#documenting-data) for a package
- publishing a data-dictionary [table](https://gt.rstudio.com) in an R Markdown document

Our current definition of metadata includes the name, description and source of a dataset, as well as the name, type, and description of each of the variables in the dataset.

The name, steward, is an homage to the [Data Stewardship](http://agron590-isu.github.io/) class taught by [Andee Kaplan](https://github.com/andeek) and [Ranae Dietzel](https://github.com/ranae) (also an author of this package) at Iowa State University in Fall 2016.

This package was first built collaboratively at the [Uncoast Unconf](https://uuconf.rbind.io), held at Des Moines in April 2019.

## Installation

You can install the development version of steward from [GitHub](https://github.com/uncoast-unconf/steward) with:

```r
# install.packages("devtools")
devtools::install_github("uncoast-unconf/steward")
```

## Usage

In our [getting started article](https://uncoast-unconf.github.io/steward/articles/steward.html), we describe some common tasks:

- create a "steward" dataset.
- write a dataset to a package, with documentation.
- create a [gt table](https://gt.rstudio.com).

### Future development

We plan to release this package to CRAN as soon as the [gt](https://gt.rstudio.com) package is released there.

Some of the capabilities we plan to develop:

- take into account [timezone as column metadata](https://github.com/uncoast-unconf/steward/issues/39).
- for [readr](https://readr.tidyverse.org) and [vroom](https://vroom.r-lib.org), a way to build a [column specification to read a flat-file](https://github.com/uncoast-unconf/steward/issues/99).
- [build metadata from package-documentation](https://github.com/uncoast-unconf/steward/issues/43) (`.Rd` file)

## Related work

The [codebook](https://rubenarslan.github.io/codebook) package can help you manage dataset metadata.

## Code of conduct

Please note that the {steward} project is released with a [Contributor Code of Conduct](CODE_OF_CONDUCT.md). By contributing to this project, you agree to abide by its terms.