https://github.com/btskinner/duawranglr
R Package to Securely Wrangle Dataset According to Data Usage Agreement
https://github.com/btskinner/duawranglr
data-security data-usage-agreement data-wrangling package r
Last synced: 5 months ago
JSON representation
R Package to Securely Wrangle Dataset According to Data Usage Agreement
- Host: GitHub
- URL: https://github.com/btskinner/duawranglr
- Owner: btskinner
- License: other
- Created: 2018-04-10T19:29:54.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2021-06-13T17:26:44.000Z (over 4 years ago)
- Last Synced: 2025-07-27T12:50:30.033Z (5 months ago)
- Topics: data-security, data-usage-agreement, data-wrangling, package, r
- Language: R
- Homepage: https://www.btskinner.io/duawranglr
- Size: 610 KB
- Stars: 9
- Watchers: 2
- Forks: 2
- Open Issues: 6
-
Metadata Files:
- Readme: README.Rmd
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
---
title: duawranglr
output: md_document
---
# duawranglr
```{r, include = FALSE}
options(width = 100)
```
[](https://github.com/btskinner/duawranglr/actions)
[](https://github.com/btskinner/duawranglr)
[](https://CRAN.R-project.org/package=duawranglr)
The guiding principle behind duawranglr is to make it easier for
organizations to share data that contain protected elements and/or
personally idenfiable information (PII) with researchers. There are
two key problems this package attempts to solve:
1. Data owners and reseachers may wish to collaborate on multiple
projects, each with a different level of data security required;
executing a unique data usage agreement (DUA) for each project can be
time consuming and inefficient.
2. Administrators tasked with approving data requests do not always
have the time or technical proficiency to closely review the code that
reads, subsets, filters, and deidentifies data files according to a
DUA.
This package offers a set of functions to help users create shareable
data sets from raw data files that contain protected elements. Relying
on master crosswalk files that list restricted variables, package
functions warn users about possible violations of data usage agreement
and prevent writing protected elements.
## Installation
Install the latest released version from CRAN with
```r
install.packages("duawranglr")
```
Install the latest development version from Github with
```r
devtools::install_github("btskinner/duawranglr")
```
## Usage
See vignettes or [documentation site](https://btskinner.io/duawranglr)
for package motivation and an example use case.