Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dmrodz/dataMeta
An R package to create and append a data dictionary for an R dataset
https://github.com/dmrodz/dataMeta
Last synced: 9 days ago
JSON representation
An R package to create and append a data dictionary for an R dataset
- Host: GitHub
- URL: https://github.com/dmrodz/dataMeta
- Owner: dmrodz
- License: gpl-3.0
- Created: 2017-04-22T22:06:36.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2022-04-19T23:27:46.000Z (over 2 years ago)
- Last Synced: 2024-11-14T14:56:38.091Z (29 days ago)
- Language: R
- Homepage:
- Size: 1.42 MB
- Stars: 23
- Watchers: 3
- Forks: 8
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- jimsghstars - dmrodz/dataMeta - An R package to create and append a data dictionary for an R dataset (R)
README
## dataMeta - An R package to create and append a data dictionary for an R dataset
Datasets generally require a data dictionary that will incorporate information related to the dataset's variables and their descriptions, including the description of any variable options. This information is crucial, particularly when a dataset will be shared for further analyses. However, constructing it may be time-consuming. The `dataMeta` package has a collection of functions that are designed to construct a data dictionary and append it to the original dataset as an attribute, along with other information generally provided in other software as metadata. This information will include: the time and date when it was edited last, the user name, and a main description of the dataset. Finally, the dataset is saved as an R dataset (.rds).
```diff
Suggestions are most welcome!
```Install dataMeta from GitHub (v0.1.2) using devtools:
last_update: ongoing
```
install.packages("devtools")
library(devtools)
install_github("dmrodz/dataMeta")
```Install dataMeta v0.1.1 from CRAN:
```
install.packages("dataMeta")
```### Feel free to view the [vignette](http://htmlpreview.github.io/?https://github.com/dmrodz/dataMeta/blob/master/inst/doc/dataMeta_Vignette.html)!
dataMeta workflow
There are three basic steps to building a data dictionary with this package, which are outlined in the figure below. First, a "linker" data frame is created, where the user will add each variable description and also provide a variable "type" or key. These keys/variable types are explained in the vignette (see link above). The variable type will depend on whether the user wants to list all available variable options or if a range of variable values would suffice. Secondly, the main data dictionary is created using the original dataset and the linker data frame. Here, the user will be able to construct any additional variable option descriptions as needed or just build a dictionary with variable names, their descriptions and options. Finally, the user can append the dictionary to the original dataset as an R attribute, along with the date in which the dictionary is created, the author name, and also general R attributes included for data frames. The new dataset with its attributes is then saved as an R dataset (.rds).