Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/markvanderloo/lumberjack
Track changes in data with ease
https://github.com/markvanderloo/lumberjack
daff datascience logging r reproducible-research
Last synced: 3 months ago
JSON representation
Track changes in data with ease
- Host: GitHub
- URL: https://github.com/markvanderloo/lumberjack
- Owner: markvanderloo
- Created: 2017-05-24T06:21:50.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2024-06-14T09:16:04.000Z (5 months ago)
- Last Synced: 2024-07-14T20:22:47.963Z (4 months ago)
- Topics: daff, datascience, logging, r, reproducible-research
- Language: R
- Homepage:
- Size: 645 KB
- Stars: 63
- Watchers: 6
- Forks: 4
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- jimsghstars - markvanderloo/lumberjack - Track changes in data with ease (R)
README
## Track changes in data
[![CRAN](http://www.r-pkg.org/badges/version/lumberjack)](http://cran.r-project.org/package=lumberjack/)
[![status](https://tinyverse.netlify.com/badge/lumberjack)](https://CRAN.R-project.org/package=lumberjack)
[![Downloads](http://cranlogs.r-pkg.org/badges/lumberjack)](http://www.r-pkg.org/pkg/lumberjack)[![Mentioned in Awesome Official Statistics ](https://awesome.re/mentioned-badge.svg)](http://www.awesomeofficialstatistics.org)
[![status](https://tinyverse.netlify.app/badge/lumberjack)](https://CRAN.R-project.org/package=lumberjack)The `lumberjack` R package allows you to:
- **track changes** in **multiple data sets** as they get processed;
- using **multiple loggers** for each dataset;
- where loggers are **fully customizable**.You can get started by just adding one line of code to your existing data
analysis script.- [Overview of functionality](./pkg)
- [Talk at the eRum2018 meeting (Budapest)](https://www.youtube.com/watch?v=DNZs0CHBU4s)### Citing lumberjack
Please cite the [JSS paper](https://www.jstatsoft.org/article/view/v098i01).
```
@article{loo2020monitoring,
title = {Monitoring Data in {R} with the {lumberjack} Package},
author = {Mark P. J. {van der Loo}},
journal = {Journal of Statistical Software},
year = {2021},
volume = {98},
number = {1},
pages = {1--13},
doi = {10.18637/jss.v098.i01},
url = {https://www.jstatsoft.org/article/view/v098i01}
}
```### lumberjack philosophy
Production scripts may contain many data transformations, aimed to clean,
select, model, or augment data with new variables. Analyzing the effect of each
step is cumbersome because it involves adding a lot of code that is not
concerned with the primary goal of the script, namely to analyze and process
data.In the lumberjack philosophy, a programmer (analyst) should be only concerned
with the primary process of data analyses.![](fig/datastep2.png)
### Installation
Published version from CRAN
```r
install.packages('lumberjack')
```Development version.
```r
git clone https://github.com/markvanderloo/lumberjack
cd lumbjerjack
make install
```----
Copyright (2016) Mark van der Loo
Licenced by [EUPL 1.2](https://eupl.eu/1.2/en/)