https://github.com/renkun-ken/rlist

A Toolbox for Non-Tabular Data Manipulation
https://github.com/renkun-ken/rlist

Last synced: 8 months ago
JSON representation

A Toolbox for Non-Tabular Data Manipulation

Host: GitHub
URL: https://github.com/renkun-ken/rlist
Owner: renkun-ken
License: other
Created: 2014-06-01T10:33:11.000Z (almost 12 years ago)
Default Branch: master
Last Pushed: 2023-03-11T12:55:35.000Z (almost 3 years ago)
Last Synced: 2025-07-07T16:15:01.838Z (8 months ago)
Language: R
Homepage:
Size: 892 KB
Stars: 205
Watchers: 17
Forks: 28
Open Issues: 25
Metadata Files:
- Readme: README.Rmd
- License: LICENSE

Awesome Lists containing this project

fucking-awesome-R - rlist - A toolbox for non-tabular data manipulation with lists. (Data Manipulation)
awesome-R - rlist - A toolbox for non-tabular data manipulation with lists. (Data Manipulation)
jimsghstars - renkun-ken/rlist - A Toolbox for Non-Tabular Data Manipulation (R)
awesome-R - rlist - A toolbox for non-tabular data manipulation with lists. (Data Manipulation)

README

          ```{r knitsetup, echo=FALSE, results='hide', warning=FALSE, message=FALSE, cache=FALSE}

library(knitr)

opts_knit$set(base.dir='./', fig.path='', out.format='md')

opts_chunk$set(prompt=FALSE, comment='', results='markup')

```

# rlist

[![R-CMD-check](https://github.com/renkun-ken/rlist/workflows/R-CMD-check/badge.svg)](https://github.com/renkun-ken/rlist/actions)

[![codecov.io](https://codecov.io/github/renkun-ken/rlist/coverage.svg?branch=master)](https://codecov.io/github/renkun-ken/rlist?branch=master)

[![CRAN Version](https://www.r-pkg.org/badges/version/rlist)](https://cran.r-project.org/package=rlist)

rlist is a set of tools for working with list objects. Its goal is to make it easier to work with lists by providing a wide range of functions that operate on non-tabular data stored in them.

This package supports list mapping, filtering, grouping, sorting, updating, searching, file input/output, and many other functions. Most functions in the package are designed to be pipeline friendly so that data processing with lists can be chained.

**[rlist Tutorial](https://renkun-ken.github.io/rlist-tutorial/) is a highly recommended complete guide to rlist.**

This document is also translated into  [日本語](https://github.com/renkun-ken/rlist/blob/master/README.ja.md) (by [@teramonagi](https://github.com/teramonagi)).

## Installation

Install the latest version from GitHub:

```r

devtools::install_github("renkun-ken/rlist")

```

Install from [CRAN](https://cran.r-project.org/package=rlist):

```r

install.packages("rlist")

```

## Motivation

In R, there are numerous powerful tools to deal with structured data stored in tabular form such as data frame. However, a variety of data is non-tabular: different records may have different fields; for each field they may have different number of values. 

It is hard or no longer straightforward to store such data in data frame, but the `list` object in R is flexible enough to represent such records of diversity. rlist is a toolbox to deal with non-structured data stored in `list` objects, providing a collection of high-level functions which are pipeline friendly.

## Getting started

Suppose we have a list of developers, each of whom has a name, age, a few interests, a list of programming languages they use and the number of years they have been using them.

```{r}

library(rlist)

devs <- 

  list(

    p1=list(name="Ken",age=24,

      interest=c("reading","music","movies"),

      lang=list(r=2,csharp=4)),

    p2=list(name="James",age=25,

      interest=c("sports","music"),

      lang=list(r=3,java=2,cpp=5)),

    p3=list(name="Penny",age=24,

      interest=c("movies","reading"),

      lang=list(r=1,cpp=4,python=2)))

```

This type of data is non-relational since it does not well fit the shape of a data frame,  yet it can be easily stored in JSON or YAML format. In R, list objects are flexible enough to represent a wide range of non-relational datasets like this. This package provides a wide range of functions to query and manipulate this type of data.

The following examples use `str()` to show the structure of the output.

### Filtering

Filter those who like music and has been using R for more than 3 years.

```{r}

str( list.filter(devs, "music" %in% interest & lang$r >= 3) )

```

### Selecting

Select their names and ages.

```{r}

str( list.select(devs, name, age) )

```

### Mapping

Map each of them to the number of interests.

```{r}

str( list.map(devs, length(interest)) )

```

### More functions

In addition to these basic functions, rlist also supports various types of grouping, joining, searching, sorting, updating, etc. For the introduction to more functionality, please go through the [rlist Tutorial](https://renkun-ken.github.io/rlist-tutorial/).

## Lambda expression

In this package, almost all functions that work with expressions accept the following forms of lambda expressions:

- Implicit lambda expression: `expression`

- Univariate lambda expressions: 

    * `x ~ expression`

    * `f(x) ~ expression`

- Multivariate lambda expressions:

    * `f(x,i) ~ expression`

    * `f(x,i,name) ~ expression`

where `x` refers to the list member itself, `i` denotes the index, `name` denotes the name. If the symbols are not explicitly declared, `.`, `.i` and `.name` will by default be used to represent them, respectively.

```r

nums <- list(a=c(1,2,3),b=c(2,3,4),c=c(3,4,5))

list.map(nums, c(min=min(.),max=max(.)))

list.filter(nums, x ~ mean(x)>=3)

list.map(nums, f(x,i) ~ sum(x,i))

```

## Using pipeline

### Working with pipe syntax

Query the name of each developer who likes music and uses R, and put the results in a data frame.

```{r}

devs |>

  list.filter("music" %in% interest & "r" %in% names(lang)) |>

  list.select(name, age) |>

  list.stack()

```

The example above uses the pipe syntax `|>` introduced in R 4.1 that chains commands in a fluent style.

### List environment

`List()` function wraps a list within an environment where almost all list functions are defined. Here is the List-environment version of the previous example.

```{r}

ldevs <- List(devs)

ldevs$filter("music" %in% interest & "r" %in% names(lang))$

  select(name,age)$

  stack()$

  data

```

## License

This package is under [MIT License](https://opensource.org/licenses/MIT).

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/renkun-ken/rlist

Awesome Lists containing this project

README