https://github.com/corybrunson/plt-topdown

R interface to & implementation of the Persistence Landscapes Toolbox
https://github.com/corybrunson/plt-topdown
persistent-homology r topological-data-analysis
Last synced: 4 months ago
JSON representation
R interface to & implementation of the Persistence Landscapes Toolbox
Host: GitHub
URL: https://github.com/corybrunson/plt-topdown
Owner: corybrunson
License: gpl-3.0
Created: 2022-11-18T12:58:58.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2023-02-23T19:42:19.000Z (over 2 years ago)
Last Synced: 2025-02-05T00:41:47.071Z (5 months ago)
Topics: persistent-homology, r, topological-data-analysis
Language: C++
Homepage:
Size: 7.83 MB
Stars: 1
Watchers: 4
Forks: 1
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

README

        

    ## ℹ Loading plt

# plt

Persistence landscapes are a vectorization of persistence data (also

called persistence diagrams), [originated by Peter

Bubenik](https://jmlr.csail.mit.edu/papers/v16/bubenik15a.html), that

have useful statistical properties including linearity and an inner

product. This is an R package interface to [Paweł Dłotko’s Persistence

Landscapes

Toolbox](https://www2.math.upenn.edu/~dlotko/persistenceLandscape.html),

[developed with

Bubenik](https://www.sciencedirect.com/science/article/pii/S0747717116300104)

to efficienctly compute and calculate with persistence landscapes. It

was adapted from [Jose Bouza’s **tda-tools**

package](https://github.com/jjbouza/tda-tools), a TDA pipeline used by

[Bubenik’s

lab](https://people.clas.ufl.edu/peterbubenik/researchgroup/).

## Installation

Since the package is not on CRAN, we suggest that you install

**remotes** on your system. To do this just open an R session and run:

``` r

install.packages("remotes")

```

Then install the package from the GitHub repository as follows:

``` r

remotes::install_github("corybrunson/plt")

```

Alternatively, you can clone or download the code repository and install

the package from source (from within the directory):

``` r

devtools::install()

```

You should now be able to load the package normally from an R session:

``` r

library(plt)

```

## Quickstart Guide

The **plt** package supports various operations involving persistence

landscapes:

- Compute persistence landscapes from persistence data

- Perform Hilbert space operations (vector space operations and an inner

  product) on persistence landscapes

- Plot persistence landscapes

Examples and tests in **plt** rely on other packages to simulate data

and to compute persistence diagrams from data:

- **tdaunif** provides functions to sample uniformly from various

  immersed manifolds.

- **ripserr** and **TDA** provide functions to compute persistence data

  from point clouds and distance matrices.

### Calculation

**plt** introduces the ‘Rcpp_PersistenceLandscape’ S4 class, which is

exposed using **Rcpp** from the underlying ‘PersistenceLandscape’ C++

class. Instances of this class can be created using `new()` but the

recommended way is to use `landscape()`. This function accepts either a

single matrix of persistence data or a persistence diagram object, which

is a specially formatted list containing the first and most important

element `$pairs`. The `$pairs` entry is itself a list, of a 2-column

matrix of persistence pairs for each homological degree from 0

(`$pairs[[1]]`) to the maximum degree calculated. The generic conversion

function `as_persistence()` includes methods for outputs from

`ripserr::vietoris_rips()` and from `TDA::*Diag()`.

To begin an illustration, we noisily sample 60 points from a figure

eight and compute the persistence diagram of the point cloud:

``` r

set.seed(513611L)

pc <- tdaunif::sample_lemniscate_gerono(60, sd = .1)

pd <- ripserr::vietoris_rips(pc, dim = 1, threshold = 2, p = 2)

#> Warning in vietoris_rips.matrix(pc, dim = 1, threshold = 2, p = 2): `dim`

#> parameter has been deprecated; use `max_dim` instead.

print(pd)

#> PHom object containing persistence data for 63 features.

#> 

#> Contains:

#> * 59 0-dim features

#> * 4 1-dim features

#> 

#> Radius/diameter: min = 0; max = 0.63582.

```

We the convert the persistence data to the preferred persistence diagram

format and inspect some of its features:

``` r

pd <- as_persistence(pd)

print(head(pd$pairs[[1]]))

#>      [,1]       [,2]

#> [1,]    0 0.01918952

#> [2,]    0 0.01947548

#> [3,]    0 0.02604350

#> [4,]    0 0.04218479

#> [5,]    0 0.04542467

#> [6,]    0 0.05941691

print(head(pd$pairs[[2]]))

#>           [,1]      [,2]

#> [1,] 0.4809292 0.6358225

#> [2,] 0.3016234 0.6075172

#> [3,] 0.2504500 0.2727915

#> [4,] 0.2251884 0.2300871

```

This allows us to compute a persistence landscape—in this case, for the

1-dimensional features. Here we compute the landscape exactly, which can

be cost-prohibitive for larger persistence data:

``` r

pl1 <- landscape(pd, degree = 1, exact = TRUE)

print(pl1)

#> Persistence landscape (exact format) of 2 envelopes over (0.2,0.6)

```

### Class

The object `pl1` is not an array, but rather an object that encapsulates

both the data that encode a landscape and several basic operations that

can be performed on it. This allows us to work with persistence

landscapes without worrying about pre-processing their representations.

At any point, the underlying landscape can be extracted using

`$getInternal()`, which in the case of an exactly calculated landscape

returns a list of 2-column matrices, each matrix containing coordinates

that define one level of the landscape as a piecewise linear function:

``` r

print(length(pl1$getInternal()))

#> [1] 2

print(pl1$getInternal())

#> [[1]]

#>            [,1]        [,2]

#>  [1,]      -Inf 0.000000000

#>  [2,] 0.2251884 0.000000000

#>  [3,] 0.2276378 0.002449358

#>  [4,] 0.2300871 0.000000000

#>  [5,] 0.2504500 0.000000000

#>  [6,] 0.2616207 0.011170764

#>  [7,] 0.2727915 0.000000000

#>  [8,] 0.3016234 0.000000000

#>  [9,] 0.4545703 0.152946885

#> [10,] 0.5442232 0.063293964

#> [11,] 0.5583759 0.077446647

#> [12,] 0.6358225 0.000000000

#> [13,]       Inf 0.000000000

#> 

#> [[2]]

#>           [,1]       [,2]

#> [1,]      -Inf 0.00000000

#> [2,] 0.4809292 0.00000000

#> [3,] 0.5442232 0.06329396

#> [4,] 0.6075172 0.00000000

#> [5,]       Inf 0.00000000

```

The length of this list is the number of levels of the landscape.

An alternative, approximate construction computes the value of each

level of the landscape at each point on a 1-dimensional grid, ranging

from `xmin` to `xmax` at increments of `by`. A landscape constructed

using a discrete approximation is stored as a 3-dimensional array of

dimensions (levels, values, 2), with one level per feature (some of

which may be trivial) and one value per grid point, stored as $x,y$

pairs along the third dimension.

``` r

b_ran <- pl_support(pl1)

pl1d <- landscape(pd, degree = 1,

                  xmin = b_ran[[1L]], xmax = b_ran[[2L]], by = 0.02)

print(dim(pl1d$getInternal()))

#> [1]  4 21  2

print(pl1d$getInternal())

#> , , 1

#> 

#>           [,1]      [,2]      [,3]      [,4]      [,5]      [,6]      [,7]

#> [1,] 0.2251884 0.2451884 0.2651884 0.2851884 0.3051884 0.3251884 0.3451884

#> [2,] 0.2251884 0.2451884 0.2651884 0.2851884 0.3051884 0.3251884 0.3451884

#> [3,] 0.2251884 0.2451884 0.2651884 0.2851884 0.3051884 0.3251884 0.3451884

#> [4,] 0.2251884 0.2451884 0.2651884 0.2851884 0.3051884 0.3251884 0.3451884

#>           [,8]      [,9]     [,10]     [,11]     [,12]     [,13]     [,14]

#> [1,] 0.3651884 0.3851884 0.4051884 0.4251884 0.4451884 0.4651884 0.4851884

#> [2,] 0.3651884 0.3851884 0.4051884 0.4251884 0.4451884 0.4651884 0.4851884

#> [3,] 0.3651884 0.3851884 0.4051884 0.4251884 0.4451884 0.4651884 0.4851884

#> [4,] 0.3651884 0.3851884 0.4051884 0.4251884 0.4451884 0.4651884 0.4851884

#>          [,15]     [,16]     [,17]     [,18]     [,19]     [,20]     [,21]

#> [1,] 0.5051884 0.5251884 0.5451884 0.5651884 0.5851884 0.6051884 0.6251884

#> [2,] 0.5051884 0.5251884 0.5451884 0.5651884 0.5851884 0.6051884 0.6251884

#> [3,] 0.5051884 0.5251884 0.5451884 0.5651884 0.5851884 0.6051884 0.6251884

#> [4,] 0.5051884 0.5251884 0.5451884 0.5651884 0.5851884 0.6051884 0.6251884

#> 

#> , , 2

#> 

#>      [,1] [,2]        [,3] [,4]        [,5]       [,6]       [,7]       [,8]

#> [1,]    0    0 0.007603077    0 0.003565016 0.02356502 0.04356502 0.06356502

#> [2,]    0    0 0.000000000    0 0.000000000 0.00000000 0.00000000 0.00000000

#> [3,]    0    0 0.000000000    0 0.000000000 0.00000000 0.00000000 0.00000000

#> [4,]    0    0 0.000000000    0 0.000000000 0.00000000 0.00000000 0.00000000

#>            [,9]    [,10]    [,11]    [,12]     [,13]       [,14]      [,15]

#> [1,] 0.08356502 0.103565 0.123565 0.143565 0.1423288 0.122328755 0.10232875

#> [2,] 0.00000000 0.000000 0.000000 0.000000 0.0000000 0.004259174 0.02425917

#> [3,] 0.00000000 0.000000 0.000000 0.000000 0.0000000 0.000000000 0.00000000

#> [4,] 0.00000000 0.000000 0.000000 0.000000 0.0000000 0.000000000 0.00000000

#>           [,16]      [,17]      [,18]      [,19]       [,20]      [,21]

#> [1,] 0.08232875 0.06425917 0.07063412 0.05063412 0.030634119 0.01063412

#> [2,] 0.04425917 0.06232875 0.04232875 0.02232875 0.002328755 0.00000000

#> [3,] 0.00000000 0.00000000 0.00000000 0.00000000 0.000000000 0.00000000

#> [4,] 0.00000000 0.00000000 0.00000000 0.00000000 0.000000000 0.00000000

```

Exactly computed landscapes can be converted to discrete landscape

objects, but the other direction is not well-defined. Below, we view a

portion of the discretized exact landscape and make a failed attempt to

exactify the discrete one:

``` r

# default conversion to discrete uses `by = 0.001`

# TODO: Export method with `xmin,xmax,by` parameters.

print(dim(pl1$toDiscrete()))

#> [1]   2 411   2

# print first 12 x-coordinates

pl1$toDiscrete()[, seq(220L, 280L), , drop = FALSE]

#> , , 1

#> 

#>           [,1]      [,2]      [,3]      [,4]      [,5]      [,6]      [,7]

#> [1,] 0.4441884 0.4451884 0.4461884 0.4471884 0.4481884 0.4491884 0.4501884

#> [2,] 0.4441884 0.4451884 0.4461884 0.4471884 0.4481884 0.4491884 0.4501884

#>           [,8]      [,9]     [,10]     [,11]     [,12]     [,13]     [,14]

#> [1,] 0.4511884 0.4521884 0.4531884 0.4541884 0.4551884 0.4561884 0.4571884

#> [2,] 0.4511884 0.4521884 0.4531884 0.4541884 0.4551884 0.4561884 0.4571884

#>          [,15]     [,16]     [,17]     [,18]     [,19]     [,20]     [,21]

#> [1,] 0.4581884 0.4591884 0.4601884 0.4611884 0.4621884 0.4631884 0.4641884

#> [2,] 0.4581884 0.4591884 0.4601884 0.4611884 0.4621884 0.4631884 0.4641884

#>          [,22]     [,23]     [,24]     [,25]     [,26]     [,27]     [,28]

#> [1,] 0.4651884 0.4661884 0.4671884 0.4681884 0.4691884 0.4701884 0.4711884

#> [2,] 0.4651884 0.4661884 0.4671884 0.4681884 0.4691884 0.4701884 0.4711884

#>          [,29]     [,30]     [,31]     [,32]     [,33]     [,34]     [,35]

#> [1,] 0.4721884 0.4731884 0.4741884 0.4751884 0.4761884 0.4771884 0.4781884

#> [2,] 0.4721884 0.4731884 0.4741884 0.4751884 0.4761884 0.4771884 0.4781884

#>          [,36]     [,37]     [,38]     [,39]     [,40]     [,41]     [,42]

#> [1,] 0.4791884 0.4801884 0.4811884 0.4821884 0.4831884 0.4841884 0.4851884

#> [2,] 0.4791884 0.4801884 0.4811884 0.4821884 0.4831884 0.4841884 0.4851884

#>          [,43]     [,44]     [,45]     [,46]     [,47]     [,48]     [,49]

#> [1,] 0.4861884 0.4871884 0.4881884 0.4891884 0.4901884 0.4911884 0.4921884

#> [2,] 0.4861884 0.4871884 0.4881884 0.4891884 0.4901884 0.4911884 0.4921884

#>          [,50]     [,51]     [,52]     [,53]     [,54]     [,55]     [,56]

#> [1,] 0.4931884 0.4941884 0.4951884 0.4961884 0.4971884 0.4981884 0.4991884

#> [2,] 0.4931884 0.4941884 0.4951884 0.4961884 0.4971884 0.4981884 0.4991884

#>          [,57]     [,58]     [,59]     [,60]     [,61]

#> [1,] 0.5001884 0.5011884 0.5021884 0.5031884 0.5041884

#> [2,] 0.5001884 0.5011884 0.5021884 0.5031884 0.5041884

#> 

#> , , 2

#> 

#>          [,1]      [,2]      [,3]      [,4]      [,5]      [,6]      [,7]

#> [1,] 0.142595 0.1435921 0.1445892 0.1455863 0.1465835 0.1475806 0.1485777

#> [2,] 0.000000 0.0000000 0.0000000 0.0000000 0.0000000 0.0000000 0.0000000

#>           [,8]      [,9]    [,10]     [,11]     [,12]     [,13]     [,14]

#> [1,] 0.1495748 0.1505719 0.151569 0.1525661 0.1515746 0.1505831 0.1495915

#> [2,] 0.0000000 0.0000000 0.000000 0.0000000 0.0000000 0.0000000 0.0000000

#>       [,15]     [,16]     [,17]     [,18]     [,19]     [,20]     [,21]

#> [1,] 0.1486 0.1476085 0.1466169 0.1456254 0.1446339 0.1436424 0.1426508

#> [2,] 0.0000 0.0000000 0.0000000 0.0000000 0.0000000 0.0000000 0.0000000

#>          [,22]     [,23]     [,24]     [,25]     [,26]     [,27]     [,28]

#> [1,] 0.1416593 0.1406678 0.1396762 0.1386847 0.1376932 0.1367016 0.1357101

#> [2,] 0.0000000 0.0000000 0.0000000 0.0000000 0.0000000 0.0000000 0.0000000

#>          [,29]     [,30]     [,31]    [,32]     [,33]     [,34]     [,35]

#> [1,] 0.1347186 0.1337271 0.1327355 0.131744 0.1307525 0.1297609 0.1287694

#> [2,] 0.0000000 0.0000000 0.0000000 0.000000 0.0000000 0.0000000 0.0000000

#>          [,36]     [,37]        [,38]       [,39]       [,40]       [,41]

#> [1,] 0.1277779 0.1267864 0.1257948213 0.124803292 0.123811762 0.122820233

#> [2,] 0.0000000 0.0000000 0.0009884309 0.001976862 0.002965293 0.003953724

#>            [,42]       [,43]       [,44]       [,45]       [,46]       [,47]

#> [1,] 0.121828703 0.120837174 0.119845644 0.118854114 0.117862585 0.116871055

#> [2,] 0.004942154 0.005930585 0.006919016 0.007907447 0.008895878 0.009884309

#>           [,48]      [,49]     [,50]      [,51]      [,52]      [,53]

#> [1,] 0.11587953 0.11488800 0.1138965 0.11290494 0.11191341 0.11092188

#> [2,] 0.01087274 0.01186117 0.0128496 0.01383803 0.01482646 0.01581489

#>           [,54]      [,55]      [,56]      [,57]      [,58]      [,59]

#> [1,] 0.10993035 0.10893882 0.10794729 0.10695576 0.10596423 0.10497270

#> [2,] 0.01680333 0.01779176 0.01878019 0.01976862 0.02075705 0.02174548

#>           [,60]      [,61]

#> [1,] 0.10398117 0.10298964

#> [2,] 0.02273391 0.02372234

try(pl1d$toExact())

#> Error in pl1d$toExact() : 

#>   Error: Can not convert a discrete PL to an exact PL.

```

### Visualization

**plt** provides a `plot()` method for the ‘Rcpp_PersistenceLandscape’

class. It uses **grDevices** to build color palettes, and as such its

default palette is viridis; but the user may supply the name of a

recognized palette or a sequence of colors between which to interpolate:

``` r

n_env <- max(pl_num_envelopes(pl1), pl_num_envelopes(pl1d))

par(mfrow = c(2L, 1L), mar = c(2, 2, 0, 2))

plot(pl1, palette = "terrain", n_levels = n_env, asp = 1)

plot(pl1d, palette = "terrain", n_levels = n_env, asp = 1)

```

![](man/figures/README-unnamed-chunk-12-1.png)

### Vector Operations

### Inner Product
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/corybrunson/plt-topdown

Awesome Lists containing this project

README