https://github.com/daranzolin/textych

Create interactive text parallels :page_with_curl: :page_with_curl: :page_with_curl:

# textych

![](https://camo.githubusercontent.com/ea6e0ff99602c3563e3dd684abf60b30edceaeef/68747470733a2f2f696d672e736869656c64732e696f2f62616467652f6c6966656379636c652d6578706572696d656e74616c2d6f72616e67652e737667)
![CRAN log](http://www.r-pkg.org/badges/version/textych)

The goal of textych is to create interactive text parallels. This form of reference is useful for exploring similarities and differences between passages.

## Installation

You can install the development version of textych from GitHub with:

``` r
remotes::install_github("daranzolin/textych")
```

## Simple Example

Split any text into words and assign a corresponding color and tooltip.

```r
library(textych)
library(tidytext)
library(dplyr)

# Build one row per word, tagged with the passage ("A" or "B") it came from
df <- tibble(
  text = c("The quick brown fox jumps over the lazy grey dog",
           "The caterpillar ate through one nice green leaf"),
  ind = c("A", "B")
) %>%
  unnest_tokens(word, text, to_lower = FALSE) %>%
  mutate(
    # Color the color words; everything else is dark grey
    color = case_when(
      word == "brown" ~ "brown",
      word == "grey" ~ "grey",
      word == "green" ~ "green",
      TRUE ~ "#333333"
    ),
    # Words without a match get no tooltip (NA)
    tooltip = case_when(
      word == "caterpillar" ~ "An insect",
      word %in% c("fox", "dog") ~ "A cute mammal",
      word == "leaf" ~ "Vegetation"
    )
  )

textych(df, text = word, text_index = ind, color = color, tooltip = tooltip)
```

![](inst/textych-gif2.gif)
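
When the list of highlighted words grows, the `case_when()` call above can get long. A minimal sketch of one alternative, in base R, keeps the word-to-color mapping in a named vector and falls back to a default color. The names `highlight_colors` and `default_color` are illustrative assumptions, not part of textych.

```r
# Hypothetical lookup table (not part of textych): word -> display color
highlight_colors <- c(brown = "brown", grey = "grey", green = "green")
default_color <- "#333333"

words <- c("The", "quick", "brown", "fox", "jumps",
           "over", "the", "lazy", "grey", "dog")

# Look each word up by name; words without an entry come back as NA
color <- unname(highlight_colors[words])

# Fall back to the default color for every unmatched word
color[is.na(color)] <- default_color

data.frame(word = words, color = color)
```

The resulting `color` vector could be assigned inside `mutate()` in place of the `case_when()` branch, so adding a highlight means adding one entry to the lookup vector.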

## Complex Example: Greek Text Analysis

Arranging parallel texts with similar language and ideas is a common practice in textual analysis, and there is *very* expensive software that parses each word's form, tense, mood, gender, case, etc. This is a cheaper (and more customizable) alternative.

First, I load the packages, then [retrieve and parse the texts via rperseus](https://github.com/ropensci/rperseus).

``` r
library(rperseus) # remotes::install_github("ropensci/rperseus")
library(glue)

texts <- bind_rows(
  get_perseus_text("urn:cts:greekLit:tlg0031.tlg012.perseus-grc2", "1.4"),
  get_perseus_text("urn:cts:greekLit:tlg0031.tlg013.perseus-grc2", "1.3"),
  get_perseus_text("urn:cts:greekLit:tlg0031.tlg006.perseus-grc2", "8.39")
)

parsed_texts <- bind_rows(
  parse_excerpt("urn:cts:greekLit:tlg0031.tlg012.perseus-grc2", "1.4"),
  parse_excerpt("urn:cts:greekLit:tlg0031.tlg013.perseus-grc2", "1.3"),
  parse_excerpt("urn:cts:greekLit:tlg0031.tlg006.perseus-grc2", "8.39")
)
```

Second, I want to (1) specify title labels; (2) color the word ἀγάπη ("love"); and (3) create a custom HTML tooltip parsing each word on hover.

``` r
tt_data <- texts %>%
  transmute(
    text,
    passage = glue("{label} {section}")
  ) %>%
  unnest_tokens(word, text) %>%
  left_join(
    distinct(parsed_texts, word, form, .keep_all = TRUE),
    by = c("word" = "form")
  ) %>%
  mutate(color = ifelse(grepl("ἀγάπη", word), "firebrick", "#333333")) %>%
  mutate(tooltip = glue("<table>
                        <tr>
                        <th>word</th>
                        <th>part</th>
                        <th>number</th>
                        <th>gender</th>
                        <th>case</th>
                        </tr>
                        <tr>
                        <td>{word.y}</td>
                        <td>{part_of_speech}</td>
                        <td>{number}</td>
                        <td>{gender}</td>
                        <td>{case}</td>
                        </tr>
                        </table>")
  )
```
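
Words with no match in `parsed_texts` leave the `left_join()` with `NA` in the parsed columns, and `glue()` by default renders a missing value as the literal text `NA`. A minimal sketch of one way around that, in base R: the helper `make_tooltip()` is hypothetical (it is not part of textych or rperseus) and assumes, as the simple example suggests, that an `NA` tooltip simply means no tooltip.

```r
# Hypothetical helper (not part of textych): build one HTML tooltip per word.
# Missing grammatical fields become "-", and words with no parse get NA.
make_tooltip <- function(word, part, number, gender, case) {
  dash <- function(x) ifelse(is.na(x), "-", as.character(x))
  template <- paste0(
    "<table><tr><th>word</th><th>part</th><th>number</th>",
    "<th>gender</th><th>case</th></tr>",
    "<tr><td>%s</td><td>%s</td><td>%s</td><td>%s</td><td>%s</td></tr></table>"
  )
  html <- sprintf(template, dash(word), dash(part), dash(number),
                  dash(gender), dash(case))
  # No parsed word at all: return NA rather than an empty table
  html[is.na(word)] <- NA_character_
  html
}

# One parsed word and one unmatched word
make_tooltip(
  word   = c("ἀγάπη", NA),
  part   = c("noun", NA),
  number = c("singular", NA),
  gender = c("feminine", NA),
  case   = c("nominative", NA)
)
```

In the pipeline above, this could stand in for the `glue()` call, for example `mutate(tooltip = make_tooltip(word.y, part_of_speech, number, gender, case))`.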

Finally, I pass the data to `textych`, specifying the columns that hold the words, the parallel each word belongs to, the text color, and the tooltip.

```r
textych(tt_data, word, passage, color, tooltip)
```

![](inst/textych-gif1.gif)

## Future work

* Highlighting words
* More easily customizable tooltips
* Additional styling
* Improved margins