Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/daranzolin/textych
Create interactive text parallels :page_with_curl: :page_with_curl: :page_with_curl:
https://github.com/daranzolin/textych
htmlwidgets textanalysis
Last synced: 4 months ago
JSON representation
Create interactive text parallels :page_with_curl: :page_with_curl: :page_with_curl:
- Host: GitHub
- URL: https://github.com/daranzolin/textych
- Owner: daranzolin
- License: other
- Created: 2020-03-05T05:30:49.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2020-03-05T20:51:10.000Z (almost 5 years ago)
- Last Synced: 2024-08-01T00:47:41.342Z (7 months ago)
- Topics: htmlwidgets, textanalysis
- Language: JavaScript
- Size: 3.19 MB
- Stars: 8
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- jimsghstars - daranzolin/textych - Create interactive text parallels :page_with_curl: :page_with_curl: :page_with_curl: (JavaScript)
README
# textych
data:image/s3,"s3://crabby-images/48682/48682462179586e977482c2e008606146a964d7f" alt=""
data:image/s3,"s3://crabby-images/59c96/59c96de10629726584613794e94e519ed125ebbb" alt="CRAN log"The goal of textych is to create interactive text parallels. This form of reference is useful for exploring similarities and differences betwixt passages.
## Installation
You can install the released version of textych from GitHub with:
``` r
remotes::install_github("daranzolin/textych")
```
## Simple ExampleSplit any text into words and assign a corresponding color and tooltip.
```r
library(textych)
library(tidytext)
library(dplyr)df <- tibble(
text = c("The quick brown fox jumps over the lazy grey dog",
"The catepiller ate through once nice green leaf"),
ind = c("A", "B")
) %>%
unnest_tokens(word, text, to_lower = FALSE) %>%
mutate(
color = case_when(
word == "brown" ~ "brown",
word == "grey" ~ "grey",
word == "green" ~ "green",
TRUE ~ "#333333"
),
tooltip = case_when(
word == "caterpillar" ~ "An insect",
word %in% c("fox", "dog") ~ "A cute mammal",
word == "leaf" ~ "Vegetation"
)
)textych(df, text = word, text_index = ind, color = color, tooltip = tooltip)
```
data:image/s3,"s3://crabby-images/4cf36/4cf36180ef119c14392746a697273f92b31da5c7" alt=""## Complex Example: Greek Text Analysis
Arranging parallel texts with similar language and ideas is a common practice in textual analysis, and there is *very* expensive software that parses each word's form, tense, mood, gender, case, etc. This is a cheaper (and more customizable) alternative.
First, I load the packages, then [retrieve and parse the texts via rperseus.](https://github.com/ropensci/rperseus)
``` r
library(rperseus) # remotes::install_github("ropensci/rperseus")
library(glue)texts <- bind_rows(
get_perseus_text("urn:cts:greekLit:tlg0031.tlg012.perseus-grc2", "1.4"),
get_perseus_text("urn:cts:greekLit:tlg0031.tlg013.perseus-grc2", "1.3"),
get_perseus_text("urn:cts:greekLit:tlg0031.tlg006.perseus-grc2", "8.39")
)parsed_texts <- bind_rows(
parse_excerpt("urn:cts:greekLit:tlg0031.tlg012.perseus-grc2", "1.4"),
parse_excerpt("urn:cts:greekLit:tlg0031.tlg013.perseus-grc2", "1.3"),
parse_excerpt("urn:cts:greekLit:tlg0031.tlg006.perseus-grc2", "8.39")
)
```Second, I want to (1) specify title labels; (2) color the word ἀγάπη ("love"); and (3) create a custom HTML tooltip parsing each word on hover.
``` r
tt_data <- texts %>%
transmute(
text,
passage = glue("{label} {section}")
) %>%
unnest_tokens(word, text) %>%
left_join(
distinct(parsed_texts, word, form, .keep_all = TRUE),
by = c("word" = "form")
) %>%
mutate(color = ifelse(grepl("ἀγάπη", word), "firebrick", "#333333")) %>%
mutate(tooltip = glue("
word
part
number
gender
case
{word.y}
{part_of_speech}
{number}
{gender}
{case}
")
)
```Finally, I pass the data to `textych`, specifying the respective columns for each parallel, text color, and tooltip.
```r
textych(tt_data, word, passage, color, tooltip)
```
data:image/s3,"s3://crabby-images/99cde/99cde984bc4fdb3ee7fe869a5e376daef989a399" alt=""## Future work
* Highlighting words
* More easily customizable tooltips
* Additional styling
* Improved margins