Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/BriceNocenti/ggfacto
Interactive Graphs for Correspondence Analysis
https://github.com/BriceNocenti/ggfacto
r
Last synced: 3 months ago
JSON representation
Interactive Graphs for Correspondence Analysis
- Host: GitHub
- URL: https://github.com/BriceNocenti/ggfacto
- Owner: BriceNocenti
- License: gpl-3.0
- Created: 2021-03-04T15:30:39.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2024-03-17T15:50:50.000Z (11 months ago)
- Last Synced: 2024-05-29T11:28:21.077Z (8 months ago)
- Topics: r
- Language: R
- Homepage:
- Size: 2.2 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.Rmd
- License: LICENSE.md
Awesome Lists containing this project
README
---
output:
github_document
---```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```# ggfacto
[![R-CMD-check](https://github.com/BriceNocenti/ggfacto/actions/workflows/R-CMD-check.yaml/badge.svg)](https://github.com/BriceNocenti/ggfacto/actions/workflows/R-CMD-check.yaml)
Readable, complete and pretty graphs for correspondence analysis made with [FactoMineR](http://factominer.free.fr/). Many can be rendered as interactive html plots, showing useful informations at mouse hover. The interest is not mainly visual but statistical : it helps the reader to keep in mind the data contained in the cross-table or Burt table while reading correspondence analysis, thus preventing overinterpretation. Graphs are made with [ggplot2](https://ggplot2.tidyverse.org/), which means that you can use the `+` syntax to manually add as many graphical pieces you want, or change theme elements.
## Installation
You can install ggfacto from `CRAN`:
``` r
install.packages("ggfacto")
```Or install the development version from `github`:
``` r
# install.packages("devtools")
devtools::install_github("BriceNocenti/ggfacto")
```## Interactive plot from multiple correspondence analysis (MCA)
Make the MCA (using a wrapper function around `FactoMineR:MCA`) :
``` {r, results = "hide"}
library(ggfacto)data(tea, package = "FactoMineR")
res.mca <- MCA2(tea, active_vars = 1:18)
```Make the plot (as a ggplot2 object) and add a supplementary variable (`"SPC"`) :
``` r
graph_mca <- ggmca(res.mca, tea, sup_vars = "SPC", profiles = TRUE, text_repel = TRUE)
```Use `text_repel = TRUE` to avoid overlapping of text, and obtain a more readable image (be careful that, if the plot is overloaded, labels can be far away from their original location).
Use `profiles = TRUE` to draw the graph of individuals : one point is added for each profile of answers.
Turn the plot interactive :
``` r
ggi(graph_mca)
```![](readme_plot.png)
### See the crosstables of active variables directly on the plot
It is possible to print all crosstables between active variables (burt table) into the interactive tooltips. Spread from mean are colored and, usually, points near the middle will have less colors, and points at the edges will have plenty. It may takes time to print, but really helps to interpret the MCA in close proximity with the underlying data.
``` r
ggmca(res.mca, tea, sup_vars = "SPC", active_tables = "active",
ylim = c(NA, 1.2), text_repel = TRUE) %>%
ggi()
```### See the distribution of active variables for each level of a supplementary variable
``` r
ggmca(res.mca, tea, sup_vars = "SPC", active_tables = "sup",
ylim = c(NA, 1.2), text_repel = TRUE) %>%
ggi()
```### Concentration ellipses for each levels of a supplementary variable
``` {r, fig.width = 6, fig.height = 6, results = "hide"}
ggmca(res.mca, tea, sup_vars = "SPC", ylim = c(NA, 1.2), ellipses = 0.95, text_repel = TRUE, profiles = TRUE)
```### Graph of profiles of answer for each levels of a supplementary variable (with median ellipses containing half the population)
``` {r, fig.width = 6, fig.height = 6, results = "hide"}
ggmca(res.mca, tea, sup_vars = "SPC", ylim = c(NA, 1.2), type = "facets", ellipses = 0.5, profiles = TRUE)
```## Simple correspondence analysis (CA)
Make the correspondence analysis :
``` r
tabs <- tabxplor::tab_plain(forcats::gss_cat, race, marital, df = TRUE)
res.ca <- FactoMineR::CA(tabs, graph = FALSE)
``````{r, echo = FALSE}
tabs <- table(forcats::gss_cat$race, forcats::gss_cat$marital)[-4,]
res.ca <- FactoMineR::CA(tabs, graph = FALSE)
```Interactive plot :
``` r
graph.ca <- ggca(res.ca,
title = "Race by marical : correspondence analysis",
tooltips = c("row", "col"))
ggi(graph.ca)
```Image plot (with `text_repel` to avoid overlapping of labels) :
``` {r}
ggca(res.ca,
title = "Race by marical : correspondence analysis",
text_repel = TRUE, dist_labels = 0.02)
```## Personnalize plots
Step-by-step functions can be used to create a database with all the necessary data, modify it, then use it to draw the plot:
``` {r, fig.width = 6, fig.height = 6, results = "hide", message = FALSE}
library(dplyr)
library(ggplot2)plot_data <- ggmca_data(res.mca, tea, sup_vars = "SPC")
plot_data$vars_data <- plot_data$vars_data %>%
filter(!lvs %in% c("other worker", "non-worker"))ggmca_plot(plot_data, ylim = c(NA, 1.2), text_repel = TRUE)
```The plot can always be modified using the `ggplot2` `+` operator :
``` {r, fig.width = 6, fig.height = 6, results = "hide"}
ggmca_plot(plot_data, ylim = c(NA, 1.2)) +
labs(title = "Multiple correspondence analysis") +
theme(axis.line = element_line(linetype = "solid") )```
You can then pass to plot to `ggi()` to make it interactive.Set `use_theme = FALSE` to use you own ggplot2 theme :
``` {r, fig.width = 6, fig.height = 6}
ggmca_plot(plot_data, ylim = c(NA, 1.2), use_theme = FALSE) +
theme_classic()
```