https://github.com/gederajeg/malayic-happiness
Data and R Markdown Notebook for a conference paper titled "The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia" presented at the International Seminar on Austronesian Languages and Literature IX, at the Faculty of Humanities, Udayana University, Indonesia (https://ojs.unud.ac.id/index.php/isall/article/view/79856/41930).
https://github.com/gederajeg/malayic-happiness
austronesian-languages corpus-linguistics happiness indonesian-language lexicalisation malay malayic-languages mpi-eva-jfs
Last synced: 3 months ago
JSON representation
Data and R Markdown Notebook for a conference paper titled "The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia" presented at the International Seminar on Austronesian Languages and Literature IX, at the Faculty of Humanities, Udayana University, Indonesia (https://ojs.unud.ac.id/index.php/isall/article/view/79856/41930).
- Host: GitHub
- URL: https://github.com/gederajeg/malayic-happiness
- Owner: gederajeg
- License: other
- Created: 2021-08-05T04:47:48.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2025-04-25T04:18:50.000Z (7 months ago)
- Last Synced: 2025-06-05T16:03:13.964Z (6 months ago)
- Topics: austronesian-languages, corpus-linguistics, happiness, indonesian-language, lexicalisation, malay, malayic-languages, mpi-eva-jfs
- Language: HTML
- Homepage: https://gederajeg.github.io/malayic-happiness/
- Size: 11.5 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.Rmd
- License: LICENSE
Awesome Lists containing this project
README
---
output: github_document
title: "Supplementary materials, including data and R Markdown Notebook, for a paper titled *The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia*"
author: '[Gede Primahadi Wijaya Rajeg](https://udayananetworking.unud.ac.id/lecturer/880-gede-primahadi-wijaya-rajeg)
& [I Made Rajeg](https://udayananetworking.unud.ac.id/lecturer/1817-i-made-rajeg) 
Universitas Udayana, Indonesia'
bibliography: [biblio.bib, packages.bib]
link-citations: yes
---
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>"
)
```
# License
[](https://osf.io/y42f6/) [](https://doi.org/10.5281/zenodo.5166425) [](http://dx.doi.org/10.6084/m9.figshare.15124713)
This repository is licensed with the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Please cite this repository (in OSF) [@rajeg_codedata_2021] as follows if you use the data and other materials here in your research and/or teaching (in Unified Style Sheet for Linguistics):
> Rajeg, Gede Primahadi Wijaya & I Made Rajeg. 2021. Supplementary materials for *The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia*. Open Science Framework (OSF). https://doi.org/10.17605/OSF.IO/Y42F6.
Or using the [Zenodo](https://doi.org/10.5281/zenodo.5166425) repository version:
> Rajeg, Gede Primahadi Wijaya & I Made Rajeg. 2021. Supplementary materials for *The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia*. Zenodo. https://doi.org/10.5281/zenodo.5166425.
# Overview
The repository provides supplementary materials for our paper titled *The lexicalisation of HAPPINESS in the Malayic varieties of Indonesia*, presented at the International Seminar on Austronesian Languages and Literature IX (10 September 2021) ([conference website](https://ucs.unud.ac.id/conf/isall-ix)). The materials include (i) the data; (ii) the R Markdown Notebook interleaving our paper-texts and R codes used for writing the whole paper and running the statistical analyses and visualisations; and (iii) the figures included in the paper (see the `figures` folder). The study is based on the open-access, large corpora of naturalistic colloquial Malay/Indonesian published by the [Max Planck Institute for Evolutionary Anthropology (MPI EVA) Jakarta Field Station (JFS)](https://lingweb.eva.mpg.de/archive/jakarta/index.html) [@gil_data_2015].
## Data description
The `data` folder holds the data used in this paper.
- `indo-prov-latlong.csv` provides latitude and longitude data for the whole provinces in Indonesia
- `malayic_happy_freq_long_lat.tsv` provides the original data for the latitude and longitude and those manually culled from Google Maps
- `malayic_happy.tsv` contains the original raw data for the HAPPINESS lexicalisation
- `malayic_LIKE_df.tsv` contains the distribution of morphs glossed as 'to like' in all regions
- `malayic_LIKE_df_WK_ENT.tsv` contains distribution of morphs glossed as 'to like' in West Kalimantan and East Nusa Tenggara regions
- `non_acquisition_malayic_sessions_dataset_project.tsv` contains the metadata information for the Malayic subset of the MPI EVA JFS corpora; the metadata include the session names, regions, languoid, word-count per session, genre, mode, among others
## Required R packages
The following R packages are used in the data processing, statistical analyses, visualisation, and knitting the content of the R Markdown Notebook file (`austronesian-paper-2021-gpwrajeg.Rmd`) into MS Word format. Please make sure that they are installed in R to run the codes in the R Notebook and reproduce the results.
- [tidyverse](https://www.tidyverse.org) collection of packages [@tidyverse2019; @R-tidyverse] -- to conduct the data manipulation, processing, and visualisation), especially the functions from the following packages:
- [dplyr](https://dplyr.tidyverse.org) [@R-dplyr]
- [tidyr](https://tidyr.tidyverse.org) [@R-tidyr]
- [stringr](https://stringr.tidyverse.org) [@R-stringr]
- [ggplot2](https://ggplot2.tidyverse.org) [@R-ggplot2; @ggplot22016]
- [readr](https://readr.tidyverse.org) [@R-readr]
- [tibble](https://tibble.tidyverse.org) [@R-tibble]
- [bookdown](https://bookdown.org/home/) [@R-bookdown; @bookdown2016] and [knitr](https://yihui.org/knitr/) [@knitr2015; @R-knitr] -- to print the table and knit the R Markdown Notebook into MS Word document
- [rmarkdown](https://rmarkdown.rstudio.com) [@R-rmarkdown; @rmarkdown2018; @rmarkdown2020] -- to write the paper, combining the R codes and regular texts
- [maps](https://cran.r-project.org/web/packages/maps/maps.pdf) [@R-maps] and [mapdata](https://cran.r-project.org/web/packages/mapdata/mapdata.pdf) [@R-mapdata] -- to generate the Indonesian map
- [ggthemes](https://github.com/jrnold/ggthemes) [@R-ggthemes] -- to customise theme for map visualisation
- [ggrepel](R-ggrepel) [@R-ggrepel] -- to make automatic, non-overlapping text labels
The [R Session info](#sess-info) sub-section below shows the R version [@R-base] and operating system used for this project.
## R Session info {#sess-info}
```{r sess-info}
devtools::session_info()
```
# References