https://github.com/jvcasillas/untidydata
R package of untidy datasets made for the purpose of teaching the tidyverse
- Host: GitHub
- URL: https://github.com/jvcasillas/untidydata
- Owner: jvcasillas
- License: gpl-3.0
- Created: 2018-02-16T19:05:43.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2023-01-26T15:20:48.000Z (about 2 years ago)
- Last Synced: 2025-03-20T08:01:34.477Z (26 days ago)
- Language: R
- Homepage: https://www.jvcasillas.com/untidydata/
- Size: 104 KB
- Stars: 47
- Watchers: 2
- Forks: 7
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- jimsghstars - jvcasillas/untidydata - R package of untidy datasets made for the purpose of teaching the tidyverse (R)
README
## untidydata
An R package of untidy datasets made for the purpose of teaching the
tidyverse.

Last update: 2021-01-27
### Overview
The purpose of this package is to store untidy datasets I have been
creating for teaching purposes in a version-controlled environment. The
datasets vary in difficulty and present different problems that commonly
arise when tidying data.

### Installation
You can install the development version from GitHub with:

    install.packages("devtools")
    devtools::install_github("jvcasillas/untidydata")

### Datasets
- [language\_diversity](#language_diversity)
- [pre\_post](#pre_post)
- [spanish\_vowels](#spanish_vowels)
- [spirantization](#spirantization)
- [vot](#vot)

#### `language_diversity`
- Difficulty: easy
- A long format dataset that is most useful in wide format.
- Data taken from Appendix 1 in:
  Nettle, D. (1998). Explaining Global Patterns of Language Diversity.
  *Journal of Anthropological Archaeology*, 17, 354–374.

#### `pre_post`
- Difficulty: easy
- A typical pre-test/post-test dataset in wide format.

#### `spanish_vowels`
- Difficulty: easy
- Simulated Spanish vowel formant measurements from male and female
  speakers.

#### `spirantization`
- Difficulty: easy
- Simulated intensity measurements of CV sequences in word-initial and
  word-medial position from L2 learners and native speakers.

#### `vot`
- Difficulty: medium
- A voice-onset time dataset. Includes coronal stop data from English
  and Spanish monolinguals, as well as English/Spanish bilinguals.
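
### Example

Several of the datasets above are described as being in long or wide format, so a small reshaping sketch may help show the kind of tidying they are meant to practice. The code below does not use the package's data: the two toy data frames, their column names (`country`, `measurement`, `value`, `id`, `pre`, `post`) and their values are invented for illustration and are assumptions, not the actual structure of `language_diversity` or `pre_post`. It relies on `pivot_wider()` and `pivot_longer()` from the tidyr package.

```r
library(tidyr)

# Toy long-format data, loosely modelled on the description of
# language_diversity. Column names and values are hypothetical.
long_toy <- data.frame(
  country     = c("A", "A", "B", "B"),
  measurement = c("langs", "area", "langs", "area"),
  value       = c(10, 200, 30, 400)
)

# Long -> wide: one row per country, one column per measurement.
wide_toy <- pivot_wider(
  long_toy,
  names_from  = measurement,
  values_from = value
)

# Toy wide-format data, loosely modelled on the description of
# pre_post. Column names and values are hypothetical.
scores_wide <- data.frame(
  id   = c(1, 2, 3),
  pre  = c(50, 62, 71),
  post = c(58, 60, 80)
)

# Wide -> long: one row per participant per test.
scores_long <- pivot_longer(
  scores_wide,
  cols      = c(pre, post),
  names_to  = "test",
  values_to = "score"
)
```

With the package installed, the same calls could be tried on the real datasets after `library(untidydata)`. Inspecting a dataset with `str()` or `head()` first is advisable, since the real column names will likely differ from the toy ones used here.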