https://github.com/jvcasillas/untidydata

R package of untidy datasets made for the purpose of teaching the tidyverse
https://github.com/jvcasillas/untidydata

Last synced: 3 months ago
JSON representation

R package of untidy datasets made for the purpose of teaching the tidyverse

Host: GitHub
URL: https://github.com/jvcasillas/untidydata
Owner: jvcasillas
License: gpl-3.0
Created: 2018-02-16T19:05:43.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2023-01-26T15:20:48.000Z (over 2 years ago)
Last Synced: 2025-03-20T08:01:34.477Z (3 months ago)
Language: R
Homepage: https://www.jvcasillas.com/untidydata/
Size: 104 KB
Stars: 47
Watchers: 2
Forks: 7
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

jimsghstars - jvcasillas/untidydata - R package of untidy datasets made for the purpose of teaching the tidyverse (R)

README

        

[![R-CMD-check](https://github.com/jvcasillas/untidydata/workflows/R-CMD-check/badge.svg)](https://github.com/jvcasillas/untidydata/actions)  

[![CodeFactor](https://www.codefactor.io/repository/github/jvcasillas/untidydata/badge)](https://www.codefactor.io/repository/github/jvcasillas/untidydata)

## untidydata 

An R package of untidy datasets made for the purpose of teaching the

tidyverse.

Last update: 2021-01-27

### Overview

The purpose of this package is to store untidy datasets I have been

creating for teaching purposes in a version controlled environment. The

datasets vary in difficulty and present different problems common when

tidying data.

### Installation

You can install the development version from GitHub with:

    install.packages("devtools")

    devtools::install_github("jvcasillas/untidydata")

### Datasets

-   [language\_diversity](#language_diversity)

-   [pre\_post](#pre_post)

-   [spanish\_vowels](#spanish_vowels)

-   [spirantization](#spirantization)

-   [vot](#vot)

#### `language_diversity`

-   Difficulty: easy

-   A long format dataset that is most useful in wide format.

-   Data taken from Appendix 1 in:  

    Nettle, D. (1998). Explaining Global Patterns of Language Diversity.

    *Journal of Anthropological Archaeology*, 17, 354–374.

#### `pre_post`

-   Difficulty: easy

-   A typical pre-test, post-test data set in wide format.

#### `spanish_vowels`

-   Difficulty: easy

-   Simulated Spanish vowel formant measurements from male and female

    speakers.

#### `spirantization`

-   Difficulty: easy

-   Simulated intensity measurements of CV sequences in word initial and

    word medial position from L2 learners and native speakers.

#### `vot`

-   Difficulty: medium

-   A voice-onset time data set. Includes coronal stop data from English

    and Spanish monolinguals, as well as English/Spanish bilinguals.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jvcasillas/untidydata

Awesome Lists containing this project

README