Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/underthecurve/r-data-cleaning-tricks
Data Cleaning Tricks in R for Boston University's "Storytelling with Data" workshop
https://github.com/underthecurve/r-data-cleaning-tricks
r rstudio
Last synced: about 2 months ago
JSON representation
Data Cleaning Tricks in R for Boston University's "Storytelling with Data" workshop
- Host: GitHub
- URL: https://github.com/underthecurve/r-data-cleaning-tricks
- Owner: underthecurve
- Created: 2017-06-05T19:23:30.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2017-06-12T03:45:59.000Z (over 7 years ago)
- Last Synced: 2024-08-06T03:04:30.998Z (5 months ago)
- Topics: r, rstudio
- Language: R
- Size: 1.39 MB
- Stars: 70
- Watchers: 8
- Forks: 21
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- jimsghstars - underthecurve/r-data-cleaning-tricks - Data Cleaning Tricks in R for Boston University's "Storytelling with Data" workshop (R)
README
# Tricks for cleaning your data in R
Data + Code for **"Tricks for cleaning your data in R"** at the [Storytelling with Data](https://www.bu.edu/com/data-storytelling/) workshop at Boston University on Tuesday, June 6th 2017.
Equivalent materials for **"Advancing with data visualization in R using ggplot2"** available [here](https://github.com/underthecurve/r-dataviz-ggplot2).
## Links to install R and RStudio
* [R](https://www.r-project.org/): website for the R software
* [RStudio](https://www.rstudio.com/): website for RStudio, a powerful graphical user interface for R## Files included
### Annotated code and step-by step instructions for the workshop
* [R-datacleaning-tricks.md](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/R-datacleaning-tricks.md): Markdown file (for viewing on the web)
* [R-datacleaning-tricks.pdf](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/R-datacleaning-tricks.pdf): PDF file (for printing out)### R code
* [R-datacleaning-tricks.R](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/R-datacleaning-tricks.R): the R code, which can be run in RStudio### Underlying data needed to run the R code
* [employee-earnings-report-2016.csv](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/employee-earnings-report-2016.csv): data on earnings for Boston's municipal employees, from the city's [open data portal](https://data.boston.gov/dataset/employee-earnings-report)
* [unemployment.xlsx](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/unemployment.xlsx): data on global unemployment rates from 2012 to 2016, from the [International Monetary Fund](https://www.imf.org/external/pubs/ft/weo/2017/01/weodata/index.aspx)
* [attendees.csv](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/attendees.csv): data on some attendees of this workshop, with names and identifying information removed## How to follow this workshop
* You can clone or download this repository by clicking on the green button above, "Clone or download"
* Open the `.R` file in RStudio
* Follow along by reading the `.md` file online or printing the `.pdf` file out by clicking the Github links above## Questions / Feedback?
ychristinezhang at gmail dot com
or on Twitter
[@christinezhang](https://twitter.com/christinezhang)
This work is licensed under a Creative Commons Attribution 4.0 International License.