https://github.com/underthecurve/r-data-cleaning-tricks
  
  
    Data Cleaning Tricks in R for Boston University's "Storytelling with Data" workshop 
    https://github.com/underthecurve/r-data-cleaning-tricks
  
r rstudio
        Last synced: 4 months ago 
        JSON representation
    
Data Cleaning Tricks in R for Boston University's "Storytelling with Data" workshop
- Host: GitHub
 - URL: https://github.com/underthecurve/r-data-cleaning-tricks
 - Owner: underthecurve
 - Created: 2017-06-05T19:23:30.000Z (over 8 years ago)
 - Default Branch: master
 - Last Pushed: 2017-06-12T03:45:59.000Z (over 8 years ago)
 - Last Synced: 2024-11-22T15:41:27.417Z (12 months ago)
 - Topics: r, rstudio
 - Language: R
 - Size: 1.39 MB
 - Stars: 70
 - Watchers: 8
 - Forks: 21
 - Open Issues: 0
 - 
            Metadata Files:
            
- Readme: README.md
 
 
Awesome Lists containing this project
- jimsghstars - underthecurve/r-data-cleaning-tricks - Data Cleaning Tricks in R for Boston University's "Storytelling with Data" workshop (R)
 
README
          # Tricks for cleaning your data in R
Data + Code for **"Tricks for cleaning your data in R"** at the [Storytelling with Data](https://www.bu.edu/com/data-storytelling/) workshop at Boston University on Tuesday, June 6th 2017.
Equivalent materials for **"Advancing with data visualization in R using ggplot2"** available [here](https://github.com/underthecurve/r-dataviz-ggplot2).
## Links to install R and RStudio
* [R](https://www.r-project.org/): website for the R software
* [RStudio](https://www.rstudio.com/): website for RStudio, a powerful graphical user interface for R
## Files included
### Annotated code and step-by step instructions for the workshop
* [R-datacleaning-tricks.md](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/R-datacleaning-tricks.md): Markdown file (for viewing on the web)
* [R-datacleaning-tricks.pdf](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/R-datacleaning-tricks.pdf): PDF file (for printing out)
### R code
* [R-datacleaning-tricks.R](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/R-datacleaning-tricks.R): the R code, which can be run in RStudio
### Underlying data needed to run the R code
* [employee-earnings-report-2016.csv](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/employee-earnings-report-2016.csv): data on earnings for Boston's municipal employees, from the city's [open data portal](https://data.boston.gov/dataset/employee-earnings-report)
* [unemployment.xlsx](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/unemployment.xlsx): data on global unemployment rates from 2012 to 2016, from the [International Monetary Fund](https://www.imf.org/external/pubs/ft/weo/2017/01/weodata/index.aspx)
* [attendees.csv](https://github.com/underthecurve/r-data-cleaning-tricks/blob/master/attendees.csv): data on some attendees of this workshop, with names and identifying information removed
## How to follow this workshop
* You can clone or download this repository by clicking on the green button above, "Clone or download"
* Open the `.R` file in RStudio 
* Follow along by reading the `.md` file online or printing the `.pdf` file out by clicking the Github links above
## Questions / Feedback?
ychristinezhang at gmail dot com
or on Twitter
[@christinezhang](https://twitter.com/christinezhang)

This work is licensed under a Creative Commons Attribution 4.0 International License.