An open API service indexing awesome lists of open source software.

https://github.com/post2web/ddscasestudy1

Case study 1 for Doing Data Science at SMU
https://github.com/post2web/ddscasestudy1

casestudy1 dds

Last synced: about 1 year ago
JSON representation

Case study 1 for Doing Data Science at SMU

Awesome Lists containing this project

README

          

# DDSCaseStudy1

This is a Case Study 1 project for DDS at SMU.

## Instructions for running the project
* Clone the repository:

* git clone https://github.com/post2web/DDSCaseStudy1
* Go in the DDSCaseStudy1 folder and open Makefile.Rmd with RStudio ( having R Working Directory set to the root of the project )
* Knit the Makefile.Rmd

## Files and folders
* Makefile.Rmd - build the project in a Makefile stile
* Analysis.Rmd - contains all analysis of the data
* data - a folder container for the data files
* source - a folder containing all scripts used for downloading and cleaning
* source/download.R - a script used for downloading the csv data files
* downloads [GDP data](https://d396qusza40orc.cloudfront.net/getdata%2Fdata%2FGDP.csv) and saves it as gdp.csv
* downloads [Country Income Grups data](https://d396qusza40orc.cloudfront.net/getdata%2Fdata%2FEDSTATS_Country.csv) and saves it as income_groups.csv
* source/clean_gdp.R - a script used for cleaning the gdp data
* cleans data from gdp.csv and creates a tidy gdp_tidy.csv file
* source/clean_income_groups.R - a script used for cleaning the clean_income_groups data
* cleans data from income_groups.csv and creates a tidy income_groups_tidy.csv file
* source/merge.R - a script used for merging the two datasets
* merges the data from gdp_tidy.csv and income_groups_tidy.csv files and creates merged_data.csv