https://github.com/post2web/ddscasestudy1
Case study 1 for Doing Data Science at SMU
https://github.com/post2web/ddscasestudy1
casestudy1 dds
Last synced: about 1 year ago
JSON representation
Case study 1 for Doing Data Science at SMU
- Host: GitHub
- URL: https://github.com/post2web/ddscasestudy1
- Owner: post2web
- Created: 2017-02-26T15:06:29.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2017-03-19T17:20:57.000Z (about 9 years ago)
- Last Synced: 2025-02-03T23:43:06.940Z (over 1 year ago)
- Topics: casestudy1, dds
- Language: R
- Homepage:
- Size: 2.71 MB
- Stars: 2
- Watchers: 4
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# DDSCaseStudy1
This is a Case Study 1 project for DDS at SMU.
## Instructions for running the project
* Clone the repository:
* git clone https://github.com/post2web/DDSCaseStudy1
* Go in the DDSCaseStudy1 folder and open Makefile.Rmd with RStudio ( having R Working Directory set to the root of the project )
* Knit the Makefile.Rmd
## Files and folders
* Makefile.Rmd - build the project in a Makefile stile
* Analysis.Rmd - contains all analysis of the data
* data - a folder container for the data files
* source - a folder containing all scripts used for downloading and cleaning
* source/download.R - a script used for downloading the csv data files
* downloads [GDP data](https://d396qusza40orc.cloudfront.net/getdata%2Fdata%2FGDP.csv) and saves it as gdp.csv
* downloads [Country Income Grups data](https://d396qusza40orc.cloudfront.net/getdata%2Fdata%2FEDSTATS_Country.csv) and saves it as income_groups.csv
* source/clean_gdp.R - a script used for cleaning the gdp data
* cleans data from gdp.csv and creates a tidy gdp_tidy.csv file
* source/clean_income_groups.R - a script used for cleaning the clean_income_groups data
* cleans data from income_groups.csv and creates a tidy income_groups_tidy.csv file
* source/merge.R - a script used for merging the two datasets
* merges the data from gdp_tidy.csv and income_groups_tidy.csv files and creates merged_data.csv