Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ablack3/cleaning_data_course_project
This is the course project for the Getting and Cleaning Data course on Coursera.
https://github.com/ablack3/cleaning_data_course_project
Last synced: 12 days ago
JSON representation
This is the course project for the Getting and Cleaning Data course on Coursera.
- Host: GitHub
- URL: https://github.com/ablack3/cleaning_data_course_project
- Owner: ablack3
- Created: 2015-07-16T00:55:45.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2015-07-16T19:22:53.000Z (over 9 years ago)
- Last Synced: 2023-08-17T02:05:34.396Z (over 1 year ago)
- Language: R
- Size: 133 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# cleaning_data_course_project
This repository contains the course project for the Getting and Cleaning Data course on Coursera.## run_analysis.R
This R script creates a tidy data set from the raw UCI smartphone data. It requires that the user has already downloaded and uzipped the data. The user will also need to set the working directory to be the downloaded, unzipped folder containing the data called "UCI HAR Dataset". The script will perform the following actions.
- Read the training and test data into R
- Label the activities with descriptive variable names such as "LAYING", "SITTING", ect.
- Rename the variables in the data with meaningful variable names taken from features.txt document
- Subsets the data selecting only variables that contain the substring "mean" or "std" using a regular expression
- Saves the data as a text file named UCI_HAR_tidy.txt in the working directoryThe output file, UCI_HAR_tidy.txt, meets the requirements of this assignment because it has
- one variable per column
- one observation per row
- descriptive variable names
- and descriptive activity labels.The variable names are not all that meaninful to me but I am assuming they would make sense to a final user of the data. I chose to keep any varible that had the substring "mean" or "std" because I wanted to be sure that all means and standard deviations were included.
## CodeBook.md
This document describes all of the variables in UCI_HAR_tidy.txt