Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nihonjinrxs/dwdc-june2014
Slides and materials from my talk for Data Wranglers DC on June 4, 2014
https://github.com/nihonjinrxs/dwdc-june2014
Last synced: 3 months ago
JSON representation
Slides and materials from my talk for Data Wranglers DC on June 4, 2014
- Host: GitHub
- URL: https://github.com/nihonjinrxs/dwdc-june2014
- Owner: nihonjinrxs
- Created: 2014-06-01T02:13:19.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2014-06-04T19:12:33.000Z (over 10 years ago)
- Last Synced: 2023-03-11T11:42:51.375Z (almost 2 years ago)
- Language: CSS
- Homepage: http://nihonjinrxs.github.io/dwdc-june2014/
- Size: 7.22 MB
- Stars: 12
- Watchers: 4
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data Wrangling in SQL & Other Tools
##### Scripting Reproducible and Understandable Data Wrangling and Analysis Pipelines with Tabular and Relational DataThis repository contains materials for [my talk at the Data Wranglers DC meetup on June 4, 2014](http://www.meetup.com/Data-Wranglers-DC/events/171768162/).
### Contents
The talk consists of several major directions:
- A slide deck (`./slides`) in Apple Keynote, [PDF](https://nihonjinrxs.github.io/dwdc-june2014/DWDC-June2014-RyanHarvey.pdf) and [HTML](https://nihonjinrxs.github.io/dwdc-june2014) formats
- Sample data in CSV format (`./csv`), courtesy of [tilling](https://github.com/tilling)
- A set of SQL scripts (`./sql`) that create the local PostgreSQL database used for the examples and perform the simple linear model analysis example
- An RMarkdown document (`./R`), [published on RPubs](http://rpubs.com/ryanbharvey/dwdc-june2014), that uses the data from the database to perform the analysis in R and compare with the SQL results
- An iPython notebook document (`./python`) that uses the data from the database to perform the example analysis, compare the results across SQL and R, and plot the resulting linear models### Where do I start?
I recommend that anyone wishing to understand what I've done should tackle these pieces in order, starting with the slide deck.### Future Work
Given time and maturity of database libraries, I hope to add a parallel example in Julia soon.### Disclaimer
This work and the opinions expressed here are my own, and do not purport to represent the views of my current or former employers.