Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nihonjinrxs/dwdc-august2014
Slides and materials for my talk for Data Wranglers DC on August 6, 2014
https://github.com/nihonjinrxs/dwdc-august2014
Last synced: about 2 months ago
JSON representation
Slides and materials for my talk for Data Wranglers DC on August 6, 2014
- Host: GitHub
- URL: https://github.com/nihonjinrxs/dwdc-august2014
- Owner: nihonjinrxs
- License: mit
- Created: 2014-07-18T02:13:35.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2014-08-06T22:01:39.000Z (over 10 years ago)
- Last Synced: 2024-10-21T21:35:49.055Z (3 months ago)
- Language: CSS
- Homepage: http://nihonjinrxs.github.io/dwdc-august2014/
- Size: 7.64 MB
- Stars: 5
- Watchers: 2
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# SQL, the Sequel
##### More SQL in the Database, and SQL in Data Science Contexts (i.e. SQL on DataFrames)This repository contains materials for [my talk at the Data Wranglers DC meetup on August 6, 2014](http://www.meetup.com/Data-Wranglers-DC/events/177269432/), which is a follow-on to [my talk at the Data Wranglers DC meetup on June 4, 2014](http://www.meetup.com/Data-Wranglers-DC/events/171768162/). Materials for the prior talk are in the GitHub Repo [nihonjinrxs/dwdc-june2014](http://www.github.com/nihonjinrxs/dwdc-june2014).
### Contents
The talk consists of two major directions:
- Using more advanced SQL techniques in a database system (examples in PostgreSQL) to script auto-updating computations
- Using SQL on data frames in R and in Python (also maybe Julia?)Folders are as follows:
- A slide deck (`./slides`) in Apple Keynote, [PDF](http://nihonjinrxs.github.io/dwdc-august2014/DWDC-August2014-RyanHarvey.pdf) and [HTML](http://nihonjinrxs.github.io/dwdc-august2014) formats
- A set of SQL scripts (`./sql`) that create the local PostgreSQL database objects demonstrating creation and use of views, custom functions and indexes for use in data analysis
- An RMarkdown document (`./R`), [published on RPubs](http://rpubs.com/ryanbharvey/dwdc-august2014), that demonstrates using `sqldf` in R to perform SQL queries on data frames as if they are tables
- An IPython notebook (`./python`), [available at IPython nbviewer](http://nbviewer.ipython.org/github/nihonjinrxs/dwdc-august2014/blob/master/python/sqldf_examples_python.ipynb), that demonstrates using `sqldf` from the `pandasql` package to perform SQL queries on Pandas DataFrame objects as if they are tables
- An IJulia notebook document (`./julia`) that demonstrates using `sqldf` from the `SQLite.jl` package in Julia to perform SQL queries on data frames as if they are tables (in progress, and not working yet)### Where do I start?
I recommend that anyone wishing to understand what I've done should start with [the prior talk materials](http://www.github.com/nihonjinrxs/dwdc-june2014), then tackle these pieces in order, starting with the slide deck.### Future Work
Given time, I hope to get `sqldf` working in Julia as well - being a young language, it's a little finicky at the moment. Also, a few examples of SQL views with INSERT and UPDATE rules and a SQL trigger or two would be a nice addition.### Disclaimer
This work and the opinions expressed here are my own, and do not purport to represent the views of my current or former employers.