https://github.com/2DegreesInvesting/ds.database
https://github.com/2DegreesInvesting/ds.database
Last synced: 5 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/2DegreesInvesting/ds.database
- Owner: 2DegreesInvesting
- License: other
- Archived: true
- Created: 2022-08-30T06:57:27.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-09-27T13:54:16.000Z (over 2 years ago)
- Last Synced: 2024-08-13T07:11:36.097Z (8 months ago)
- Language: R
- Size: 2.69 MB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.Rmd
- License: LICENSE
Awesome Lists containing this project
- jimsghstars - 2DegreesInvesting/ds.database - (R)
README
---
output: github_document
---```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>"
)
```# Relational data
The goal of this series is help you understand and work with multiple,
related tables of data.At the end of this series you will be able to do things like these:
* Understand and use relational data with the dplyr package.
* Define and use relational data models with the dm package.
* Use remote databases.## Who is the audience?
Anyone who works with two or more related tables of data.
## Why is this important?
> It's rare that a data analysis involves only a single table of
data. Typically you have many tables of data, and you must combine them to
answer the questions that you're interested in. Collectively, multiple
tables of data are called relational data because it is the relations,
not just the individual datasets, that are important.
-- https://r4ds.had.co.nz/relational-data.html## Syllabus
### Relational data with dplyr
This meetup covers how to work with relational data with dplyr.
Objectives:
* Understand what is relational data.
* Understand and use the different types of joins.
* Understand what happens when a key has duplicated values.
* Learn how to validate the keys to avoid problems.### Work with relational data with the dm package
This meetup covers how to work with relational package with the dm package.
Objectives:
* Create, visualize, check, and use complex datasets.
### Validating data quality
This meetup covers how to validate the quality of a dataset.
Objectives:
* Validate data quality with the [pointblank package](https://rich-iannone.github.io/pointblank/).
* Validate data quality with lower-level, developer-oriented tools.### Using a Relational Database Management System (RDBMS)
This meetup covers how to copy a data model to a RDBMS and how to later use it.
Objectives:
* Connect to a RDBMS (an .sqlite file).
* Copy a data model to a RDBMS.
* Use the RDBMS with dplyr.
* Use the RDBMS with dm.## Resources
* YouTube [playlist](https://bit.ly/ds-incubator-videos).
* The
[ds-incubator](https://github.com/2DegreesInvesting/ds-incubator#ds-incubator)
project.
* [Ideas](https://bit.ly/dsi-ideas) for future meetups.
* [Relational data](https://r4ds.had.co.nz/relational-data.html).
* [The dm package](https://cynkra.github.io/dm/).
* [The dbplyr package](https://dbplyr.tidyverse.org/).## Thanks
Tanks Mirja for motivating me to learn more about relational data.