Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/topepo/user2018
Slides and code for the 2018 useR! tutorial "Recipes for Data Processing"
https://github.com/topepo/user2018
Last synced: 7 days ago
JSON representation
Slides and code for the 2018 useR! tutorial "Recipes for Data Processing"
- Host: GitHub
- URL: https://github.com/topepo/user2018
- Owner: topepo
- License: gpl-2.0
- Created: 2018-06-25T22:29:50.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-07-13T23:11:42.000Z (over 6 years ago)
- Last Synced: 2024-11-01T04:42:14.734Z (14 days ago)
- Language: JavaScript
- Size: 8.69 MB
- Stars: 31
- Watchers: 8
- Forks: 19
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# user2018
If you need to use a remote version of R to run the code, go to:
[**`http://colorado.rstudio.com/rsc/classroom-assignment/`**](http://colorado.rstudio.com/rsc/classroom-assignment/)
If you require a server instance, ***please let us know*** before going here. We will have to assign you an ID.
Slides and code for the 2018 useR! tutorial "Recipes for Data Processing" on Tuesday July 10th at 1:30:
> R has an excellent framework for specifying models using formulas. While elegant and useful, it was designed in a time when models had small numbers of terms and complex preprocessing of data was not commonplace. As such, it has some limitations. In this tutorial, a new package called `recipes` is shown where the specification of model terms and preprocessing steps can be enumerated sequentially. The recipe can be estimated and applied to any dataset. Current options include simple transformations (log, Box-Cox, interactions, dummy variables, ...), signal extraction (PCA, PLS, ICA, MDS, ...), basis functions (splines, polynomials, ...), imputation methods, and others. An example is used to demonstrate the functionality.
The audience should include people who do feature engineering or need to include preprocessing with their models. From a technical standpoint, some experience in modeling and R is a good idea. Basic [`tidyverse`](https://www.tidyverse.org) syntax will be reviewed.
The materials will be added a few days before the tutorial. To install the required packages:
```r
install.packages(
c('AmesHousing', 'broom', 'kknn', 'recipes', 'rsample', 'tidyverse', 'yardstick'),
repos = "http://cran.r-project.org"
)# and optionally
install.packages('caret', repos = "http://cran.r-project.org")
```If you have trouble with any of these, email me at `[email protected]`.