Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/datalorax/sds-r

Repo for a draft book on social data science methods with R
https://github.com/datalorax/sds-r

data-science r rstats social-data-science

Last synced: 2 months ago
JSON representation

Repo for a draft book on social data science methods with R

Awesome Lists containing this project

README

        

[![Netlify Status](https://api.netlify.com/api/v1/badges/752c3783-e9b3-4b22-ba7e-43ba4b761aff/deploy-status)](https://app.netlify.com/sites/nervous-golick-ef3308/deploys)

# Social Data Science with R
This is basically course notes that may eventually turn into a book. It's a collaborative effort with [Brendan Cullen](https://github.com/brendanhcullen) and [Ouafaa Hmaddi](https://github.com/ohmaddi) and we'll see where it goes. The plan is to have multiple chapters of interactive exercises where you won't have to always leave the book to try things out. I'll update this README as we go and content is developed to provide a better overview of what we're covering (and link to any material we have created).

## Code of Conduct

Please note that the sds-r project is released with a [Contributor Code of Conduct](https://contributor-covenant.org/version/2/0/CODE_OF_CONDUCT.html). By contributing to this project, you agree to abide by its terms.

## Book outline

### Section 1: Foundations

Getting Started
Producing your first plot
Basic R Markdown
Basic data wrangling
Types of data
Tidy data
Joins
Collaborating with git and GitHub
A collaborative exploratory data analysis example
Collaborating with git and GitHub
A collaborative exploratory data analysis example

### Section 2: Data visualization & Communication

Introduction to visualizations
Visual perception
Color
Refining your plots
Geographic data
Visualizing uncertainty
Tables & fonts
Websites in R Markdown
Flex dashboards
Shiny

### Section 3: Functional Programming

Data Types
Iteration
Batch load and processing dat
List columns
Parallel iterations
Writing functions
Package development

### Section 4: Machine Learning

Inference vs. Prediction
Ethics in Machine Learning
Cross validation
Cloud computing
Extending `lm`: Ridge, Lasso, Elastic net
Feature engineering
*K*-nearest neighbor
Decision trees
Bagged trees & Random forests
Boosted Trees
Model Stacking