Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sandravizz/visual-data-science-r
Visual explorative analysis in R from scratch mainly using ggplot
https://github.com/sandravizz/visual-data-science-r
data-science data-visualization ggplot r
Last synced: about 17 hours ago
JSON representation
Visual explorative analysis in R from scratch mainly using ggplot
- Host: GitHub
- URL: https://github.com/sandravizz/visual-data-science-r
- Owner: sandravizz
- Created: 2022-10-07T12:46:34.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-10-16T16:05:18.000Z (22 days ago)
- Last Synced: 2024-10-19T10:32:25.216Z (19 days ago)
- Topics: data-science, data-visualization, ggplot, r
- Language: R
- Homepage: https://slides.com/sandraviz/rggplot2
- Size: 1.56 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Visual data science
Please check the corresponding [presentation](https://slides.com/sandraviz/rggplot2/embed?style=light).
![Alt text](Images/R1.png)
Explorative data visualisation is a very important part when analysing the structure in our data. It should always be one of the first steps when starting a new data science project.
### Setup
In case this is your first time using R you need to install your own copy as well as RStudio, a software application that makes R easier to use. Both R and RStudio are free and easy to download. R is maintained by an international team of developers. The top of the web page provides three links for downloading R. Under this [link](https://posit.co/products/open-source/rstudio/) you can download Rstudio
### Introduction
In this chapter I introduce the concept of the R visualisation package ggplot. Regarding the installation please check the [code](https://github.com/sandravizz/visual-data-science-R/blob/main/Scripts/Installations.R). Afterwards it is the next step to define a theme, please check the [code](https://github.com/sandravizz/visual-data-science-R/blob/main/Scripts/Themes.R).
### Distributions
In this chapter we look at different visualisation types showing the distribution of one variables. Please check the [code](https://github.com/sandravizz/visual-data-science-R/blob/main/Scripts/Distributions.R).
Chart types covered
- Frequency plot
- Histogram
- Box-plot
- Density plot### Relationships
In this chapter we look at different visualisation types presenting the relationships between two variables. Please check the [code](https://github.com/sandravizz/visual-data-science-R/blob/main/Scripts/Explorative%20analysis.R).
Chart types covered
- Scatter plot
- Beeswarm plot
- Hexagonal binning
- Heatmap### Time series
Time series data have a very specific structure. The goal is to understand over time patterns (trends) which are often presented as line charts. The focus of this chapter is on the visual analytical flow when investigating patterns over time. Please check the [code](https://github.com/sandravizz/visual-data-science-R/blob/main/Scripts/Time%20series.R).
### Advanced visualisations
In this chapter I present three more advanced visualisation types as their strucutre is less commonly used and/or cover higer visual complexity. Pleased check the [code](https://github.com/sandravizz/visual-data-science-R/blob/main/Scripts/Advanced%20visualisations.R).
Chart types covered
- Parallel coordinates
- Dumbbell chart
- Waffle chart### Markdown
The markdown option is a good way to save the whole project including comments and summaries. Please check the [code](https://github.com/sandravizz/visual-data-science-R/blob/main/Scripts/Markdown.R).