Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/touradbaba/data_analysis_with_r_specialization

This repository includes assignments, and labs from the Data Analysis with R Specialization by Duke University on Coursera. It covers a range of topics in statistics and data analysis using R, including probability, inferential statistics, and regression modeling.
https://github.com/touradbaba/data_analysis_with_r_specialization

coursera inference inferential-statistics linear-regression r r-language statistics

Last synced: about 2 months ago
JSON representation

This repository includes assignments, and labs from the Data Analysis with R Specialization by Duke University on Coursera. It covers a range of topics in statistics and data analysis using R, including probability, inferential statistics, and regression modeling.

Awesome Lists containing this project

README

        

# Data Analysis with R Specialization

This repository contains labs, assignments, and project files for the [**Data Analysis with R Specialization**](https://www.coursera.org/specializations/statistics) on Coursera, offered by **Duke University**. The specialization consists of three courses that explore various topics in statistics and data analysis using R. Below is an overview of the repository's structure.

## 1. Introduction to Probability and Data with R
This course introduces the basics of R, RStudio, data structures, and the foundations of probability and data analysis.

### Labs
- **Week 1: Intro to R and RStudio**
Lab File: `Week-1-Lab-Intro_R_RStudio.rmd`
This lab covers the basics of R and the RStudio interface.

- **Week 2: Introduction to Data**
Lab File: `Week-2-Lab-Intro_to_Data.rmd`
Introduction to working with datasets and basic data visualization.

- **Week 3: Probability**
Lab File: `Week-3-Lab-Probabilty.rmd`
Focuses on understanding probability concepts in data analysis.

- **Week 4: Data Analysis Project**
Lab File: `Week-4-Lab-Data_Analysis_Project.rmd`
Capstone project that combines data exploration and probabilistic reasoning.

### Data
- **BRFSS 2013 Data**
File: `brfss2013.rdata`
Dataset used in The Data Analysis Project

---

## 2. Inferential Statistics
This course focuses on inferential techniques, including sampling distributions, confidence intervals, and hypothesis testing for numerical and categorical data.

### Labs
- **Week 1: Sampling Distributions**
Lab File: `Week-1-Lab-sampling_distributions_Coursera.rmd`
Introduction to the concept of sampling and its distributions.

- **Week 2: Confidence Intervals**
Lab File: `Week-2-Lab-confidence_intervals_Coursera.rmd`
Covers the theory and application of confidence intervals.

- **Week 3: Inference for Numerical Data**
Lab File: `Week-3-Lab-inf_for_numerical_data_Coursera.rmd`
Focuses on hypothesis testing for numerical data.

- **Week 4: Inference for Categorical Data**
Lab File: `Week-4-Lab-inf_for_categorical_data_Coursera.rmd`
Extends inferential techniques to categorical data.

- **Week 4: Statistical Inference Project**
Lab File: `Week-4-stat_inf_project.rmd`
Final project on hypothesis testing and inference.

### Data
- **GSS Data**
File: `gss.rdata`
General Social Survey dataset used in The Statistical Inference Project.

---

## 3. Linear Regression and Modeling
This course dives into modeling relationships between variables using linear regression, extending into multiple regression.

### Labs
- **Weeks 1 & 2: Simple Regression and Scatterplots**
Lab File: `Week-1&2-Lab-simple_regression_Coursera.rmd`
Introduction to simple linear regression with a focus on visualizing relationships.

- **Week 3: Multiple Regression**
Lab File: `Week-3-Lab-multiple_regression_Coursera.rmd`
Learn to model more complex relationships using multiple regression.

- **Week 3: Regression Modeling Project**
Lab File: `Week-3-reg_model_project.rmd`
Final project on building and interpreting linear regression models.

### Data
- **Movies Data**
File: `movies.rdata`
Dataset used in The Regression Modeling Project.

---

## Repository Files
- **.gitattributes**
Git configuration file to handle file attributes.

- **.gitignore**
Specifies files and directories to be ignored by Git.

---

## Usage Instructions

To get started with this repository, it is recommended to install **R** and **RStudio**. Additionally, make sure to install the following R packages:

```r
library(statsr)
library(dplyr)
library(shiny)
library(ggplot2)
```

### Cloning the Repository

Clone this repository to your local machine using the following command:

```bash
git clone https://github.com/TouradBaba/Data_Analysis_with_R_Specialization.git
```

### Running the R Markdown Files

Once the repository is cloned, you can run the R Markdown (`.rmd`) files in RStudio. Open the desired `.rmd` file and use the "Knit" button to generate the output document.

---

## Acknowledgments

This repository is based on the **Data Analysis with R Specialization** offered by **Duke University** on Coursera. All credit for the course content goes to the creators of the specialization.

For more details and to enroll in the specialization, visit the official course page:
[Data Analysis with R Specialization on Coursera](https://www.coursera.org/specializations/statistics).