Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/touradbaba/data_analysis_with_r_specialization
This repository includes assignments, and labs from the Data Analysis with R Specialization by Duke University on Coursera. It covers a range of topics in statistics and data analysis using R, including probability, inferential statistics, and regression modeling.
https://github.com/touradbaba/data_analysis_with_r_specialization
coursera inference inferential-statistics linear-regression r r-language statistics
Last synced: about 2 months ago
JSON representation
This repository includes assignments, and labs from the Data Analysis with R Specialization by Duke University on Coursera. It covers a range of topics in statistics and data analysis using R, including probability, inferential statistics, and regression modeling.
- Host: GitHub
- URL: https://github.com/touradbaba/data_analysis_with_r_specialization
- Owner: TouradBaba
- Created: 2024-08-22T17:47:59.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2024-11-16T18:32:42.000Z (about 2 months ago)
- Last Synced: 2024-11-16T19:29:55.043Z (about 2 months ago)
- Topics: coursera, inference, inferential-statistics, linear-regression, r, r-language, statistics
- Homepage: https://www.coursera.org/specializations/statistics
- Size: 2.57 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data Analysis with R Specialization
This repository contains labs, assignments, and project files for the [**Data Analysis with R Specialization**](https://www.coursera.org/specializations/statistics) on Coursera, offered by **Duke University**. The specialization consists of three courses that explore various topics in statistics and data analysis using R. Below is an overview of the repository's structure.
## 1. Introduction to Probability and Data with R
This course introduces the basics of R, RStudio, data structures, and the foundations of probability and data analysis.### Labs
- **Week 1: Intro to R and RStudio**
Lab File: `Week-1-Lab-Intro_R_RStudio.rmd`
This lab covers the basics of R and the RStudio interface.
- **Week 2: Introduction to Data**
Lab File: `Week-2-Lab-Intro_to_Data.rmd`
Introduction to working with datasets and basic data visualization.
- **Week 3: Probability**
Lab File: `Week-3-Lab-Probabilty.rmd`
Focuses on understanding probability concepts in data analysis.- **Week 4: Data Analysis Project**
Lab File: `Week-4-Lab-Data_Analysis_Project.rmd`
Capstone project that combines data exploration and probabilistic reasoning.### Data
- **BRFSS 2013 Data**
File: `brfss2013.rdata`
Dataset used in The Data Analysis Project---
## 2. Inferential Statistics
This course focuses on inferential techniques, including sampling distributions, confidence intervals, and hypothesis testing for numerical and categorical data.### Labs
- **Week 1: Sampling Distributions**
Lab File: `Week-1-Lab-sampling_distributions_Coursera.rmd`
Introduction to the concept of sampling and its distributions.- **Week 2: Confidence Intervals**
Lab File: `Week-2-Lab-confidence_intervals_Coursera.rmd`
Covers the theory and application of confidence intervals.- **Week 3: Inference for Numerical Data**
Lab File: `Week-3-Lab-inf_for_numerical_data_Coursera.rmd`
Focuses on hypothesis testing for numerical data.- **Week 4: Inference for Categorical Data**
Lab File: `Week-4-Lab-inf_for_categorical_data_Coursera.rmd`
Extends inferential techniques to categorical data.- **Week 4: Statistical Inference Project**
Lab File: `Week-4-stat_inf_project.rmd`
Final project on hypothesis testing and inference.### Data
- **GSS Data**
File: `gss.rdata`
General Social Survey dataset used in The Statistical Inference Project.---
## 3. Linear Regression and Modeling
This course dives into modeling relationships between variables using linear regression, extending into multiple regression.### Labs
- **Weeks 1 & 2: Simple Regression and Scatterplots**
Lab File: `Week-1&2-Lab-simple_regression_Coursera.rmd`
Introduction to simple linear regression with a focus on visualizing relationships.- **Week 3: Multiple Regression**
Lab File: `Week-3-Lab-multiple_regression_Coursera.rmd`
Learn to model more complex relationships using multiple regression.- **Week 3: Regression Modeling Project**
Lab File: `Week-3-reg_model_project.rmd`
Final project on building and interpreting linear regression models.### Data
- **Movies Data**
File: `movies.rdata`
Dataset used in The Regression Modeling Project.---
## Repository Files
- **.gitattributes**
Git configuration file to handle file attributes.
- **.gitignore**
Specifies files and directories to be ignored by Git.---
## Usage Instructions
To get started with this repository, it is recommended to install **R** and **RStudio**. Additionally, make sure to install the following R packages:
```r
library(statsr)
library(dplyr)
library(shiny)
library(ggplot2)
```### Cloning the Repository
Clone this repository to your local machine using the following command:
```bash
git clone https://github.com/TouradBaba/Data_Analysis_with_R_Specialization.git
```### Running the R Markdown Files
Once the repository is cloned, you can run the R Markdown (`.rmd`) files in RStudio. Open the desired `.rmd` file and use the "Knit" button to generate the output document.
---
## Acknowledgments
This repository is based on the **Data Analysis with R Specialization** offered by **Duke University** on Coursera. All credit for the course content goes to the creators of the specialization.
For more details and to enroll in the specialization, visit the official course page:
[Data Analysis with R Specialization on Coursera](https://www.coursera.org/specializations/statistics).