https://github.com/dcs-training/summer-school-23-stream-2-text-and-data-analysis-in-the-wild

This is the repo associated with the 2023 CDCS Summer School

data-analysis data-visualisation data-wrangling r sentiment-analysis statistics text-analysis web-scraping

# CDCS Summer School 2023: Stream 2
Welcome to the repo for Stream 2 of the CDCS Data and Text Analysis Summer School. Here you can access all the material, data, and instructions connected to the school.

This course is designed to help researchers with coding experience understand how data and text analysis projects are performed in a research environment.
It starts with identifying a series of research questions connected to this year’s core topic (cost of living in Scotland and the UK). Then it explores how computational methods can be used to obtain, clean, and analyse structured and unstructured datasets in R to answer those questions.

Topics will include **web scraping**, **text analysis**, **sentiment analysis**, **data wrangling**, **statistics**, and **data visualisation**.
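
To give a flavour of the text and sentiment analysis topics, here is a minimal base-R sketch that tokenises a few sentences, counts word frequencies, and scores each sentence against a tiny word list. It is not taken from the course material: the example sentences, the `tokenise()` and `score_doc()` helpers, and the `lexicon` values are all hypothetical and chosen only for illustration.

```r
# Toy documents invented for illustration; not Summer School data.
docs <- c(
  "Energy bills are rising and rent is too high",
  "Good news: food prices are falling",
  "High rent and rising bills are a bad sign"
)

# Tokenise: lower-case, strip everything except letters and spaces,
# then split on runs of spaces.
tokenise <- function(text) {
  cleaned <- gsub("[^a-z ]", "", tolower(text))
  words <- strsplit(cleaned, " +")[[1]]
  words[words != ""]
}

tokens <- lapply(docs, tokenise)

# Word frequencies across all documents, most frequent first.
word_counts <- sort(table(unlist(tokens)), decreasing = TRUE)

# Hypothetical mini-lexicon: +1 for "positive" words, -1 for "negative" ones.
# The values are arbitrary and only make sense for this toy example.
lexicon <- c(good = 1, falling = 1, rising = -1, high = -1, bad = -1)

# Sentiment score per document: sum of the lexicon values of its words.
# Words missing from the lexicon yield NA and are dropped by na.rm = TRUE.
score_doc <- function(words) {
  sum(lexicon[words], na.rm = TRUE)
}
sentiment <- vapply(tokens, score_doc, numeric(1))

print(head(word_counts, 5))
print(data.frame(doc = seq_along(docs), sentiment = sentiment))
```

A real project would more likely rely on dedicated packages and published sentiment lexicons; base R is used here only so the sketch runs without installing anything.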

## Key principles:
- General knowledge of the RStudio interface and of coding is required.
- Fosters interdisciplinary thinking by bringing social science and humanities researchers together to explore methods.
- Covers both structured and unstructured data.
- Illustrates the development process of data-led projects by moving through phases of the project lifecycle.
- Challenge-led, helping researchers learn how to deal with real-world data.

## RStudio Refresher
Although the Text and Data Analysis in the Wild stream will provide attendees with many new digital skills, a level of prior knowledge is necessary to get the most out of our training. Try our short quiz to test your knowledge of R, covering some of the basics you should be familiar with before the Summer School.

[Take the quiz](https://forms.office.com/e/cjsdkpbyMv)

If you didn't do as well as you'd hoped, don't worry! [In this video](https://edin.ac/3JzOM0P), Training Manager Dr. Lucia Michelin goes through a refresher of some of the core R tools and techniques.

## Content of the Repository
Each day of the Summer School has its own folder. Within each day, you will find the slides for the day and a folder for each block containing a data file/folder and the code that will be used during the class.

## Summer School Time Table
| |MONDAY|TUESDAY|WEDNESDAY|THURSDAY|FRIDAY|
|---|---|---|---|---|---|
|09:40-10:40| Seminar| Seminar| Seminar| Seminar| Seminar|
|10:40-11:00| Coffee Break| Coffee Break| Coffee Break| Coffee Break| Coffee Break|
|11:00-12:30| Introduction| Text Analysis| Sentiment Analysis| Data Analysis| Data Visualisation|
|12:30-13:30| Lunch Break| Lunch Break| Lunch Break| Lunch Break| Lunch Break|
|13:30-15:00| Web Scraping| Text Analysis| Data Wrangling| Data Analysis| Data Visualisation|
|15:00-15:30| Coffee Break| Coffee Break| Coffee Break| Coffee Break| Coffee Break|
|15:30-17:00| BYOD| BYOD| BYOD| Keynote Lecture| Next Steps|
|Evening|Pub Quiz|Pub Crawl|Ceilidh|Drinks Reception|Dinner|

## Summer School Format
- The Summer School is meant to be interactive, and you will be prompted to replicate on your own machine what the instructor is demonstrating.

- Besides the instructors, helpers will be present in the call to assist if you get stuck or run into an error. Please use your sticky notes to flag problems, and someone will help you address them. The introduction at the start of the Summer School will cover how to ask for help in more detail.

- We promote an inclusive and welcoming environment, so we ask you to be respectful towards our instructors, helpers, and fellow participants. Disruptive behaviour will not be tolerated, and you will be asked to leave if it occurs. More information can be found in the Summer School Code of Conduct document.

## Summer School Data
The data for the Summer School is a composite of information taken from a variety of government and charity documents. The provenance of the data is outlined below:

### Region:
International Territorial Levels (Tier 2). ONS https://www.ons.gov.uk/methodology/geography/ukgeographies/eurostat#:~:text=12.-,Scotland,1%20areas%20in%20the%20UK.

### Food Insecurity:
Scottish Government https://www.gov.scot/publications/scottish-health-survey-2021-volume-1-main-report/pages/9/#:~:text=Levels%20of%20food%20insecurity%20have,between%208%25%20and%209%25.

### Life Expectancy:
Scottish Government https://www.gov.scot/publications/national-care-service-scotlands-health-demographic-profile/pages/4/

### House Prices:
UK Government https://www.gov.uk/government/statistical-data-sets/uk-house-price-index-data-downloads-july-2022?utm_medium=GOV.UK&utm_source=scotland&utm_campaign=UKHPI_Scotlandreport&utm_term=9.30_14_09_22&utm_content=download_data

### Welfare Applications:
Scottish Government https://www.gov.scot/collections/sg-social-security-scotland-stats-publications/

### Homelessness:
Scottish Government https://www.gov.scot/news/homelessness-statistics-2021-22/

### Gas Consumption:
Scottish Energy Statistics Hub. https://www.gov.scot/publications/scottish-energy-statistics-hub-index/

### Businesses:
Scotland Growth Sector Statistics. https://www.gov.scot/publications/growth-sector-statistics/

### Foodbank Use:
The Trussell Trust. https://www.trusselltrust.org/

### Average Rent:
Scottish Government https://www.gov.scot/publications/private-sector-rent-statistics-scotland-2010-2022/

### Energy Bill:
UK Government https://www.gov.uk/government/statistical-data-sets/annual-domestic-energy-price-statistics

### SIMD:
https://simd.scot/#/simd2020/BTTTFTT/9/-4.0000/55.9000/

## Licence of the material
All the material collected here is covered by a CC BY-NC 4.0 licence.