https://github.com/ecoronado92/tdi_project_proposal
https://github.com/ecoronado92/tdi_project_proposal
Last synced: 3 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/ecoronado92/tdi_project_proposal
- Owner: ecoronado92
- Created: 2019-11-03T19:54:54.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2019-11-13T14:44:22.000Z (over 5 years ago)
- Last Synced: 2025-03-14T00:26:54.729Z (3 months ago)
- Language: R
- Size: 3.51 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# TDI Project Proposal (Spring 2020)
The following repo contains an early rough draft of The Data Incubator 2020 project proposal:
- `scrape_tripadvisor.R`: Script to scrape data from TripAdvisor. __Note__: TripAdvisor is the sole proprietary of said data.
- `eda_tripadvisor.R`: Script to generate figures 1 and 2 from this repository
- `tripadvisor_reviews.rds`: Subset of scraped TripAdvisor reviews data
The `tripadvisor_reviews.rds` contains the following fields:
- `id`: review id
- `hotel_name` : hotel name
- `review_title`: title given to review
- `comment`: user review
- `author_info`: simple information about who wrote the review (e.g. username and date)
- `user_name` : user name (uniquely identifies reviewer)
- `rating`: numerical value of rating based on 1-5 bubbles in webpage