{"id":13566919,"url":"https://github.com/0x0be/scrapeadvisor","last_synced_at":"2025-04-04T00:32:28.478Z","repository":{"id":43247233,"uuid":"197433602","full_name":"0x0be/scrapeadvisor","owner":"0x0be","description":"A user-friendly python-based GUI which provides sentiment analysis of users' reviews toward a specific TripAdvisor facility ","archived":false,"fork":false,"pushed_at":"2023-05-22T22:29:09.000Z","size":666,"stargazers_count":27,"open_issues_count":5,"forks_count":7,"subscribers_count":1,"default_branch":"master","last_synced_at":"2024-11-04T21:37:12.551Z","etag":null,"topics":["data-mining","data-science","python3","r","scraping","sentiment-analysis","sentiment-classification","text-mining","tripadvisor","tripadvisor-scraper","web-scraping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/0x0be.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2019-07-17T17:25:43.000Z","updated_at":"2024-07-30T07:26:53.000Z","dependencies_parsed_at":"2024-04-07T23:44:04.697Z","dependency_job_id":null,"html_url":"https://github.com/0x0be/scrapeadvisor","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/0x0be%2Fscrapeadvisor","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/0x0be%2Fscrapeadvisor/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/0x0be%2Fscrapeadvisor/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/reposito
ries/0x0be%2Fscrapeadvisor/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/0x0be","download_url":"https://codeload.github.com/0x0be/scrapeadvisor/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247103290,"owners_count":20884023,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-mining","data-science","python3","r","scraping","sentiment-analysis","sentiment-classification","text-mining","tripadvisor","tripadvisor-scraper","web-scraping"],"created_at":"2024-08-01T13:02:19.565Z","updated_at":"2025-04-04T00:32:28.128Z","avatar_url":"https://github.com/0x0be.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n  \u003ca href=\"\" rel=\"noopener\"\u003e\n \u003cimg width=200px height=200px src=\"https://github.com/blackeko/scrapeadvisor/blob/media/logo.png\" alt=\"Project logo\"\u003e\u003c/a\u003e\n\u003c/p\u003e\n\n\u003ch3 align=\"center\"\u003escrapeadvisor\u003c/h3\u003e\n\n\u003cdiv align=\"center\"\u003e\n\n  [![Status](https://img.shields.io/badge/status-active-success.svg)]() \n  [![License](https://img.shields.io/badge/license-GPL3-blue.svg)](/LICENSE)\n\n\u003c/div\u003e\n\n---\n\n\u003cp align=\"center\"\u003e\n\tA user-friendly python-based GUI which provides sentiment analysis of users' reviews toward a specific TripAdvisor facility \n    \u003cbr\u003e \n\u003c/p\u003e\n\n## Table of Contents\n\n- [About](#about)\n- [Getting Started](#getting_started)\n- [Run](#run)\n- [Usage](#usage)\n- 
[Statistics](#statistics)\n- [Supported Languages](#languages)\n- [Built Using](#built_using)\n- [Authors](#authors)\n- [Acknowledgments](#acknowledgement)\n- [Disclaimer](#disclaimer)\n\n## About \u003ca name = \"about\"\u003e\u003c/a\u003e\n\nIf you're reading, dear Tripadvisor, Inc., hire me!\n\n## Getting Started \u003ca name = \"getting_started\"\u003e\u003c/a\u003e\n\n### Prerequisites\n\n- [Python](https://www.python.org/downloads/) installed \n- [R](https://cran.r-project.org/bin/windows/base/) installed \n\n### Installing\n\nInstall all Python dependencies with:\n\n```console\nscrape@advisor:~$ pip3 install -r requirements.txt\n```\n\nAlso, the following R packages are needed:\n\n- dplyr \n- readr \n- lubridate\n- ggplot2\n- tidytext\n- tidyverse \n- stringr\n- tidyr\n- scales\n- broom \n- purrr\n- widyr \n- igraph\n- ggraph\n- SnowballC\n- wordcloud\n- reshape2\n- TeachingDemos\n\nYou can manually install missing ones with: \n\n```R\ninstall.packages(\"library_name\")\n```\n\nor run [this script](https://github.com/blackeko/scrapeadvisor/blob/master/ipak.R) (credit to [@stevenworthington](https://gist.github.com/stevenworthington)) to install them all.\n\n### Note\n\nFor Italian language support, the **TextWiller** library must be installed.\u003cbr/\u003e\nTo do that:\n\n```R\ninstall.packages(\"devtools\") \ndevtools::install_github(\"livioivil/TextWiller\")\n```\n\n## Run \u003ca name = \"run\"\u003e\u003c/a\u003e\n\nTo launch the *scrapeadvisor* GUI, run:\n\n```console\nscrape@advisor:~$ python3 ScrapeAdvisor.py\n```\n\n## Usage \u003ca name=\"usage\"\u003e\u003c/a\u003e\n\n### Insert URL\n\n1. Enter the main page URL of a TripAdvisor facility (pub, restaurant, hotel, etc.) in the **URL bar** and click **Enter** (or press Enter)\n2. 
Wait until the **\"Reviews Loaded\"** label appears (this may take a while, depending on the number of reviews)\n\n### Show Reviews\n\nAfter the download is finished, press **\"Show Reviews\"** to see all the downloaded reviews.\n\n### Sentiment Analysis\n\nAfter the download is finished, press the **\"Sentiment Analysis\"** button and wait: all the graphs related to the facility will then appear, and you can **swipe** between them.\n\n## Statistics \u003ca name=\"statistics\"\u003e\u003c/a\u003e\n\n- Frequent **pairs/triples of consecutive words** (bigrams/trigrams)\n- Most **positive/negative review**\n- Top **positive/negative sentiments** of users\n- The **trending/shrinking words** \n- **Users' main cities**\n\n## Screenshot \u003ca name=\"screenshot\"\u003e\u003c/a\u003e\n\n\u003ctable style=\"width:100%\"\u003e\n\t\t\u003ctr\u003e\n\t\t\t\u003ctd\u003e\u003cimg src=\"https://github.com/blackeko/scrapeadvisor/blob/media/word_cloud.png\" \u003e\u003c/td\u003e\n\t\t\t\u003ctd\u003e\u003cimg src=\"https://github.com/blackeko/scrapeadvisor/blob/media/common_words.png\" \u003e\u003c/td\u003e\n\t\t\u003c/tr\u003e\n\t\t\u003ctr\u003e\n\t\t\t\u003ctd\u003e\u003cimg src=\"https://github.com/blackeko/scrapeadvisor/blob/media/sent_afinn.png\" \u003e\u003c/td\u003e\n\t\t\t\u003ctd\u003e\u003cimg src=\"https://github.com/blackeko/scrapeadvisor/blob/media/shrinking.png\" \u003e\u003c/td\u003e\n\t\t\u003c/tr\u003e\n\u003c/table\u003e\n\n## Supported Languages \u003ca name=\"languages\"\u003e\u003c/a\u003e\n\n- English\n- Italian\n\n## Built Using \u003ca name = \"built_using\"\u003e\u003c/a\u003e\n\n- [Kivy](https://kivy.org/#home) - GUI\n- [Beautiful Soup](https://www.crummy.com/software/BeautifulSoup/bs4/doc/) - HTML scraping \n- [R](https://www.r-project.org/about.html) - Sentiment Analysis\n\n\n## Acknowledgements \u003ca name = \"acknowledgement\"\u003e\u003c/a\u003e\n\n- [@susanli2016](https://github.com/susanli2016) - [Web Scraping 
TripAdvisor](https://towardsdatascience.com/scraping-tripadvisor-text-mining-and-sentiment-analysis-for-hotel-reviews-cc4e20aef333)\n- [TextWiller](https://github.com/livioivil/TextWiller) - For providing Italian stop words and lexicon \n- All the other [packages](#about) - Thank you for being you\n\n## Disclaimer\n\n*Scrapeadvisor* is provided under this License on an AS-IS basis, **without warranty of any kind**, either expressed, implied, or statutory, including, without limitation, warranties that *scrapeadvisor* is free of defects, merchantable, fit for a particular purpose or non-infringing.\n\nTo the extent permitted under Law, *scrapeadvisor* is provided on an AS-IS basis. The *scrapeadvisor* Team shall never, and without any limit, be liable for any damage, cost, expense or any other payment incurred as a result of *scrapeadvisor*'s actions, failure, bugs and/or any other interaction between *scrapeadvisor* and end-equipment, computers, other software or any 3rd party, end-equipment, computer or services.\n\nWe **do not encourage** running *scrapeadvisor* against Tripadvisor without prior mutual consent. The *scrapeadvisor* Team accept no liability and are not responsible for any misuse or damage caused by *scrapeadvisor*.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2F0x0be%2Fscrapeadvisor","html_url":"https://awesome.ecosyste.ms/projects/github.com%2F0x0be%2Fscrapeadvisor","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2F0x0be%2Fscrapeadvisor/lists"}