Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/naufalbasara/scrapestaurant
End-to-end Surabaya restaurants scraping project
beautifulsoup4 etl-pipeline python scraping selenium
Last synced: 2 months ago
- Host: GitHub
- URL: https://github.com/naufalbasara/scrapestaurant
- Owner: naufalbasara
- Created: 2024-08-22T13:09:05.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-08-26T17:46:11.000Z (4 months ago)
- Last Synced: 2024-09-29T02:01:25.121Z (3 months ago)
- Topics: beautifulsoup4, etl-pipeline, python, scraping, selenium
- Language: Python
- Homepage:
- Size: 94.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Surabaya Restaurants Data Scraping
## Overview
Surabaya is the capital of East Java Province, Indonesia. As the second-largest city in Indonesia, it has a rich and diverse culture, and food is an integral part of that culture. This project gathers publicly available data from the Pergi Kuliner website (pergikuliner), a dining review directory where users can review food across Indonesia, from street stalls to five-star restaurants. The data are scraped with Selenium and Beautiful Soup, transformed with Python, and stored in Google Sheets through Google Cloud API calls. The stored data feed the end product: dashboards built in Looker Studio.
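To make the transform step more concrete, here is a minimal sketch of how scraped records might be cleaned into rows before being written to a sheet. It is not the project's actual code: the field names (`name`, `rating`, `price`), the sample records, and the helpers `parse_rating`, `parse_prices`, and `to_rows` are all hypothetical, and the price and rating formats are assumptions about what the scraped text could look like.

```python
import re
from typing import Optional

# Hypothetical raw records, shaped like what a scraper might return.
# Field names and values are illustrative, not taken from the project.
RAW_RECORDS = [
    {"name": "  Warung Contoh ", "rating": "4,3", "price": "Rp 50.000 - Rp 100.000"},
    {"name": "Kafe Sampel", "rating": "", "price": ""},
]

HEADER = ["name", "rating", "min_price_idr", "max_price_idr"]


def parse_rating(text: str) -> Optional[float]:
    """Convert a rating such as '4,3' or '4.3' to a float; None if unparseable."""
    try:
        return float(text.strip().replace(",", "."))
    except ValueError:
        return None


def parse_prices(text: str) -> list[int]:
    """Extract rupiah amounts such as 'Rp 50.000' as integers (assumed format)."""
    return [int(m.replace(".", "")) for m in re.findall(r"Rp\s*(\d[\d.]*)", text)]


def to_rows(records: list[dict]) -> list[list]:
    """Build a header row followed by one cleaned row per record."""
    rows = [HEADER]
    for rec in records:
        prices = parse_prices(rec["price"])
        rows.append([
            rec["name"].strip(),
            parse_rating(rec["rating"]),
            min(prices) if prices else None,
            max(prices) if prices else None,
        ])
    return rows


rows = to_rows(RAW_RECORDS)
```

The list-of-rows shape is chosen because the Google Sheets API accepts cell values as a list of lists, so `rows` could be passed on to whichever client the load step uses.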
## Data Pipeline
![image](https://github.com/user-attachments/assets/67feaddd-0e23-4cb8-bcb2-fe93f8cfcf8c)
## Data Visualization
**Stacks**: Selenium, Beautiful Soup, Python, Looker Studio
**Dashboard Link**: [Dashboard](https://lookerstudio.google.com/s/lQxLamhoSek)