Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/yuminn-k/crawling-tabelog
Crawling store information from tabelog
- Host: GitHub
- URL: https://github.com/yuminn-k/crawling-tabelog
- Owner: yuminn-k
- Created: 2023-05-06T01:33:23.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-05-25T15:08:55.000Z (over 1 year ago)
- Last Synced: 2024-05-02T04:24:28.645Z (7 months ago)
- Topics: crawler, python3
- Language: Python
- Homepage:
- Size: 4.88 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# crawling-tabelog
This repository contains Python code for crawling restaurant information from [Tabelog](https://tabelog.com/kr/fukuoka/rstLst/), a popular restaurant review website in Japan.
## Installation
Before running the script, you need to install the following Python packages:
- `requests`
- `beautifulsoup4`

You can install these packages using `pip`. For example, run the following command to install both packages:
```bash
pip install requests beautifulsoup4
```
## Usage
To run the crawler, execute the `tabelog_crawler.py` file. The script crawls the first 50 pages of restaurant listings in Fukuoka, Japan, and extracts the following information for each restaurant:
- Title
- Address
- Menu type
- Phone number
The script will store the data in a text file named `restaurant_data.txt` in the same directory.
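
The flow described above (fetch each listing page, extract the four fields, write them to a text file) can be sketched roughly as follows. This is not the code from `tabelog_crawler.py`; it is a minimal illustration under several assumptions: the CSS selectors are hypothetical placeholders that must be replaced after inspecting the live pages, the page URL pattern (base URL plus page number) is assumed, and the tab-separated output format, the one-second delay, and all function names are chosen here for illustration only.

```python
import time

import requests
from bs4 import BeautifulSoup

# Listing URL taken from this README.
BASE_URL = "https://tabelog.com/kr/fukuoka/rstLst/"

# Output column order: title, address, menu type, phone number.
FIELDS = ("title", "address", "menu_type", "phone")

# Hypothetical CSS selectors: inspect the real pages and replace these.
ITEM_SELECTOR = ".restaurant-item"
SELECTORS = {
    "title": ".restaurant-name",
    "address": ".restaurant-address",
    "menu_type": ".restaurant-genre",
    "phone": ".restaurant-phone",
}


def parse_restaurants(html):
    """Return one dict per restaurant block found in the HTML."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for item in soup.select(ITEM_SELECTOR):
        row = {}
        for field, selector in SELECTORS.items():
            node = item.select_one(selector)
            # Missing fields become empty strings instead of raising.
            row[field] = node.get_text(strip=True) if node else ""
        rows.append(row)
    return rows


def page_url(page):
    """Build a listing page URL (assumed pattern: base URL plus page number)."""
    return BASE_URL if page == 1 else f"{BASE_URL}{page}/"


def crawl(pages=50, delay=1.0, out_path="restaurant_data.txt"):
    """Fetch listing pages in order and write tab-separated rows to a text file."""
    with open(out_path, "w", encoding="utf-8") as out:
        for page in range(1, pages + 1):
            response = requests.get(page_url(page), timeout=10)
            response.raise_for_status()
            for row in parse_restaurants(response.text):
                out.write("\t".join(row[f] for f in FIELDS) + "\n")
            time.sleep(delay)  # pause between requests to limit load on the site
```

Calling `crawl()` with its defaults would request 50 pages and write `restaurant_data.txt` to the current working directory; passing a smaller `pages` value is a sensible way to try the selectors first.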
## Disclaimer
The script is intended for educational purposes only. Please use it responsibly and in accordance with the terms of use of [Tabelog](https://tabelog.com/kr/fukuoka/rstLst/).