Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/yuminn-k/crawling-tabelog

Crawling store information from tabelog
https://github.com/yuminn-k/crawling-tabelog

crawler python3

Last synced: 18 days ago
JSON representation

Crawling store information from tabelog

Host: GitHub
URL: https://github.com/yuminn-k/crawling-tabelog
Owner: yuminn-k
Created: 2023-05-06T01:33:23.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2023-05-25T15:08:55.000Z (over 1 year ago)
Last Synced: 2024-11-17T06:14:23.541Z (3 months ago)
Topics: crawler, python3
Language: Python
Homepage:
Size: 4.88 KB
Stars: 2
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# crawling-tabelog

This repository contains Python code for crawling restaurant information from [Tabelog](https://tabelog.com/kr/fukuoka/rstLst/), a popular restaurant review website in Japan.

## Installation

Before running the script, you need to install the following Python packages:

- `requests`
- `beautifulsoup4`

You can install these packages using `pip`. For example, run the following command to install both packages:

```bash
pip install requests beautifulsoup4
```

## Usage

To run the script, simply run the `tabelog_crawler.py` file. The script will crawl the first 50 pages of restaurant listings in Fukuoka, Japan, and extract the following information for each restaurant:

- Title
- Address
- Menu type

- Phone number

The script will store the data in a text file named `restaurant_data.txt` in the same directory.

## Disclaimer

The script is intended for educational purpose only. Please use it responsibly and in accordance with the terms of use of [Tabelog](https://tabelog.com/kr/fukuoka/rstLst/).