https://github.com/kimyk0120/google_maps_scraper
This is a powerful web scraper designed to extract data from Google Maps. Built with Playwright, it ensures efficient and reliable scraping to gather detailed information directly from Google Maps with ease.
https://github.com/kimyk0120/google_maps_scraper
googlemaps playwright python
Last synced: 10 months ago
JSON representation
This is a powerful web scraper designed to extract data from Google Maps. Built with Playwright, it ensures efficient and reliable scraping to gather detailed information directly from Google Maps with ease.
- Host: GitHub
- URL: https://github.com/kimyk0120/google_maps_scraper
- Owner: kimyk0120
- License: apache-2.0
- Created: 2024-09-19T00:04:27.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-04-13T03:50:36.000Z (about 1 year ago)
- Last Synced: 2025-04-13T04:33:21.727Z (about 1 year ago)
- Topics: googlemaps, playwright, python
- Language: Python
- Homepage:
- Size: 82 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
[//]: # ([](https://forthebadge.com))
[](https://forthebadge.com)
[](https://forthebadge.com)
Google Maps Scraper
=======================
This is a Google Maps scraper.
It uses Playwright library to web scraping and data extraction from Google Maps.
This scraper will extract the following information list from the Google Maps:
- name: Store name
- review_count: Total number of reviews
- review_average: Average review rating
- infos: Store service information (e.g., ["Dine-in", "Takeout", "Delivery Service"])
- opens_at: Store opening time (e.g., "Opens at 12:00 PM")
- address: Store address
- website: Store website
- phone: Store phone number
- place_type: Place type (e.g., "Restaurant")
- reviews: List of detailed review information
- review_name: Reviewer name
- review_info: Number of reviews and photos
- review_content: Review content
- review_rate: Review rating
- review_image_urls: List of review image URLs
- review_at: Review posted time (e.g., "1 week ago")
```json
[
{
"name": "Store Name",
"review_count": 756,
"review_average": 4.2,
"infos": [
"매장 내 식사",
"테이크아웃",
"배달 서비스"
],
"opens_at": "오후 12:00에 영업 시작",
"address": "25 The West Mall, Etobicoke, ON M9C 1B8 캐나다",
"website": "sample.com",
"phone": "+1 416-641-7327",
"place_type": "음식점",
"reviews": [
{
"review_name": "reviewer name",
"review_info": "리뷰 19개 · 사진 12장",
"review_content": "So good~~",
"review_rate": 5,
"review_image_urls": [
"https://sample.com/p/-k-no",
"https://sample.com/p/-k-no2"
],
"review_at": "1주 전"
},
...
```
## Prerequisite
- python >= 3.10
- chromium
## Installation
1. Clone this repository:
```bash
git clone https://github.com/kimyk0120/google_maps_scraper
```
2. Move to the project directory
3. Install the required Python packages:
```bash
pip install -r requirements.txt
```
## How to Use:
To use this script, follow these steps:
### 1. Set up Configuration
- You need to configure the necessary settings in the {project}/config/config.ini file.
- You must write the path to the chromium executable in "chromium_path".
- ```chromium_path = C:\Users\kimyk\Downloads\chrome-win\chrome-win\chrome.exe```
- or `Leave `chromium_path` empty and run `playwright install``
- Set "timout_sec" appropriately to prevent infinite loading.
- Set the maximum number of businesses to scrape with the "store_limit_cnt" setting.
- Set the maximum number of reviews to scrape with the "review_limit_cnt" setting.
- If you need proxy settings, put them in "proxy_server"
### 2. Run the script with Python:
```bash
python main.py "search term"
```
If you need to change the output path, do as follows.
```bash
python main.py "search term" --output "ouput path"
```
## Update
- 2024.12.20
## Contact
For any feedback or queries, please reach out to me at [kimyk0120@gmail.com](kimyk0120@gmail.com).
[](https://www.buymeacoffee.com/kimyk0120)