https://github.com/soulomoon/hotelscraper
A very slow hotel scraper for airbnb, booking, using selenium, beautifulsoup4
https://github.com/soulomoon/hotelscraper
airbnb booking scraper scraper-engine selenium webscraper webscraping
Last synced: 2 months ago
JSON representation
A very slow hotel scraper for airbnb, booking, using selenium, beautifulsoup4
- Host: GitHub
- URL: https://github.com/soulomoon/hotelscraper
- Owner: soulomoon
- License: mit
- Created: 2017-12-11T04:55:08.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2017-12-11T13:11:16.000Z (over 8 years ago)
- Last Synced: 2025-10-24T14:47:40.035Z (8 months ago)
- Topics: airbnb, booking, scraper, scraper-engine, selenium, webscraper, webscraping
- Language: Python
- Homepage:
- Size: 23.4 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# HotelScraper
A very slow hotel scraper for airbnb, booking, using selenium, beautifulsoup4
# Structure
* app.py: main selenium scraper, collecting web site back to data folder
* parser.py: main parser to extract hotel information from data folder with beautifulsoup4, converting it to pandas dataframe, and save it in csv.