Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/genescu/python-hotels
Booking.com's Python Parsing Tool
https://github.com/genescu/python-hotels
beautifulsoup booking parsing python scraping scraping-python scraping-websites
Last synced: about 8 hours ago
JSON representation
Booking.com's Python Parsing Tool
- Host: GitHub
- URL: https://github.com/genescu/python-hotels
- Owner: genescu
- Created: 2023-05-22T01:01:04.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-06-21T11:53:22.000Z (over 1 year ago)
- Last Synced: 2024-11-23T22:18:01.786Z (2 months ago)
- Topics: beautifulsoup, booking, parsing, python, scraping, scraping-python, scraping-websites
- Language: HTML
- Homepage:
- Size: 140 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Documentation for Booking.com's Python Parsing Tool
**Instructions on executing the script**
Firstly, you need to import the project.
Next, set the environment to either 'dev' or 'live'.
After that, navigate to the main folder and run the file named '**bookingcom.py**'. Once done, you will find a '**hotels.json**' file generated in the output folder.
Additionally, the project also provides unit tests as a bonus feature.
**Description**
A Python script that scrap hotel information from a website and saves it as a JSON file.
It uses the requests and BeautifulSoup modules to make requests to the website and parse the HTML content, respectively.
The dictionary_booking com function extracts information about hotels from the parsed HTML and creates instances of the 'hotel.Hotel' class to store the information.
The environment function determines whether to run the script in a live or development environment by specifying the URL of the website to scrape.
Finally, the main function runs the scraping process and saves the resulting hotel information as a JSON file.