Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/scrapehero/zillow_real_estate
Zillow.com Web Scraper written in Python and LXML to extract real estate listings available based on a zip code.
https://github.com/scrapehero/zillow_real_estate
html lxml parsing python-requests scraper web-scraping
Last synced: about 2 months ago
JSON representation
Zillow.com Web Scraper written in Python and LXML to extract real estate listings available based on a zip code.
- Host: GitHub
- URL: https://github.com/scrapehero/zillow_real_estate
- Owner: scrapehero
- Created: 2018-02-23T05:52:08.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-02-26T09:56:11.000Z (almost 7 years ago)
- Last Synced: 2024-04-09T05:36:22.528Z (9 months ago)
- Topics: html, lxml, parsing, python-requests, scraper, web-scraping
- Language: Python
- Homepage: https://www.scrapehero.com/how-to-scrape-real-estate-listings-on-zillow-com-using-python-and-lxml/
- Size: 24.4 KB
- Stars: 97
- Watchers: 8
- Forks: 57
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Zillow Real Estate Listing Scraper
This script will scrape Zillow.com, an online real estate database to extract real estate listings available based on a zip code. If you would like to know more about this scraper you can check out our blog post at this link https://www.scrapehero.com/how-to-scrape-real-estate-listings-on-zillow-com-using-python-and-lxml/
## Getting Started
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.
### Fields
This zillow scraper can extract the fields below
1. Title
2. Street Name
3. City
4. State
5. Zip Code
6. Price
7. Facts and Features
8. Real Estate Provider
9. URL### Prerequisites
For this web scraping tutorial using Python 3, we will need some packages for downloading and parsing the HTML.
Below are the package requirements:- lxml
- requests### Installation
PIP to install the following packages in Python (https://pip.pypa.io/en/stable/installing/).
Python Requests, to make requests and download the HTML content of the pages (http://docs.python-requests.org/en/master/user/install/).
To install python request module:
```
pip3 install requests
```Python LXML, for parsing the HTML Tree Structure using Xpaths (Learn how to install that here – http://lxml.de/installation.html)
Installing lxml:```
pip3 install lxml
```## Running the scraper
You must run the script using Python with arguments for zip code and sort. The sort argument has the options ‘newest’ and ‘cheapest’
listings available. As an example, to find the listings of the newest properties up for sale in Boston, Massachusetts we would run the
script as:```
python3 zillow.py 02126 newest
```
## Sample OutputThis will create a csv file:
[Sample Output](https://raw.githubusercontent.com/scrapehero/zillow_real_estate/master/properties-02126.csv)