https://github.com/habibslim/madridimmo
Webscraping of house prices in Madrid, using Python and BeautifulSoup
https://github.com/habibslim/madridimmo
Last synced: 2 months ago
JSON representation
Webscraping of house prices in Madrid, using Python and BeautifulSoup
- Host: GitHub
- URL: https://github.com/habibslim/madridimmo
- Owner: HabibSlim
- License: mit
- Created: 2020-03-02T12:18:47.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2020-03-02T23:01:11.000Z (about 5 years ago)
- Last Synced: 2025-01-14T14:45:50.710Z (4 months ago)
- Language: Jupyter Notebook
- Size: 1.15 MB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
MadridImmo
Webscraping of house prices in Madrid, using Python and BeautifulSoup
![]()
# Dependencies
Below are the dependencies needed to run the project:- `pandas`
- `BeautifulSoup`
- `fake_headers`# Usage
After installing dependencies, the script can be run with:`python3 main.py [options]`
Positional arguments:
- `name`: website to scrape (`pisos`, `tucasa` or `habitaclia`).
- `type`: type of dwellings to target (`houses` or `flats`).Example:
`python3 main.py pisos houses`
# Example dataset
The extracted data can be found in the file `/dataset/madrid_immo.csv` (~30.000 entries in total).