https://github.com/upasanadhameliya/advocate-scrape-selenium
Scraping websites using selenium python. The lawyers websites would get scraped by using Chrome Webdriver along with Windows OS
https://github.com/upasanadhameliya/advocate-scrape-selenium
python python-scraper python-selenium python3 python3-selenium python38 python38scraper python3webscraper pythonwebscraper selenium selenium-python selenium-webdriver web-scraping web-scrapper webscraping websitescraper
Last synced: 4 months ago
JSON representation
Scraping websites using selenium python. The lawyers websites would get scraped by using Chrome Webdriver along with Windows OS
- Host: GitHub
- URL: https://github.com/upasanadhameliya/advocate-scrape-selenium
- Owner: Upasanadhameliya
- License: mit
- Created: 2021-08-25T04:02:19.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2021-08-27T06:46:46.000Z (almost 4 years ago)
- Last Synced: 2025-01-10T03:26:16.793Z (5 months ago)
- Topics: python, python-scraper, python-selenium, python3, python3-selenium, python38, python38scraper, python3webscraper, pythonwebscraper, selenium, selenium-python, selenium-webdriver, web-scraping, web-scrapper, webscraping, websitescraper
- Language: Python
- Homepage:
- Size: 25.4 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Scraping advocate websites using SELENIUM - PYTHON
Project contains source code for scraping two websites containing information about lawyers. Code uses `Google Chrome` driver and is made to run on `Windows 10 OS`.
`Python version 3.8`
To run the code locally:
- IMP: You would need to install the web driver for your browser for selenium to work
- Clone the repository
- `cd` in the `Advocate` folder
- Create a virtual environment using `venv` or `conda`
- Activate your environment and do `pip install -r requirements.txt`
- Change your driver path in the files
- Line 60, 27: `driver = webdriver.Chrome("D:\\your\\path\\to\\driver\\here\\chromedriver.exe")`
- `cd` in the respective directory and `python _scrape.py`### Cyprus Website
https://www.cyprusbar.org/CypriotAdvocateMembersPage.aspx
```
cd cyprus
python cyprus_scrape.py
```
##### Output
### Davac Website
http://vyhledavac.cak.cz
```
cd davac
python davac_scrape.py
```
NOTE: The scraping would be slow as there is a lot of intialisation for this website. So don't close the program and keep
your patience. It would work. Only close if there is some error displayed :D
##### Output
