https://github.com/upasanadhameliya/advocate-scrape-selenium

Scraping websites using selenium python. The lawyers websites would get scraped by using Chrome Webdriver along with Windows OS
https://github.com/upasanadhameliya/advocate-scrape-selenium

python python-scraper python-selenium python3 python3-selenium python38 python38scraper python3webscraper pythonwebscraper selenium selenium-python selenium-webdriver web-scraping web-scrapper webscraping websitescraper

Last synced: 4 months ago
JSON representation

Scraping websites using selenium python. The lawyers websites would get scraped by using Chrome Webdriver along with Windows OS

Host: GitHub
URL: https://github.com/upasanadhameliya/advocate-scrape-selenium
Owner: Upasanadhameliya
License: mit
Created: 2021-08-25T04:02:19.000Z (almost 4 years ago)
Default Branch: main
Last Pushed: 2021-08-27T06:46:46.000Z (almost 4 years ago)
Last Synced: 2025-01-10T03:26:16.793Z (5 months ago)
Topics: python, python-scraper, python-selenium, python3, python3-selenium, python38, python38scraper, python3webscraper, pythonwebscraper, selenium, selenium-python, selenium-webdriver, web-scraping, web-scrapper, webscraping, websitescraper
Language: Python
Homepage:
Size: 25.4 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Scraping advocate websites using SELENIUM - PYTHON

Project contains source code for scraping two websites containing information about lawyers. Code uses `Google Chrome` driver and is made to run on `Windows 10 OS`.

`Python version 3.8`

To run the code locally:
- IMP: You would need to install the web driver for your browser for selenium to work
- Clone the repository
- `cd` in the `Advocate` folder
- Create a virtual environment using `venv` or `conda`
- Activate your environment and do `pip install -r requirements.txt`
- Change your driver path in the files
- Line 60, 27: `driver = webdriver.Chrome("D:\\your\\path\\to\\driver\\here\\chromedriver.exe")`
- `cd` in the respective directory and `python _scrape.py`

### Cyprus Website
https://www.cyprusbar.org/CypriotAdvocateMembersPage.aspx
![ezgif com-gif-maker](https://user-images.githubusercontent.com/28010398/131078306-c7be099d-9278-4ac4-a1b8-bdff0db278b3.gif)

```
cd cyprus
python cyprus_scrape.py
```
##### Output
![2021-08-27-11-31-43](https://user-images.githubusercontent.com/28010398/131079834-ebcaa3b9-7d77-47fe-9985-46934c240b39.png)

### Davac Website
http://vyhledavac.cak.cz
![ezgif com-gif-maker_davac](https://user-images.githubusercontent.com/28010398/131079241-733da2e8-037d-4184-a528-19f3adfd4f13.gif)

```
cd davac
python davac_scrape.py
```
NOTE: The scraping would be slow as there is a lot of intialisation for this website. So don't close the program and keep
your patience. It would work. Only close if there is some error displayed :D
##### Output
![2021-08-27-11-39-44](https://user-images.githubusercontent.com/28010398/131080892-5f7269c3-5187-4965-9ec4-b424d062ec0f.png)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/upasanadhameliya/advocate-scrape-selenium

Awesome Lists containing this project

README