https://github.com/vasistareddy/any_pagesource_scraper_selenium
Extract pagesource/html from the selenium webdriver with get request of Flask API
https://github.com/vasistareddy/any_pagesource_scraper_selenium
docker docker-image flask-api html selenium selenium-python selenium-webdriver
Last synced: 4 months ago
JSON representation
Extract pagesource/html from the selenium webdriver with get request of Flask API
- Host: GitHub
- URL: https://github.com/vasistareddy/any_pagesource_scraper_selenium
- Owner: Vasistareddy
- Created: 2018-09-09T06:18:59.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-09-11T09:44:28.000Z (almost 7 years ago)
- Last Synced: 2025-01-13T06:41:58.948Z (6 months ago)
- Topics: docker, docker-image, flask-api, html, selenium, selenium-python, selenium-webdriver
- Language: Python
- Size: 4.88 KB
- Stars: 2
- Watchers: 2
- Forks: 2
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# docker_selenium_pagesource_flask
Extract pagesource/html through selenium webdriver with get request of Flask API# guide for the docker installation @digitalocean
https://www.digitalocean.com/community/tutorials/how-to-install-and-use-docker-on-ubuntu-18-04# Build the docker
docker build . -t selenium:pagesource
# Run the docker
docker run -p 5056:5056 --env DISPLAY=:10 -d selenium:pagesource
- This will make docker to run in detached mode# endpoint
## - pagesource
# querystrings
## params:
- url
## response:
200:
- status_code: 200
- pageSource:
500:
- status_code: 500
- erro: error messageEx: http://localhost:5056/pagesource?url=http://vasistareddy.blogspot.com/