https://github.com/vasistareddy/any_pagesource_scraper_selenium

Extract pagesource/html from the selenium webdriver with get request of Flask API
https://github.com/vasistareddy/any_pagesource_scraper_selenium

docker docker-image flask-api html selenium selenium-python selenium-webdriver

Last synced: 4 months ago
JSON representation

Extract pagesource/html from the selenium webdriver with get request of Flask API

Host: GitHub
URL: https://github.com/vasistareddy/any_pagesource_scraper_selenium
Owner: Vasistareddy
Created: 2018-09-09T06:18:59.000Z (almost 7 years ago)
Default Branch: master
Last Pushed: 2018-09-11T09:44:28.000Z (almost 7 years ago)
Last Synced: 2025-01-13T06:41:58.948Z (6 months ago)
Topics: docker, docker-image, flask-api, html, selenium, selenium-python, selenium-webdriver
Language: Python
Size: 4.88 KB
Stars: 2
Watchers: 2
Forks: 2
Open Issues: 1
Metadata Files:
- Readme: README.md

README

# docker_selenium_pagesource_flask
Extract pagesource/html through selenium webdriver with get request of Flask API

# guide for the docker installation @digitalocean
https://www.digitalocean.com/community/tutorials/how-to-install-and-use-docker-on-ubuntu-18-04

# Build the docker

docker build . -t selenium:pagesource

# Run the docker

docker run -p 5056:5056 --env DISPLAY=:10 -d selenium:pagesource

- This will make docker to run in detached mode

# endpoint

## - pagesource

# querystrings

## params:
- url
## response:
200:
- status_code: 200
- pageSource:
500:
- status_code: 500
- erro: error message

Ex: http://localhost:5056/pagesource?url=http://vasistareddy.blogspot.com/