Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/guhan-tofu/web-scraper
This will accept a link and return all text present in the page.
https://github.com/guhan-tofu/web-scraper
python selenium web-scraper webdriver
Last synced: about 2 months ago
JSON representation
This will accept a link and return all text present in the page.
- Host: GitHub
- URL: https://github.com/guhan-tofu/web-scraper
- Owner: guhan-tofu
- License: mit
- Created: 2024-06-24T11:02:41.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-06-24T11:20:52.000Z (7 months ago)
- Last Synced: 2024-06-24T12:39:35.439Z (7 months ago)
- Topics: python, selenium, web-scraper, webdriver
- Language: Python
- Homepage:
- Size: 14.6 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Web-Scraper
Obtains a URL which has its UI removed and read, returning the text obtained from the given URL.
## Create and activate a virtual environment if you don't have one
`python3 -m venv venv`
`source venv/bin/activate`
## Install dependencies
`pip install selenium`
`pip install webdriver_manager`
`pip install beautifulsoup4`
## Enter URL in scraper.py
(line 15)## Run scraper.py
`python scraper.py`