Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/julzerinos/python-scraping-tools
A collection of repeatable methods and concepts appearing in python web scraping with the use of Scrapy and Selenium
- Host: GitHub
- URL: https://github.com/julzerinos/python-scraping-tools
- Owner: julzerinos
- License: mit
- Archived: true
- Created: 2019-06-21T09:23:41.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2020-11-14T09:39:07.000Z (about 4 years ago)
- Last Synced: 2024-08-03T01:26:28.759Z (6 months ago)
- Topics: bot, python, python-bot, python-web-scraper, scraping, scrapy, selenium, selenium-python, web-scraping
- Language: Python
- Homepage:
- Size: 17.6 KB
- Stars: 8
- Watchers: 2
- Forks: 3
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-rainmana - julzerinos/python-scraping-tools - A collection of repeatable methods and concepts appearing in python web scraping with the use of Scrapy and Selenium (Python)
README
# Scraping Tools and Templates
This repository aims to be an educational collection of repeatable methods and concepts that appear in Python-based web scraping with Scrapy and Selenium. The content is based on long exposure to this field and is divided into three categories, referred to as:
1. Tools - a selection of utilities, helper functions and other tools that make working with web scraping easier.
2. Techniques - a compendium of useful functions and methods, with explanations of how they work.
3. Templates - an assortment of general examples of spiders (for their respective website types), which require little tweaking to get running.
## Context
A few assumptions and specifications are made about the nature of this web scraping endeavor, so reading through most of the README is advised.