https://github.com/zytedata/spidyquotes
Example site for web scraping tutorials
- Host: GitHub
- URL: https://github.com/zytedata/spidyquotes
- Owner: zytedata
- License: bsd-3-clause
- Created: 2015-11-02T02:27:20.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2024-04-17T02:09:05.000Z (8 months ago)
- Last Synced: 2024-04-17T07:56:06.070Z (8 months ago)
- Topics: crawling, playground, scraping, tutorials, web-crawling, web-scraping, web-scraping-tutorials
- Language: Python
- Homepage:
- Size: 223 KB
- Stars: 30
- Watchers: 8
- Forks: 15
- Open Issues: 6
Metadata Files:
- Readme: README.md
- License: LICENSE
# Spidy Quotes
This is an example site for web scraping tutorials.
## Installation
Install dependencies:

    pip install -r requirements.txt
## How to run
    python spidyquotes.py
To run in development mode, use the `--debug` flag.
## Endpoints
* `/`: well structured HTML with microdata markup and pagination buttons.
* `/scroll`: same as `/`, but with infinite scrolling via AJAX calls.
* `/js`: the content is generated by JavaScript code.
* `/js-delayed/?delay=1000`: same as `/js` but with a delay. Default delay is 10000 ms.
* `/tableful`: a messed-up layout based on tables.
* `/login`: login page with CSRF token (any user/passwd works). The website shows GoodReads links for authenticated users.
* `/viewState`: an AJAX-based filter form that simulates ViewStates behavior.
* `/random`: shows a single random quote.

## Hosted version
There is a hosted version available at http://quotes.toscrape.com.
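
As a rough illustration of how the microdata markup on `/` might be consumed, the sketch below pulls `itemprop="text"` and `itemprop="author"` values out of an HTML string using only the Python standard library. The `SAMPLE_HTML` snippet, the `QuoteParser` class, and the `parse_quotes` helper are all hypothetical names made up for this example, and the sample markup is an assumption about what the page looks like, not a copy of it.

```python
from html.parser import HTMLParser

# Hypothetical markup, assumed to resemble one quote block on `/`.
# The real page may use different tags, classes, or properties.
SAMPLE_HTML = """
<div class="quote" itemscope itemtype="http://schema.org/CreativeWork">
  <span class="text" itemprop="text">A sample quote.</span>
  <span>by <small class="author" itemprop="author">Jane Doe</small></span>
</div>
<div class="quote" itemscope itemtype="http://schema.org/CreativeWork">
  <span class="text" itemprop="text">Another one.</span>
  <span>by <small class="author" itemprop="author">John Roe</small></span>
</div>
"""


class QuoteParser(HTMLParser):
    """Collect itemprop="text" and itemprop="author" per itemscope element."""

    def __init__(self):
        super().__init__()
        self.quotes = []       # one dict per itemscope element seen so far
        self._prop = None      # itemprop currently being read, if any
        self._prop_tag = None  # tag that opened the current itemprop

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if "itemscope" in attrs:
            # A new item starts; later properties attach to it.
            self.quotes.append({})
        prop = attrs.get("itemprop")
        if prop in ("text", "author") and self.quotes:
            self._prop = prop
            self._prop_tag = tag
            self.quotes[-1][prop] = ""

    def handle_data(self, data):
        if self._prop:
            self.quotes[-1][self._prop] += data

    def handle_endtag(self, tag):
        # Simplification: assumes the property element has no nested
        # element with the same tag name.
        if self._prop and tag == self._prop_tag:
            value = self.quotes[-1][self._prop]
            self.quotes[-1][self._prop] = value.strip()
            self._prop = None


def parse_quotes(html):
    """Return a list of {"text": ..., "author": ...} dicts found in html."""
    parser = QuoteParser()
    parser.feed(html)
    parser.close()
    return parser.quotes


quotes = parse_quotes(SAMPLE_HTML)
```

To try this against the hosted version, the HTML string could be fetched with `urllib.request.urlopen` and decoded before being passed to `parse_quotes`. The JavaScript-driven endpoints (`/js`, `/js-delayed`, `/scroll`) would likely need a different approach, since their content is not present in the initial HTML.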