https://github.com/zytedata/spidyquotes

Example site for web scraping tutorials

crawling playground scraping tutorials web-crawling web-scraping web-scraping-tutorials

# Spidy Quotes

This is an example site for web scraping tutorials.

## Installation

Install dependencies:

    pip install -r requirements.txt

## How to run

    python spidyquotes.py

To run in development mode, use the `--debug` flag.

## Endpoints

* `/`: well-structured HTML with microdata markup and pagination buttons.
* `/scroll`: same as `/`, but with infinite scrolling via AJAX calls.
* `/js`: the content is generated by JavaScript code.
* `/js-delayed/?delay=1000`: same as `/js` but with a delay. Default delay is 10000 ms.
* `/tableful`: a messed-up layout based on tables.
* `/login`: login page with a CSRF token (any username/password combination works). The website shows Goodreads links to authenticated users.
* `/viewState`: an AJAX-based filter form that simulates ViewState behavior.
* `/random`: shows a single random quote.
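
The microdata markup mentioned for `/` can be read with the standard library alone. The following is a minimal sketch, not the project's own code: it assumes each quote exposes its text and author through `itemprop="text"` and `itemprop="author"` attributes, and the names `QuoteMicrodataParser` and `parse_quotes` are hypothetical helpers introduced for illustration.

```python
from html.parser import HTMLParser


class QuoteMicrodataParser(HTMLParser):
    """Collect the text of elements marked itemprop="text" or itemprop="author".

    Assumption: these itemprop names match the site's markup; adjust if not.
    """

    WANTED = {"text", "author"}

    def __init__(self):
        super().__init__()
        self.quotes = []      # finished records: {"text": ..., "author": ...}
        self._current = {}    # record being assembled
        self._prop = None     # itemprop currently being captured, if any
        self._tag = None      # tag name that opened the capture
        self._buf = []        # text fragments seen during the capture

    def handle_starttag(self, tag, attrs):
        prop = dict(attrs).get("itemprop")
        if prop in self.WANTED and self._prop is None:
            self._prop, self._tag, self._buf = prop, tag, []

    def handle_data(self, data):
        if self._prop is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if self._prop is not None and tag == self._tag:
            self._current[self._prop] = "".join(self._buf).strip()
            self._prop = None
            # Emit the record once both wanted properties have been seen.
            if self.WANTED.issubset(self._current):
                self.quotes.append(self._current)
                self._current = {}


def parse_quotes(html):
    """Return a list of {"text", "author"} dicts found in an HTML string."""
    parser = QuoteMicrodataParser()
    parser.feed(html)
    parser.close()
    return parser.quotes
```

Passing the HTML of a listing page to `parse_quotes` should yield one dictionary per quote. Following the pagination buttons is left out here because the link markup is not described above.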

## Hosted version

There is a hosted version available at http://quotes.toscrape.com.
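
The `/login` flow listed under Endpoints can be tried against either a local instance or the hosted version. The sketch below is illustrative only: it assumes the form carries the token in a hidden input named `csrf_token` and uses `username` and `password` as field names, none of which is confirmed by this README. `extract_csrf_token` and `login` are hypothetical helper names.

```python
import urllib.parse
import urllib.request
from html.parser import HTMLParser
from http.cookiejar import CookieJar


class _InputValueFinder(HTMLParser):
    """Record the value of the first <input> whose name equals `field`."""

    def __init__(self, field):
        super().__init__()
        self.field = field
        self.value = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "input" and attrs.get("name") == self.field and self.value is None:
            self.value = attrs.get("value")


def extract_csrf_token(html, field="csrf_token"):
    """Return the token value from a login form, or None if it is absent.

    Assumption: the hidden input is named `csrf_token`.
    """
    finder = _InputValueFinder(field)
    finder.feed(html)
    return finder.value


def login(base_url, username, password):
    """Fetch the login form, then post it back with its CSRF token."""
    # The cookie jar carries the session cookie from the GET to the POST.
    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(CookieJar())
    )
    login_url = urllib.parse.urljoin(base_url, "/login")
    with opener.open(login_url) as resp:
        token = extract_csrf_token(resp.read().decode("utf-8"))
    if token is None:
        raise ValueError("no CSRF token found in the login form")
    # Assumed form field names; any username/password is accepted by the site.
    form = urllib.parse.urlencode(
        {"csrf_token": token, "username": username, "password": password}
    ).encode("ascii")
    with opener.open(login_url, data=form) as resp:
        return resp.read().decode("utf-8")
```

Calling `login("http://quotes.toscrape.com", "user", "pass")` performs two network requests and returns the HTML of the page served after the form is submitted.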