Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/chhantyal/scrapd
Python web scrapping utils
https://github.com/chhantyal/scrapd
Last synced: 3 months ago
JSON representation
Python web scrapping utils
- Host: GitHub
- URL: https://github.com/chhantyal/scrapd
- Owner: chhantyal
- License: bsd-3-clause
- Created: 2014-05-12T21:20:26.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2014-05-15T19:09:15.000Z (over 10 years ago)
- Last Synced: 2023-03-22T22:06:10.030Z (almost 2 years ago)
- Language: Python
- Size: 124 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.rst
- Changelog: HISTORY.rst
- Contributing: CONTRIBUTING.rst
- License: LICENSE
Awesome Lists containing this project
README
===============================
scrapd
===============================Python web scrapping utils.
``scrapd`` is small util as well as Flask application to scrap contents from www.zalando.de.
Usage
-----
It scraps products information from given path.
Example:
>>> import zalando
>>> URL_PATH = "damenbekleidung-jeans-straight-leg"
>>> category = zalando.get_category(URL_PATH)
>>> # Returns categories as Python dict
>>> print category["name"] # prints "Straight Leg"
>>> print category["url"]
>>> # prints "http://www.zalando.de/damenbekleidung-jeans-straight-leg/"
>>> print len(category["articles"])
>>> # etc.The Flask app exposes same functionality to the web interface.
Install
--------
Steps ::$ git clone https://github.com/chhantyal/zalando or get gzipped package
Install package using setuptools. It will also install a command line application, which can be
used to run flask app::$ python setup.py install
To run the zalando command line app::
$ zalando # runs flask app at port 5000. Use --port to specify different port.