Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/zytedata/extract-summit-contest-solutions

Example solutions for the practice and contest websites of the code contest of Web Data Extraction Summit.
https://github.com/zytedata/extract-summit-contest-solutions

Last synced: 2 months ago
JSON representation

Example solutions for the practice and contest websites of the code contest of Web Data Extraction Summit.

Awesome Lists containing this project

README

        

===============================================================
Example solution for the Extract Summit 2024 Coding Competition
===============================================================

There are 2 different solution spiders, one that uses AI parsing by default and
only uses custom parsing where AI fails, and one that uses custom parsing code
only, no AI.

Both solutions are implemented with the `e-commerce spider`_ from
zyte-spider-templates.

.. _e-commerce spider: https://zyte-spider-templates.readthedocs.io/en/latest/templates/e-commerce.html

To run the AI solution::

scrapy crawl ecommerce -s SOLUTION=ai -a url="https://zzcvcpnfzoogpxiqupsergvrmdopqgrk-744852047878.us-south1.run.app/navigation"

To run the non-AI solution::

scrapy crawl ecommerce -s SOLUTION=non_ai -a url="https://zzcvcpnfzoogpxiqupsergvrmdopqgrk-744852047878.us-south1.run.app/navigation"