Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/jumbaeric/woocommerce_shop_scraper_to_csv

The WooCommerce Shop Scraper to CSV tool is a powerful python tool designed to simplify the process of extracting product data from a WooCommerce-powered online shop and generating a comprehensive CSV file. Built on the foundations of WordPress and WooCommerce,
https://github.com/jumbaeric/woocommerce_shop_scraper_to_csv

data python scraping-websites scrapy-spider woocommerce-products wordpress

Last synced: 6 days ago
JSON representation

The WooCommerce Shop Scraper to CSV tool is a powerful python tool designed to simplify the process of extracting product data from a WooCommerce-powered online shop and generating a comprehensive CSV file. Built on the foundations of WordPress and WooCommerce,

Awesome Lists containing this project

README

        

The WooCommerce Shop Scraper to CSV tool is a powerful python tool designed to simplify the process of extracting product data from a WooCommerce-powered online shop and generating a comprehensive CSV file. Built on the foundations of WordPress and WooCommerce, this project leverages advanced web scraping techniques to automate the extraction and organization of essential product information.

By utilizing this scraper, you can effortlessly gather a wealth of data from any WooCommerce-based online shop, including product names, descriptions, prices, images, categories, stock availability, and more. Whether you're an e-commerce enthusiast, a business analyst, or a developer seeking efficient data extraction, this project provides an intuitive and efficient solution.

# woocommerce_shop_scraper_to_csv
Using Scrapy Python.

Scrapy (/ˈskreɪpaɪ/) is an application framework for crawling web sites and extracting structured data which can be used for a wide range of useful applications, like data mining, information processing or historical archival.

Installation instructions


1. Install scrapy


Scrapy requires Python 3.7+, either the CPython implementation (default) or the PyPy implementation (see Alternate Implementations).

pip install Scrapy


2. Clone the repo


3. Insert the woocommerce shop urls in spiders > products_spider.py , You can use 1 or more shop urls


start_urls = [
'https://www.woostore1.com/shop/',
'https://www.woostore2.co.ke/shop/page/1',
'https://woostore3.com/shop/page/1',
]


cd into the projects directory and run the command


scrapy crawl productsSpider

The crawl will start and a new csv file will be generated on the projects root directory with the products populated