Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jumbaeric/woocommerce_shop_scraper_to_csv
The WooCommerce Shop Scraper to CSV tool is a powerful python tool designed to simplify the process of extracting product data from a WooCommerce-powered online shop and generating a comprehensive CSV file. Built on the foundations of WordPress and WooCommerce,
https://github.com/jumbaeric/woocommerce_shop_scraper_to_csv
data python scraping-websites scrapy-spider woocommerce-products wordpress
Last synced: 6 days ago
JSON representation
The WooCommerce Shop Scraper to CSV tool is a powerful python tool designed to simplify the process of extracting product data from a WooCommerce-powered online shop and generating a comprehensive CSV file. Built on the foundations of WordPress and WooCommerce,
- Host: GitHub
- URL: https://github.com/jumbaeric/woocommerce_shop_scraper_to_csv
- Owner: jumbaeric
- Created: 2023-01-14T00:14:43.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-06-28T08:20:04.000Z (over 1 year ago)
- Last Synced: 2024-11-20T23:00:59.859Z (2 months ago)
- Topics: data, python, scraping-websites, scrapy-spider, woocommerce-products, wordpress
- Language: Python
- Homepage:
- Size: 484 KB
- Stars: 11
- Watchers: 1
- Forks: 4
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
The WooCommerce Shop Scraper to CSV tool is a powerful python tool designed to simplify the process of extracting product data from a WooCommerce-powered online shop and generating a comprehensive CSV file. Built on the foundations of WordPress and WooCommerce, this project leverages advanced web scraping techniques to automate the extraction and organization of essential product information.
By utilizing this scraper, you can effortlessly gather a wealth of data from any WooCommerce-based online shop, including product names, descriptions, prices, images, categories, stock availability, and more. Whether you're an e-commerce enthusiast, a business analyst, or a developer seeking efficient data extraction, this project provides an intuitive and efficient solution.
# woocommerce_shop_scraper_to_csv
Using Scrapy Python.Scrapy (/ˈskreɪpaɪ/) is an application framework for crawling web sites and extracting structured data which can be used for a wide range of useful applications, like data mining, information processing or historical archival.
Installation instructions
1. Install scrapy
Scrapy requires Python 3.7+, either the CPython implementation (default) or the PyPy implementation (see Alternate Implementations).
pip install Scrapy
2. Clone the repo
3. Insert the woocommerce shop urls in spiders > products_spider.py , You can use 1 or more shop urls
start_urls = [
'https://www.woostore1.com/shop/',
'https://www.woostore2.co.ke/shop/page/1',
'https://woostore3.com/shop/page/1',
]cd into the projects directory and run the command
scrapy crawl productsSpider
The crawl will start and a new csv file will be generated on the projects root directory with the products populated