Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/basemax/buskool.com-crawler

This repository contains a PHP-based crawler and scraper designed to fetch and download all product data from the Buskool website (باسکول). The crawler is designed to handle large-scale data scraping efficiently and stores the collected data in JSON format.
https://github.com/basemax/buskool.com-crawler

buskool buskoolcom crawler crawler-php php php-crawler

Last synced: 1 day ago
JSON representation

This repository contains a PHP-based crawler and scraper designed to fetch and download all product data from the Buskool website (باسکول). The crawler is designed to handle large-scale data scraping efficiently and stores the collected data in JSON format.

Awesome Lists containing this project

README

        

# Buskool Crawler

This repository contains a PHP-based crawler and scraper designed to fetch and download all product data from the [Buskool website](http://www.buskool.com/) (باسکول). The crawler is designed to handle large-scale data scraping efficiently and stores the collected data in JSON format.

## Features

- Scrapes product data from the Buskool website.
- Supports robust error handling.
- Stores the extracted data in structured JSON format for easy processing.

## Requirements

PHP 8.3+

## Installation

Clone the repository:
```bash
git clone https://github.com/BaseMax/buskool.com-crawler.git
cd buskool.com-crawler
```

## Usage

Run the crawler script:
```bash
php download.php
php create-products.php
```

The collected data will be saved in the specified directory as JSON files.

## Contributing

Contributions, issues, and feature requests are welcome! Feel free to open an issue or submit a pull request.

## Data Source

The data in this repository was collected from Buskool.com website. It is intended for educational, research, and development purposes only.

## License

This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.

Let me know if you want any modifications or additional details!

Copyright © 2024, Max Base