https://github.com/danieladdisonorg/ecommerce-scraping-api
- Host: GitHub
- URL: https://github.com/danieladdisonorg/ecommerce-scraping-api
- Owner: danieladdisonorg
- License: mit
- Created: 2025-06-20T10:16:43.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2025-06-20T10:19:07.000Z (12 months ago)
- Last Synced: 2025-06-20T10:51:00.449Z (12 months ago)
- Language: PHP
- Size: 0 Bytes
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# eCommerce Scraping API
## Overview
The **eCommerce Scraping API** by Smartproxy enables developers to extract comprehensive product data from major eCommerce platforms including Amazon and Wayfair. Our API provides structured, reliable data extraction with high success rates and built-in parsing capabilities.
### Key Features
- **Multi-platform Support**: Amazon, Wayfair, and generic eCommerce sites
- **Comprehensive Data**: Products, reviews, pricing, Q&A, and search results
- **AI-Powered Parsing**: Intelligent data extraction for any eCommerce site
- **Global Coverage**: Support for multiple locales and geographical locations
- **High Performance**: Fast response times with reliable uptime
- **Developer Friendly**: RESTful API with extensive documentation and examples
## Table of Contents
- [Quick Start](#quick-start)
- [Authentication](#authentication)
- [Supported Platforms](#supported-platforms)
  - [Amazon](#amazon)
  - [Wayfair](#wayfair)
  - [Generic eCommerce](#generic-ecommerce)
- [API Reference](#api-reference)
- [Code Examples](#code-examples)
- [Response Codes](#response-codes)
- [Support](#support)
## Quick Start
### Prerequisites
- Active eCommerce Scraping API subscription
- Valid API credentials (username/password)
### Basic Request
```bash
curl -X POST https://scraper-api.smartproxy.com/v2/scrape \
  -H "Content-Type: application/json" \
  -u "username:password" \
  -d '{
    "target": "amazon",
    "url": "https://www.amazon.com/dp/B09H74FXNW"
  }'
```
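
For readers who prefer Python, the same call can be sketched with the standard library alone. This is a minimal illustration, not the official client: `build_request` is a hypothetical helper name, and the credentials are the same placeholders used in the curl command.

```python
import base64
import json
import urllib.request

API_URL = "https://scraper-api.smartproxy.com/v2/scrape"


def build_request(username: str, password: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl example."""
    # curl's -u flag sends the credentials as an HTTP Basic Auth header.
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Basic {token}",
        },
        method="POST",
    )


request = build_request(
    "username",
    "password",
    {"target": "amazon", "url": "https://www.amazon.com/dp/B09H74FXNW"},
)
print(request.get_method(), request.full_url)
```

Passing `request` to `urllib.request.urlopen` would send it; that step needs valid credentials and network access, so it is left out of the sketch.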
## Authentication
Authentication is handled via HTTP Basic Auth using your Smartproxy credentials.
1. Navigate to your [Smartproxy Dashboard](https://dashboard.smartproxy.com)
2. Go to **eCommerce > Authentication**
3. Enter your username and password
4. Generate and test your API request
> **Note**: The dashboard provides example requests with preset values. Customize parameters in your actual implementation.
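
HTTP Basic Auth amounts to a single `Authorization` header holding the Base64-encoded `username:password` pair. The sketch below shows one way to build that header while keeping credentials out of source code. The environment variable names `SMARTPROXY_USERNAME` and `SMARTPROXY_PASSWORD` are assumptions made for this example, not names defined by the API.

```python
import base64
import os


def basic_auth_header(username: str, password: str) -> dict:
    """Return the Authorization header used by HTTP Basic Auth."""
    raw = f"{username}:{password}".encode("utf-8")
    return {"Authorization": "Basic " + base64.b64encode(raw).decode("ascii")}


def credentials_from_env(env=os.environ) -> tuple:
    """Read credentials from the (hypothetical) SMARTPROXY_* environment variables."""
    try:
        return env["SMARTPROXY_USERNAME"], env["SMARTPROXY_PASSWORD"]
    except KeyError as missing:
        raise RuntimeError(f"Missing environment variable: {missing}") from None


# Demonstration with an explicit mapping instead of the real environment.
demo_env = {"SMARTPROXY_USERNAME": "username", "SMARTPROXY_PASSWORD": "password"}
print(basic_auth_header(*credentials_from_env(demo_env)))
```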
## Supported Platforms
### Amazon
Extract comprehensive data from Amazon including products, reviews, pricing, and search results.
#### Available Targets
| Target | Description | Parseable | Required Parameter |
|--------|-------------|-----------|-------------------|
| `amazon` | Product page via URL | ✅ | `url` |
| `amazon_product` | Product via ASIN | ✅ | `query` |
| `amazon_pricing` | Pricing information | ✅ | `query` |
| `amazon_reviews` | Product reviews | ✅ | `query` |
| `amazon_questions` | Q&A section | ✅ | `query` |
| `amazon_search` | Search results | ✅ | `query` |
| `amazon_bestsellers` | Bestseller lists | ✅ | `query` |
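
Because the required parameter differs by target, a small lookup can keep request bodies consistent with the table above. The mapping is copied from that table; the helper name `amazon_payload` is hypothetical.

```python
# Required parameter for each Amazon target, as listed in the table above.
AMAZON_TARGETS = {
    "amazon": "url",
    "amazon_product": "query",
    "amazon_pricing": "query",
    "amazon_reviews": "query",
    "amazon_questions": "query",
    "amazon_search": "query",
    "amazon_bestsellers": "query",
}


def amazon_payload(target: str, value: str) -> dict:
    """Build a request body, placing `value` under the parameter the target requires."""
    try:
        required = AMAZON_TARGETS[target]
    except KeyError:
        raise ValueError(f"Unknown Amazon target: {target!r}") from None
    return {"target": target, required: value}


print(amazon_payload("amazon", "https://www.amazon.com/dp/B09H74FXNW"))
print(amazon_payload("amazon_product", "B09H74FXNW"))
```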
#### Example Response Structure
Example parsed response for an Amazon product (list-valued fields are elided as `[...]`):
```json
{
  "results": [
    {
      "content": {
        "url": "https://www.amazon.com/dp/B09H74FXNW",
        "asin": "B09H74FXNW",
        "title": "Gaming Headset with Microphone...",
        "price": 20.98,
        "currency": "USD",
        "rating": 4.4,
        "reviews_count": 2239,
        "images": ["https://m.media-amazon.com/images/..."],
        "description": "Product description...",
        "bullet_points": "Key features...",
        "category": [...],
        "variations": [...],
        "ads": [...]
      },
      "status_code": 200,
      "created_at": "2022-09-01 11:03:48"
    }
  ]
}
```
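
A response of this shape can be reduced to a few headline fields as sketched below. The sample is an abridged copy of the example above with the elided list fields left out, and `summarize_product` is a hypothetical helper. Real responses may carry more or fewer fields, so missing keys are returned as `None` rather than raising.

```python
import json

# Abridged copy of the example response above (list-valued fields omitted).
SAMPLE_RESPONSE = json.loads("""
{
  "results": [
    {
      "content": {
        "url": "https://www.amazon.com/dp/B09H74FXNW",
        "asin": "B09H74FXNW",
        "title": "Gaming Headset with Microphone...",
        "price": 20.98,
        "currency": "USD",
        "rating": 4.4,
        "reviews_count": 2239
      },
      "status_code": 200,
      "created_at": "2022-09-01 11:03:48"
    }
  ]
}
""")


def summarize_product(response: dict) -> dict:
    """Pick a few headline fields from the first result; absent fields become None."""
    results = response.get("results") or []
    if not results:
        raise ValueError("Response contains no results")
    content = results[0].get("content") or {}
    fields = ("asin", "title", "price", "currency", "rating", "reviews_count")
    return {name: content.get(name) for name in fields}


print(summarize_product(SAMPLE_RESPONSE))
```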
### Wayfair
Access Wayfair product data and search results.
#### Available Targets
| Target | Description | Parseable | Required Parameter |
|--------|-------------|-----------|-------------------|
| `wayfair` | Product page via URL | ❌ | `url` |
| `wayfair_search` | Search results | ❌ | `query` |
### Generic eCommerce
Extract data from any eCommerce website using our AI-powered parser.
#### Target Configuration
| Target | Description | Parseable | Required Parameter |
|--------|-------------|-----------|-------------------|
| `ecommerce` | Any eCommerce site | ✅ (AI) | `url` |
> **Note**: For AI parsing, set `parse: true` and `parser_type: "ecommerce_product"`
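
Taken together, the table and the note suggest a request body like the one sketched below. The product URL is a placeholder and `ecommerce_payload` is a hypothetical helper; the parameter names come from the note above.

```python
import json


def ecommerce_payload(url: str) -> dict:
    """Request body for the generic target with AI parsing switched on."""
    return {
        "target": "ecommerce",
        "url": url,
        "parse": True,  # serialized as JSON `true`
        "parser_type": "ecommerce_product",
    }


# Placeholder URL for illustration only.
print(json.dumps(ecommerce_payload("https://example.com/product/123"), indent=2))
```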
## API Reference
### Base URL
```
https://scraper-api.smartproxy.com/v2/scrape
```
### Request Parameters
| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `target` | string | ✅ | Scraping target (see [supported targets](#supported-platforms)) |
| `url` | string | * | Direct URL to scrape |
| `query` | string | * | Search query or product ID |
| `locale` | string | ❌ | Interface language (e.g., `en-US`, `en-GB`) |
| `geo` | string | ❌ | Geographical location |
| `device_type` | string | ❌ | Device type: `desktop`, `mobile`, `tablet` |
| `headless` | string | ❌ | JavaScript rendering: `html`, `png` |
| `parser_type` | string | ❌ | Parser type for eCommerce target |

\* Which of `url` or `query` is required depends on the target.
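
One way to assemble these parameters is a small dataclass that sends only the fields actually set. This is a sketch under stated assumptions: the class name `ScrapeRequest` is hypothetical, and the check that at least one of `url` or `query` is present is this example's reading of the footnote, not documented API behavior.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class ScrapeRequest:
    """Request parameters from the table above; only `target` is always required."""

    target: str
    url: Optional[str] = None
    query: Optional[str] = None
    locale: Optional[str] = None
    geo: Optional[str] = None
    device_type: Optional[str] = None
    headless: Optional[str] = None
    parser_type: Optional[str] = None

    def to_payload(self) -> dict:
        """Return the JSON body, omitting every parameter left unset."""
        if self.url is None and self.query is None:
            raise ValueError("Provide `url` or `query`, depending on the target.")
        return {key: value for key, value in asdict(self).items() if value is not None}


request = ScrapeRequest(
    target="amazon_search",
    query="gaming headset",
    locale="en-US",
    device_type="mobile",
)
print(request.to_payload())
```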
### Device Types
- `desktop` (default)
- `desktop_chrome`
- `desktop_firefox`
- `mobile`
- `mobile_android`
- `mobile_ios`
## Code Examples
### Python
```bash
curl https://raw.githubusercontent.com/Smartproxy/eCommerce-Scraping-API/main/python/amazon.py > amazon.py
```
### PHP
```bash
curl https://raw.githubusercontent.com/Smartproxy/eCommerce-Scraping-API/main/php/amazon.php > amazon.php
```
### Node.js
```bash
curl https://raw.githubusercontent.com/Smartproxy/eCommerce-Scraping-API/main/nodejs/amazon.js > amazon.js
```
### Complete Example Collection
| Platform | Python | PHP | Node.js |
|----------|--------|-----|---------|
| Amazon Product | [amazon.py](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/python/amazon.py) | [amazon.php](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/php/amazon.php) | [amazon.js](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/nodejs/amazon.js) |
| Amazon Search | [amazonsearch.py](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/python/amazonsearch.py) | [amazonsearch.php](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/php/amazonsearch.php) | [amazonsearch.js](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/nodejs/amazonsearch.js) |
| Amazon Reviews | [amazonreviews.py](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/python/amazonreviews.py) | [amazonreviews.php](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/php/amazonreviews.php) | [amazonreviews.js](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/nodejs/amazonreviews.js) |
| Wayfair | [wayfair.py](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/python/wayfair.py) | [wayfair.php](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/php/wayfair.php) | [wayfair.js](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/nodejs/wayfair.js) |
| Generic eCommerce | [ecommerce.py](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/python/ecommerce.py) | [ecommerce.php](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/php/ecommerce.php) | [ecommerce.js](https://github.com/Smartproxy/eCommerce-Scraping-API/blob/main/nodejs/ecommerce.js) |
## Response Codes
### HTTP Status Codes
| Code | Status | Description | Action Required |
|------|--------|-------------|-----------------|
| **200** | Success | Request completed successfully | Continue processing |
| **204** | Processing | Job still in progress | Wait and retry |
| **400** | Bad Request | Invalid request format | Check request structure |
| **401** | Unauthorized | Invalid credentials | Verify authentication |
| **403** | Forbidden | Access denied | Check subscription/permissions |
| **404** | Not Found | Target not found | Verify URL/parameters |
| **429** | Rate Limited | Too many requests | Wait before retrying |
| **500** | Server Error | Internal server error | Contact support |
| **524** | Timeout | Request timeout | Retry after delay |
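
Client code can turn the "Action Required" column into a simple dispatch. The grouping below is one interpretation of the table, and treating unlisted codes as a support case is an assumption of this sketch; `Action` and `action_for` are hypothetical names.

```python
from enum import Enum


class Action(Enum):
    PROCESS = "continue processing"
    RETRY = "wait and retry"
    FIX_REQUEST = "fix the request or credentials"
    CONTACT_SUPPORT = "contact support"


# One possible grouping of the HTTP status codes listed above.
ACTIONS = {
    200: Action.PROCESS,
    204: Action.RETRY,
    400: Action.FIX_REQUEST,
    401: Action.FIX_REQUEST,
    403: Action.FIX_REQUEST,
    404: Action.FIX_REQUEST,
    429: Action.RETRY,
    500: Action.CONTACT_SUPPORT,
    524: Action.RETRY,
}


def action_for(status_code: int) -> Action:
    """Map a status code to a follow-up action; unlisted codes go to support."""
    return ACTIONS.get(status_code, Action.CONTACT_SUPPORT)


for code in (200, 204, 401, 429, 500):
    print(code, "->", action_for(code).value)
```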
### Parser Status Codes
| Code | Status | Description |
|------|--------|-------------|
| **12000** | Success | Data parsed successfully |
| **12002** | Parse Failed | Complete parsing failure |
| **12003** | Not Supported | Target not supported |
| **12004** | Incomplete | Some fields missing |
| **12005** | Partial | Some fields unparsed |
| **12006** | Error | Unexpected error occurred |
| **12007** | Unknown | Parse status unclear |
| **12008** | Failed | Failed to parse data |
| **12009** | Not Found | Target parameters invalid |
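
The parser codes can be handled with a lookup table as well. This README does not say where the parser code appears in a response, so the sketch takes the code as a plain integer. Counting 12004 and 12005 as "partial" rather than "failed" is an assumption based on their descriptions, and `classify_parser_status` is a hypothetical name.

```python
# Descriptions copied from the parser status table above.
PARSER_STATUS = {
    12000: "Data parsed successfully",
    12002: "Complete parsing failure",
    12003: "Target not supported",
    12004: "Some fields missing",
    12005: "Some fields unparsed",
    12006: "Unexpected error occurred",
    12007: "Parse status unclear",
    12008: "Failed to parse data",
    12009: "Target parameters invalid",
}

# Assumption: these two codes still deliver some usable fields.
PARTIAL_CODES = {12004, 12005}


def classify_parser_status(code: int) -> str:
    """Return 'ok', 'partial', 'failed', or 'unrecognized' for a parser status code."""
    if code == 12000:
        return "ok"
    if code in PARTIAL_CODES:
        return "partial"
    if code in PARSER_STATUS:
        return "failed"
    return "unrecognized"


for code in (12000, 12004, 12008):
    print(code, classify_parser_status(code), "-", PARSER_STATUS[code])
```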
## Testing
### Postman Collection
[Run in Postman](https://app.getpostman.com/run-collection/23304112-92a123e7-171c-497e-8ca1-57eff04361f3?action=collection%2Ffork&collection-url=entityId%3D23304112-92a123e7-171c-497e-8ca1-57eff04361f3%26entityType%3Dcollection%26workspaceId%3D52705bab-433c-4fbf-afce-ccbfc97430fe)
Import our comprehensive Postman collection to test all API endpoints with pre-configured examples.
## Support
### Documentation & Resources
- [Full API Documentation](https://smartproxy.com/scraping/ecommerce)
- [Dashboard](https://dashboard.smartproxy.com)
### Getting Help
1. **Check Documentation**: Review this README and our full documentation
2. **Community Support**: Join our Discord for community help
3. **Technical Issues**: Contact our support team with your task ID
4. **Feature Requests**: Submit via GitHub issues
### Rate Limits & Best Practices
- Monitor your request quota in the dashboard
- Implement exponential backoff for retries
- Cache responses when appropriate
- Use appropriate `device_type` for your use case
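
The exponential backoff recommendation can be sketched as follows. The set of retryable codes is taken from the rows in the response code tables whose suggested action is to wait or retry. The delay schedule (1 s doubling up to a 60 s cap, plus up to 10% jitter) and the `fetch` callable returning a `(status, body)` pair are assumptions of this example rather than documented limits.

```python
import random
import time
from typing import Callable, Iterator, Tuple

# Codes whose suggested action in the tables above is to wait and retry.
RETRYABLE = {204, 429, 524}


def backoff_delays(retries: int, base: float = 1.0, cap: float = 60.0) -> Iterator[float]:
    """Yield base, 2*base, 4*base, ... seconds, never exceeding cap."""
    for attempt in range(retries):
        yield min(cap, base * 2 ** attempt)


def call_with_backoff(
    fetch: Callable[[], Tuple[int, object]],
    retries: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, object]:
    """Call fetch() and repeat it after a growing delay while the status is retryable."""
    status, body = fetch()
    for delay in backoff_delays(retries):
        if status not in RETRYABLE:
            break
        # A little random jitter keeps many clients from retrying in lockstep.
        sleep(delay + random.uniform(0, delay * 0.1))
        status, body = fetch()
    return status, body


print(list(backoff_delays(5)))
```

Passing `sleep` as a parameter lets tests substitute a recorder for `time.sleep`, so the retry logic can be checked without waiting.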
## License
This project is licensed under the [MIT License](./LICENSE).
---