https://github.com/oxylabs/oxylabs-mcp
https://github.com/oxylabs/oxylabs-mcp
Last synced: about 1 month ago
JSON representation
- Host: GitHub
- URL: https://github.com/oxylabs/oxylabs-mcp
- Owner: oxylabs
- License: mit
- Created: 2025-01-17T13:47:05.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-02-28T14:33:21.000Z (2 months ago)
- Last Synced: 2025-02-28T19:09:22.254Z (2 months ago)
- Language: Python
- Size: 46.9 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-mcp-servers - Oxylabs - Web data extraction with support for dynamic rendering and structured data parsing (π’ Enterprise-Supported Implementations / Browser & Web Automation)
- awesome-mcp-servers - Oxylabs - Web data extraction with support for dynamic rendering and structured data parsing (π’ Enterprise-Supported Implementations / Browser & Web Automation)
- awesome-mcp-zh - Oxylabs
- Awesome-MCP-Servers-directory - Oxylabs - Scrape websites with Oxylabs Web API, supporting dynamic rendering and parsing for structured data extraction (Browser Automation)
- awesome-mcp-servers - Oxylabs - Provides web scraping capabilities for AI assistants, bypassing anti-scraping measures and geo-restrictions (Table of Contents / Browser Automation)
- awesome-mcp-servers - Oxylabs - Provides web scraping capabilities for AI assistants, bypassing anti-scraping measures and geo-restrictions (Table of Contents / Browser Automation)
- awesome-mcp-servers - Oxylabs - Scrape websites with Oxylabs Web API, supporting dynamic rendering and parsing for structured data extraction. (Official Servers)
README
# MCP Server for Oxylabs Scraper
[](https://smithery.ai/server/@oxylabs/oxylabs-mcp)A Model Context Protocol (MCP) server that enables AI assistants like Claude to seamlessly access web data through Oxylabs' powerful web scraping technology.
## π Overview
The Oxylabs MCP server provides a bridge between AI models and the web. It enables them to scrape any URL, render JavaScript-heavy pages, extract and format content for AI use, bypass anti-scraping measures, and access geo-restricted data from 195+ countries.
This implementation leverages the Model Context Protocol (MCP) to create a secure, standardized way for AI assistants to interact with web content.
## β¨ Key Features
Scrape content from any site
- Extract data from any URL, including complex single-page applications
- Fully render dynamic websites using headless browser support
- Handle JavaScript-heavy and client-rendered content
- Choose full JavaScript rendering, HTML-only, or none
- Emulate Mobile and Desktop viewports for realistic renderingAutomatically get AI-ready data
- Automatically clean and convert HTML to Markdown for improved readability
- Use dedicated parsers extract structured data from popular targets like Google, Amazon, and etc.
- Define your custom parsing logic for any targetBypass blocks & geo-restrictions
- Bypass sophisticated bot protection systems with high success rate
- Reliably scrape even the most complex websites
- Access geo-specific content from 195+ countries
- Automatically rotate IPs and target specific regionsFlexible setup, batch scraping & cross-platform support
- Customize rendering and parsing options per request
- Handle multiple URLs in a single job using Batch requests
- Feed data directly into AI models or analytics tools
- Works on macOS, Windows, and LinuxBuilt-in error handling and smart request management
- Comprehensive error handling and reporting
- Intelligent rate limiting and request management## π‘ Example Queries
When you've set up the MCP server with Claude or another AI assistant, you can make requests like:
Could you scrape https://www.google.com/search?q=ai page?Scrape https://www.amazon.de/-/en/Smartphone-Contract-Function-Manufacturer-Exclusive/dp/B0CNKD651V with parse enabled.Scrape https://www.amazon.de/-/en/gp/bestsellers/beauty/ref=zg_bs_nav_beauty_0 with parse and render enabled.Use web unblocker with render to scrape https://www.bestbuy.com/site/top-deals/all-electronics-on-sale/pcmcat1674241939957.c## β Prerequisites
Before you begin, make sure you have:
- **Oxylabs Account**: Obtain your username and password from [Oxylabs](https://dashboard.oxylabs.io/) (1-week free trial available)
### Basic Usage (via Smithery CLI or Cursor)
- **Node.js** (v16+)
- `npx` command-line tool### Local/Dev Setup
- **Python 3.12+**
- `uv` package manager β install it using [this guide](https://docs.astral.sh/uv/getting-started/installation/)## βοΈ Basic Setup Instructions
### Install via Smithery (Recommended)
Automatically install Oxylabs MCP server for Claude Desktop via [Smithery](https://smithery.ai/server/@oxylabs/oxylabs-mcp):
```bash
npx -y @smithery/cli install @oxylabs/oxylabs-mcp --client claude
```
---
### Run on Cursor> [!IMPORTANT]
> Requires Cursor version `0.45.6+`To configure Oxylabs in Cursor:
1. Open Cursor **Settings**
2. Go to **Features > MCP Servers**
3. Click "**+ Add New MCP Server**"
4. Enter the following:
- **Name**: `oxylabs` (or your preferred name)
- **Type**: `command`
- **Command**:`npx -y @smithery/cli@latest run @oxylabs/oxylabs-mcp --config "{\"oxylabsUsername\":\"YOUR_USERNAME\",\"oxylabsPassword\":\"YOUR_PASSWORD\"}"`
Replace `your-username` and `your-password` with your Oxylabs credentials.> [!TIP]
> If you're using Windows and run into issues, try this:
>
> `cmd /c "set OXYLABS_USERNAME=your-username && set OXYLABS_PASSWORD=your-password && npx -y oxylabs-mcp"`After that, refresh the MCP server list to see the new tools. The Composer Agent will automatically use Oxylabs when appropriate, but you can explicitly request it by describing your web scraping needs. Access the Composer via `Command+L` (Mac), select "Agent" next to the submit button, and enter your query.
---
### Set up with Claude DesktopEnable **Developer Mode** and then navigate to **Claude β Settings β Developer β Edit Config** and edit your `claude_desktop_config.json` file as follows:
```json
{
"mcpServers": {
"oxylabs_scraper": {
"command": "uvx",
"args": ["oxylabs-mcp"],
"env": {
"OXYLABS_USERNAME": "YOUR_USERNAME_HERE",
"OXYLABS_PASSWORD": "YOUR_PASSWORD_HERE"
}
}
}
}
```
---## π» Local/Dev Setup Instructions
### Clone repository
```
git clone
```### Install dependencies
Install MCP server dependencies:
```bash
cd mcp-server-oxylabs# Create virtual environment and activate it
uv venvsource .venv/bin/activate # MacOS/Linux
# OR
.venv/Scripts/activate # Windows# Install dependencies
uv sync
```### Setup with Claude Desktop
Enable **Developer Mode** and then navigate to **Claude β Settings β Developer β Edit Config** and edit your `claude_desktop_config.json` file as follows:
```json
{
"mcpServers": {
"oxylabs_scraper": {
"command": "uv",
"args": [
"--directory",
"//oxylabs-mcp",
"run",
"oxylabs-mcp"
],
"env": {
"OXYLABS_USERNAME": "YOUR_USERNAME_HERE",
"OXYLABS_PASSWORD": "YOUR_PASSWORD_HERE"
}
}
}
}
```
---
### π Debugging```bash
make run
```
Then access MCP Inspector at `http://localhost:5173`. You may need to add your username and password as environment variables in the inspector under `OXYLABS_USERNAME` and `OXYLABS_PASSWORD`.## π§© API Parameters
The Oxylabs MCP server supports these parameters:
| Parameter | Description | Values |
|-----------|-------------|--------|
| `url` | The URL to scrape | Any valid URL |
| `parse` | Enable structured data extraction | `True` or `False` |
| `render` | Use headless browser rendering | `html` or `None` |## π οΈ Technical Details
This server provides two main tools:
1. **oxylabs_scraper**: Uses Oxylabs Web Scraper API for general website scraping
2. **oxylabs_web_unblocker**: Uses Oxylabs Web Unblocker for hard-to-access websites[Web Scraper API](https://oxylabs.io/products/scraper-api/web) supports JavaScript rendering, parsed structured data, and cleaned HTML in Markdown format. [Web Unblocker](https://oxylabs.io/products/web-unblocker) offers JavaScript rendering and cleaned HTML, but doesnβt return parsed data.
---
> [!WARNING]
> Usage with the MCP Inspector is affected by an ongoing issue with the Python SDK for MCP, see: https://github.com/modelcontextprotocol/python-sdk/pull/85. For Claude, a forked version of the SDK is used as a temporary fix.## License
This project is licensed under the [MIT License](LICENSE).
## About Oxylabs
Established in 2015, Oxylabs is a market-leading web intelligence collection
platform, driven by the highest business, ethics, and compliance standards,
enabling companies worldwide to unlock data-driven insights.[](https://oxylabs.io/)