https://github.com/kazuph/mcp-screenshot
Repository for MCP screenshot functionality
https://github.com/kazuph/mcp-screenshot
Last synced: 2 months ago
JSON representation
Repository for MCP screenshot functionality
- Host: GitHub
- URL: https://github.com/kazuph/mcp-screenshot
- Owner: kazuph
- Created: 2024-12-19T04:43:22.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2024-12-24T07:41:18.000Z (6 months ago)
- Last Synced: 2024-12-24T07:45:05.590Z (6 months ago)
- Language: JavaScript
- Size: 2.93 KB
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- mcp-index - Screenshot - Captures screenshots and performs OCR text recognition on macOS. Supports both Japanese and English text, offering multiple output formats. (Image and Video Generation)
README
# MCP Screenshot
An MCP server that captures screenshots and performs OCR text recognition.
## Features
- Screenshot capture (left half, right half, full screen)
- OCR text recognition (supports Japanese and English)
- Multiple output formats (JSON, Markdown, vertical, horizontal)## OCR Engines
This server uses two OCR engines:
1. [yomitoku](https://github.com/kazuph/yomitoku)
- Primary OCR engine
- High-accuracy Japanese text recognition
- Runs as an API server2. [Tesseract.js](https://github.com/naptha/tesseract.js)
- Fallback OCR engine
- Used when yomitoku is unavailable
- Supports both Japanese and English recognition## Installation
```bash
npx -y @kazuph/mcp-screenshot
```## Claude Desktop Configuration
Add the following configuration to your `claude_desktop_config.json`:
```json
{
"mcpServers": {
"screenshot": {
"command": "npx",
"args": ["-y", "@kazuph/mcp-screenshot"],
"env": {
"OCR_API_URL": "http://localhost:8000" // yomitoku API base URL
}
}
}
}
```## Environment Variables
| Variable Name | Description | Default Value |
|--------------|-------------|---------------|
| OCR_API_URL | yomitoku API base URL | http://localhost:8000 |## Usage Example
You can use it by instructing Claude like this:
```
Please take a screenshot of the left half of the screen and recognize the text in it.
```## Tool Specification
### capture
Takes a screenshot and performs OCR.
Options:
- `region`: Screenshot area ('left'/'right'/'full', default: 'left')
- `format`: Output format ('json'/'markdown'/'vertical'/'horizontal', default: 'markdown')## License
MIT
## Author
kazuph