An open API service indexing awesome lists of open source software.

https://github.com/PSPDFKit-labs/nutrient-agent-skill

Universal Agent Skill for document processing with Nutrient DWS API β€” works with Claude Code, Codex, Gemini CLI, Cursor, and 35+ more agents
https://github.com/PSPDFKit-labs/nutrient-agent-skill

agent-skills ai-agents claude-code codex cursor document-processing gemini-cli mcp nutrient pdf

Last synced: about 1 month ago
JSON representation

Universal Agent Skill for document processing with Nutrient DWS API β€” works with Claude Code, Codex, Gemini CLI, Cursor, and 35+ more agents

Awesome Lists containing this project

README

          

# Nutrient Document Processing β€” Agent Skill


Nutrient DWS API
License
Agent Skills


Give your AI agent PDF superpowers β€” in one command.

Convert, extract, OCR, redact, sign, and fill documents from any coding agent.


Demo: Ask your agent to redact PII from a PDF


Quickstart β€’
Workflows β€’
Features β€’
40+ Agents

---

## 30-Second Quickstart

**1. Get a free API key** β†’ ****

**2. Install & configure:**

```bash
# Install the skill (works with 40+ agents)
npx skills add PSPDFKit-labs/nutrient-agent-skill

# Set your API key
export NUTRIENT_API_KEY="pdf_live_..."
```

**3. Ask your agent:**

> *"Extract the text from invoice.pdf"*

That's it. Your agent now has full document processing capabilities.

---

## Requirements

- Python 3.10+
- `uv` installed:
- Nutrient API key

---

## Supported Agents

Works out of the box with **40+ AI coding agents**:


Claude Code
Codex CLI
Gemini CLI
Cursor
GitHub Copilot
Windsurf
OpenCode
Amp
Roo Code
OpenClaw
and 30 more

Any agent that supports the [Agent Skills](https://agentskills.io) standard works automatically.

---

## Real-World Workflows

### πŸ” Workflow 1: OCR a scanned document and extract text

You have a scanned PDF β€” no selectable text. Ask your agent:

> *"OCR scanned-contract.pdf in English and extract the text to a file"*

**What happens:**
```
scanned-contract.pdf (image-only)
β†’ OCR (English) β†’ searchable-contract.pdf (selectable text)
β†’ Extract text β†’ contract-text.txt
```

OCR workflow

### πŸ“‹ Workflow 2: Fill a PDF form and sign it

You have an onboarding form to complete. Ask your agent:

> *"Fill employee-onboarding.pdf with name 'Jane Smith', start date '2026-03-01', and department 'Engineering', then digitally sign it"*

**What happens:**
```
employee-onboarding.pdf (blank form)
β†’ Fill fields (name, date, department)
β†’ Digital signature (CMS)
β†’ employee-onboarding-signed.pdf βœ…
```

Fill form and sign workflow

### πŸ”’ Workflow 3: Redact PII before sharing

You need to share a document but it contains sensitive data. Ask your agent:

> *"Redact all social security numbers, email addresses, and credit card numbers from patient-records.pdf"*

**What happens:**
```
patient-records.pdf (contains PII)
β†’ Detect SSNs, emails, credit cards
β†’ Apply black redaction boxes (irreversible)
β†’ patient-records-redacted.pdf πŸ”’
```

> **Tip:** For smarter redaction, try: *"Use AI redaction to find and remove all personally identifiable information from contract.pdf"* β€” this uses contextual AI analysis instead of pattern matching.

---

## Features

| Capability | Description | Example prompt |
|------------|-------------|----------------|
| πŸ“„ **Convert** | PDF ↔ DOCX/XLSX/PPTX, HTML β†’ PDF, images β†’ PDF | *"Convert report.docx to PDF"* |
| πŸ“ **Extract** | Text, tables, and key-value pairs from PDFs | *"Extract all tables from invoice.pdf as Excel"* |
| πŸ” **OCR** | Multi-language OCR for scanned documents | *"OCR this German scan and extract the text"* |
| πŸ”’ **Redact** | Pattern-based + AI-powered PII redaction | *"Redact all SSNs and emails from records.pdf"* |
| πŸ’§ **Watermark** | Text or image watermarks with full styling | *"Add a DRAFT watermark to proposal.pdf"* |
| ✍️ **Sign** | CMS and CAdES digital signatures | *"Digitally sign contract.pdf"* |
| πŸ“‹ **Fill Forms** | Programmatic PDF form filling | *"Fill the tax form with these values…"* |
| πŸ“Š **Credits** | Monitor API usage and balance | *"How many API credits do I have left?"* |

---

## Installation

### Using `npx skills` (Recommended)

```bash
# Install to all detected agents
npx skills add PSPDFKit-labs/nutrient-agent-skill

# Install to specific agents only
npx skills add PSPDFKit-labs/nutrient-agent-skill -a claude-code -a codex -a cursor

# Install globally (available across all projects)
npx skills add PSPDFKit-labs/nutrient-agent-skill -g
```

### Manual Installation

Copy the `nutrient-document-processing/` folder to your agent's skills directory:

| Agent | Project Path | Global Path |
|-------|-------------|-------------|
| **Claude Code** | `.claude/skills/` | `~/.claude/skills/` |
| **Codex CLI** | `.codex/skills/` | `~/.codex/skills/` |
| **Gemini CLI** | `.gemini/skills/` | `~/.gemini/skills/` |
| **Cursor** | `.cursor/skills/` | `~/.cursor/skills/` |
| **GitHub Copilot** | `.github/skills/` | `~/.copilot/skills/` |
| **OpenCode** | `.opencode/skills/` | `~/.config/opencode/skills/` |
| **Windsurf** | `.windsurf/skills/` | `~/.codeium/windsurf/skills/` |
| **Amp** | `.agents/skills/` | `~/.config/agents/skills/` |
| **OpenClaw** | `skills/` | `~/.moltbot/skills/` |
| **Roo Code** | `.roo/skills/` | `~/.roo/skills/` |

Example for Claude Code:

```bash
git clone https://github.com/PSPDFKit-labs/nutrient-agent-skill.git
cp -r nutrient-agent-skill/nutrient-document-processing ~/.claude/skills/
```

---

## Skill Structure

```
nutrient-document-processing/
β”œβ”€β”€ SKILL.md # Main instructions (loaded by agents)
β”œβ”€β”€ scripts/
β”‚ β”œβ”€β”€ *.py # Single-operation scripts
β”‚ └── lib/common.py # Shared utilities
β”œβ”€β”€ assets/
β”‚ └── templates/
β”‚ └── custom-workflow-template.py # Runtime pipeline template
β”œβ”€β”€ tests/
β”‚ └── testing-guide.md
└── LICENSE # Apache-2.0
```

### Script Model

- `scripts/*.py` are single-operation scripts only.
- Multi-step workflows are generated at runtime in a temporary script from `assets/templates/custom-workflow-template.py`.
- Do not commit runtime pipeline scripts.

## Documentation

- **[SKILL.md](nutrient-document-processing/SKILL.md)** β€” Agent instructions with setup and operation examples
- **[Testing Guide](nutrient-document-processing/tests/testing-guide.md)** β€” Manual test procedures
- **[Custom Workflow Template](nutrient-document-processing/assets/templates/custom-workflow-template.py)** β€” Runtime pipeline starting point
- **[API Playground](https://dashboard.nutrient.io/processor-api/playground/)** β€” Interactive API testing
- **[Official API Docs](https://www.nutrient.io/guides/dws-processor/)** β€” Nutrient documentation

## About

Built by [Nutrient](https://www.nutrient.io/) (formerly PSPDFKit) β€” document SDKs trusted by thousands of companies worldwide.

## License

[Apache-2.0](nutrient-document-processing/LICENSE)