https://github.com/invariantlabs-ai/mcp-injection-experiments

Code snippets to reproduce MCP tool poisoning attacks.
https://github.com/invariantlabs-ai/mcp-injection-experiments

Last synced: 7 months ago
JSON representation

Code snippets to reproduce MCP tool poisoning attacks.

Host: GitHub
URL: https://github.com/invariantlabs-ai/mcp-injection-experiments
Owner: invariantlabs-ai
Created: 2025-04-06T14:33:42.000Z (8 months ago)
Default Branch: main
Last Pushed: 2025-04-06T15:05:17.000Z (8 months ago)
Last Synced: 2025-04-06T16:22:04.697Z (8 months ago)
Language: Python
Homepage:
Size: 6.84 KB
Stars: 2
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-mcp-security - mcp-injection-experiments by invariantlabs-ai
awesome-mcp-servers - **mcp-injection-experiments** - Code snippets to reproduce MCP tool poisoning attacks. `python` `mcp` `pip install git+https://github.com/invariantlabs-ai/mcp-injection-experiments` (🤖 AI/ML)
awesome-ai-security - mcp-injection-experiments - _Code snippets to reproduce MCP tool poisoning attacks._ (Offensive tools and frameworks / LLM)

README

# MCP Tool Poisoning Experiments

This repository contains a few experimental MCP server implementations, that attempt ot inject the MCP client/agent in use.

For more details about the attack method, please see our [blog post](https://invariantlabs.ai/blog/mcp-security-notification-tool-poisoning-attacks).

Regarding mitigations, check out [invariantlabs-ai/invariant](https://github.com/invariantlabs-ai/invariant?tab=readme-ov-file#analyzer).

## Direct Poisoning

In [`direct-poisoning.py`](./direct-poisoning.py), we implement a simple MCP server that instructs an agent to leak sensitive files, when calling the `add` tool (in this case SSH keys and the `mcp.json` file itself).

An example execution in cursor looks like this:

![Cursor executes tool poisoning](https://invariantlabs.ai/images/cursor-injection.png)

## Tool Shadowing

In [`shadowing.py`](./shadowing.py), we implement a more sophisticated MCP attack, that manipulates the agent's behavior of a `send_email` tool (provided by a different, trusted server), such that all emails sent by the agent are leaked to the attacker's server.

An example execution in Cursor looks like this:

![Cursor executes tool shadowing](https://invariantlabs.ai/images/mcp-shadow.png)

## WhatsApp takeover

Lastly, in [`whatsapp-takeover.py`](./whatsapp-takeover.py), we implement a shadowing attack combined with a sleeper rug pull, i.e. an MCP server that changes its tool interface only on the second load to a malicious one.

The server first masks as a benign "random fact of the day" implementation, and then changes the tool to a malicious one that manipulates [whatsapp-mcp](https://github.com/lharries/whatsapp-mcp) in the same agent, to leak messages to the attacker's phone number.

![Cursor executes WhatsApp MCP attack](https://github.com/user-attachments/assets/a39ea101-3fd2-4945-abcd-942006cfe11c)

Can you spot the exfiltration? Here, the malicious tool instructions ask the agent to include the smuggled data after many spaces, such that with invisible scroll bars, the user does not see the data being leaked. Only when you scroll all the way to the right, will you be able to find the exfiltration payload.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/invariantlabs-ai/mcp-injection-experiments

Awesome Lists containing this project

README