https://github.com/ProjectRecon/awesome-ai-agents-security
A living map of the AI agent security ecosystem.
https://github.com/ProjectRecon/awesome-ai-agents-security
List: awesome-ai-agents-security
ai-agents ai-security autonomous-agents awesome awesome-list cybersecurity generative-ai guardrails jailbreak large-language-models llm llm-ops llm-security prompt-injection red-teaming runtime-protection sandboxing secure-ai static-analysis vulnerability-scanners
Last synced: 3 days ago
JSON representation
A living map of the AI agent security ecosystem.
- Host: GitHub
- URL: https://github.com/ProjectRecon/awesome-ai-agents-security
- Owner: ProjectRecon
- License: other
- Created: 2025-12-08T07:58:57.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2025-12-10T15:48:10.000Z (6 months ago)
- Last Synced: 2026-05-16T01:45:23.108Z (11 days ago)
- Topics: ai-agents, ai-security, autonomous-agents, awesome, awesome-list, cybersecurity, generative-ai, guardrails, jailbreak, large-language-models, llm, llm-ops, llm-security, prompt-injection, red-teaming, runtime-protection, sandboxing, secure-ai, static-analysis, vulnerability-scanners
- Homepage:
- Size: 33.2 KB
- Stars: 28
- Watchers: 1
- Forks: 33
- Open Issues: 26
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
- awesome-claude-code-security - awesome-ai-agents-security - Living map of the AI agent security ecosystem covering orchestration risks. (๐ค Agent Orchestration and Loop Safety / Claude Code Specific)
- awesome-rainmana - ProjectRecon/awesome-ai-agents-security - A living map of the AI agent security ecosystem. (Others)
README
# Awesome AI Agents Security
[](https://awesome.re)
[](http://creativecommons.org/publicdomain/zero/1.0/)
[](https://github.com/ProjectRecon/awesome-ai-agents-security/blob/main/CONTRIBUTING.md)
[](https://github.com/ProjectRecon/awesome-ai-agents-security/graphs/commit-activity)
A curated list of open-source tools, frameworks, and resources for securing autonomous AI agents.
This list is organized by the **security lifecycle** of an autonomous agent, covering red teaming, runtime protection, sandboxing, and governance.
## ๐ Table of Contents
- [Agent Firewalls & Gateways (Runtime Protection)](#-agent-firewalls--gateways-runtime-protection)
- [Red Teaming & Vulnerability Scanners](#-red-teaming--vulnerability-scanners)
- [Static Analysis & Linters](#-static-analysis--linters)
- [Sandboxing & Isolation Environments](#-sandboxing--isolation-environments)
- [Guardrails & Compliance](#-guardrails--compliance)
- [Benchmarks & Datasets](#-benchmarks--datasets)
- [Identity & Authentication](#-identity--authentication)
- [Contributing](#-contributing)
---
## ๐ก๏ธ Agent Firewalls & Gateways (Runtime Protection)
*Tools that sit between the agent and the world to filter traffic, prevent unauthorized tool access, and block prompt injections.*
- **[AgentGateway](https://github.com/agentgateway/agentgateway)** - A Linux Foundation project providing an AI-native proxy for secure connectivity (A2A & MCP protocols). It adds RBAC, observability, and policy enforcement to agent-tool interactions.
- **[Envoy AI Gateway](https://gateway.envoyproxy.io/)** - An Envoy-based gateway that manages request traffic to GenAI services, providing a control point for rate limiting and policy enforcement.
## โ๏ธ Red Teaming & Vulnerability Scanners
*Offensive tools to test agents for security flaws, loop conditions, and unauthorized actions.*
- **[Strix](https://github.com/usestrix/strix)** - An autonomous AI agent designed for penetration testing. It runs inside a docker sandbox to actively probe applications and generate verified exploit capabilities.
- **[PyRIT](https://github.com/Azure/PyRIT)** - Microsoftโs open-source red teaming framework for generative AI. It automates multi-turn adversarial attacks to test if an agent can be coerced into harmful behavior.
- **[Agentic Security](https://github.com/msoedov/agentic_security)** - A dedicated vulnerability scanner for agent workflows and LLMs capable of running multi-step jailbreaks and fuzzing attacks against agent logic.
- **[Garak](https://github.com/leondz/garak)** - The "Nmap for LLMs." A vulnerability scanner that probes models for hallucination, data leakage, and prompt injection susceptibilities.
- **[A2A Scanner](https://github.com/cisco-ai-defense/a2a-scanner)** - A scanner by Cisco designed to inspect "Agent-to-Agent" communication protocols for threats, validating agent identities and ensuring compliance with communication specs.
- **[Cybersecurity AI (CAI)](https://github.com/aliasrobotics/cai)** - A framework for building specialized security agents for offensive and defensive operations, often used in CTF (Capture The Flag) scenarios.
## ๐ Static Analysis & Linters
*Tools to analyze agent configuration and logic code before deployment.*
- **[Agentic Radar](https://github.com/splx-ai/agentic-radar)** - A static analysis tool that visualizes agent workflows (LangGraph, CrewAI, AutoGen). It detects risky tool usage, permission loops, and maps them to known vulnerabilities.
- **[Agent Bound](https://github.com/ElPaisano/agent-bound)** - A design-time analysis tool that calculates "Agentic Entropy"โa metric to quantify the unpredictability and risk of infinite loops or unconstrained actions in agent architectures.
- **[Checkov](https://github.com/bridgecrewio/checkov)** - While primarily for IaC, Checkov includes policies for scanning AI infrastructure and configurations to prevent misconfigurations in deployment.
## ๐ฆ Sandboxing & Isolation Environments
*Secure runtimes to prevent agents from damaging the host system during code execution.*
- **[SandboxAI](https://github.com/substratusai/sandboxai)** - An open-source runtime for executing AI-generated code (Python/Shell) in isolated containers with granular permission controls.
- **[Kubernetes Agent Sandbox](https://github.com/kubernetes-sigs/agent-sandbox)** - A Kubernetes Native project providing a Sandbox Custom Resource Definition (CRD) to manage isolated, stateful workloads for AI agents.
- **[Agent-Infra Sandbox](https://github.com/agent-infra/sandbox)** - An "All-In-One" sandbox combining Browser, Shell, VSCode, and File System access in a single Docker container, optimized for agentic tasks.
- **[OpenHands](https://github.com/All-Hands-AI/OpenHands)** - Formerly OpenDevin, this platform includes a secure runtime environment for autonomous coding agents to operate without accessing the host machine's sensitive files.
## ๐ง Guardrails & Compliance
*Middleware to enforce business logic and safety policies on inputs and outputs.*
- **[NeMo Guardrails](https://github.com/NVIDIA/NeMo-Guardrails)** - NVIDIAโs toolkit for adding programmable rails to LLM-based apps. It ensures agents stay on topic, avoid jailbreaks, and adhere to defined safety policies.
- **[Guardrails](https://github.com/guardrails-ai/guardrails)** - A Python framework for validating LLM outputs against structural and semantic rules (e.g., "must return valid JSON," "must not contain PII").
- **[LiteLLM Guardrails](https://github.com/BerriAI/litellm)** - While known for model proxying, LiteLLM includes built-in guardrail features to filter requests and responses across multiple LLM providers.
## ๐ Benchmarks & Datasets
*Resources to evaluate agent security performance.*
- **[CVE Bench](https://github.com/uiuc-kang-lab/cve-bench)** - A benchmark for evaluating an AI agent's ability to exploit real-world web application vulnerabilities (useful for testing defensive agents).
## ๐ Identity & Authentication
*Tools to manage agent identity (non-human identities).*
- **[WSO2](https://github.com/wso2)** - An identity management solution that treats AI agents as first-class identities, enabling secure authentication and authorization for agent actions.
---
## ๐ค Contributing
Contributions are welcome! Please read the contribution guidelines first.
1. Fork the project.
2. Create your feature branch (`git checkout -b feature/AmazingFeature`).
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`).
4. Push to the branch (`git push origin feature/AmazingFeature`).
5. Open a Pull Request.