https://github.com/promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
ci ci-cd cicd evaluation evaluation-framework llm llm-eval llm-evaluation llm-evaluation-framework llmops pentesting prompt-engineering prompt-testing prompts rag red-teaming testing vulnerability-scanners
- Host: GitHub
- URL: https://github.com/promptfoo/promptfoo
- Owner: promptfoo
- License: mit
- Created: 2023-04-28T15:48:49.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2025-03-08T00:53:33.000Z (about 1 year ago)
- Last Synced: 2025-03-08T01:24:43.599Z (about 1 year ago)
- Topics: ci, ci-cd, cicd, evaluation, evaluation-framework, llm, llm-eval, llm-evaluation, llm-evaluation-framework, llmops, pentesting, prompt-engineering, prompt-testing, prompts, rag, red-teaming, testing, vulnerability-scanners
- Language: TypeScript
- Homepage: https://promptfoo.dev
- Size: 308 MB
- Stars: 5,756
- Watchers: 21
- Forks: 475
- Open Issues: 188
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE
- Citation: CITATION.cff
Awesome Lists containing this project
- jimsghstars - promptfoo/promptfoo - Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. (TypeScript)
- awesome-ChatGPT-repositories - promptfoo - Test your prompts, models, RAGs. Evaluate and compare LLM outputs, catch regressions, and improve prompt quality. LLM evals for OpenAI/Azure GPT, Anthropic Claude, VertexAI Gemini, Ollama, Local & private models like Mistral/Mixtral/Llama with CI/CD (Prompts)
- awesome-llm-tools - Promptfoo
- awesome-llm-services - Promptfoo
- StarryDivineSky - promptfoo/promptfoo
- awesome-mistral - Promptfoo (Tooling & Dev Experience / Development Tools)
- awesome-LLM-security - PromptFoo - A security testing framework for comprehensive red teaming, pentesting, and vulnerability scanning of LLMs. (LLM And GenAI Security Testing Tools)
- awesome-langchain - Promptfoo
- Awesome-AI-Security - promptfoo
- AiTreasureBox - promptfoo/promptfoo - Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. (Repos)
- awesome-ai-ml-testing - promptfoo - Testing and evaluation framework for LLM prompts. (LLM & Chatbot Testing)
- Awesome-Jailbreak-on-LLMs - promptfoo/promptfoo
- awesome-MLSecOps - Promptfoo Scanner - Open-source LLM red teaming tool (Open Source Security Tools)
- awesome-safety-critical-ai - `promptfoo/promptfoo` - Developer-friendly local tool for testing LLM applications (Tools / Bleeding Edge)
- Awesome-Prompt-Engineering - promptfoo/promptfoo
- awesome-langchain-zh - Promptfoo
- AwesomeResponsibleAI - Promptfoo
- awesome-ai-coding-tools - Promptfoo - Open-source tool for testing, evaluating, and red-teaming LLM prompts and applications. (AI Frameworks and SDKs)
- awesome-production-machine-learning - Promptfoo - Promptfoo is a developer-friendly local tool for testing LLM applications. (Evaluation and Monitoring)
- Awesome-LLM-RAG-Application - promptfoo
- Awesome-LLM4Security - promptfoo
- awesome-machine-learning - promptfoo - Open-source LLM evaluation and red teaming framework. Test prompts, models, agents, and RAG pipelines. Run adversarial attacks (jailbreaks, prompt injection) and integrate security testing into CI/CD. (Tools / General-Purpose Machine Learning)
- Awesome-AI-Evaluation-Guide - Promptfoo - CLI tool for prompt testing with cost tracking and regression detection (Tools & Platforms / Open Source Frameworks)
- llmops - PromptFoo - Test and evaluate LLM outputs (What's New / Recently Added (January 2026))
- awesome-testing - promptfoo - Open-source framework for testing and red teaming LLM applications. Compare prompts, test RAG architectures, run multi-turn adversarial attacks, and catch security vulnerabilities with CI/CD integration. (Software / AI & LLM Testing)
- awesome-prompt-engineering - PromptFoo - Test and evaluate LLM outputs (Tools & Frameworks / Prompt Testing & Optimization)
- awesome-learning - Promptfoo - Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs.
- fucking-awesome-machine-learning - promptfoo - Open-source LLM evaluation and red teaming framework. Test prompts, models, agents, and RAG pipelines. Run adversarial attacks (jailbreaks, prompt injection) and integrate security testing into CI/CD. (Tools / General-Purpose Machine Learning)
- awesome-opensource-ai - Promptfoo - LLM testing and red-teaming framework. (Contents / MLOps / LLMOps & Production)
- awesome - promptfoo/promptfoo - Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic. (TypeScript)
- awesome-ai-cybersecurity - promptfoo - Open-source LLM red teaming and vulnerability scanner. 100+ attack types, 250k+ users. (Securing AI SaaS / Application Security)
- awesome-rainmana - promptfoo/promptfoo - Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. (TypeScript)
- awesome-ai - Promptfoo
- awesome-production-llm - promptfoo
- awesome-ai-eval - **Promptfoo** - Local-first CLI and dashboard for evaluating prompts, RAG flows, and agents with cost tracking and regression detection. (Tools / Evaluators and Test Harnesses)
- awesome-llm-tools - PromptFoo (Prompt Optimization)
- awesome-gpt-security - promptfoo - LLM red teaming and evaluation framework. Includes modelaudit for scanning ML models for malicious code, backdoors, and serialization attacks. CI/CD integration (GPT Security / Standard)
- awesome-production-agentic-systems - promptfoo - promptfoo is an LLM red teaming and evaluation framework for testing jailbreaks, prompt injection, and vulnerabilities with adversarial attacks and CI/CD integration. (Agent Security)
- awesome-agentic-ai - Promptfoo - Compare prompts, models, and configurations with reproducible tests. (Evaluation, Observability & Safety / Evaluation & Observability)
- awesome-hacking-lists - promptfoo/promptfoo - Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. (TypeScript)
- awesome-agent-cortex - Promptfoo - Testing and evaluation framework for LLM prompts. (Prompt Engineering / Codex Resources)
- awesome-llm - Promptfoo - Developer-friendly LLM testing tool for evaluating prompt quality and model outputs and preventing regressions. (Prompt Engineering & Optimization / Inference Gateways)
- Awesome-AI-For-Security - promptfoo - Open-source LLM red teaming tool for finding and fixing vulnerabilities. 100+ attack types, 250k+ users. (Tools & Frameworks / Security Testing)
- awesome-ai-offensive-security - Promptfoo - A developer-first framework for AI red teaming and evaluations with flexible configuration and Python integration. (AI Red Teaming (Testing AI Targets))
- awesome-harness-engineering - promptfoo/promptfoo
- awesome-llmops - PromptFoo
- awesome-ai-security - promptfoo - _Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration._ (Attack Techniques & Red Teaming / LLM & GenAI Red Teaming)
- awesome - promptfoo/promptfoo - Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. (TypeScript)
- my-awesome - promptfoo/promptfoo - Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic. (TypeScript)
- awesome - promptfoo/promptfoo - Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic. (TypeScript)
README
# Promptfoo: LLM evals & red teaming
[npm](https://npmjs.com/package/promptfoo) · [CI](https://github.com/promptfoo/promptfoo/actions/workflows/main.yml) · [Discord](https://discord.gg/promptfoo)
`promptfoo` is a developer-friendly local tool for testing LLM applications. Stop the trial-and-error approach and start shipping secure, reliable AI apps.
## Quick Start
```sh
# Install and initialize project
npx promptfoo@latest init
# Run your first evaluation
npx promptfoo eval
```
See [Getting Started](https://www.promptfoo.dev/docs/getting-started/) (evals) or [Red Teaming](https://www.promptfoo.dev/docs/red-team/) (vulnerability scanning) for more.
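`npx promptfoo init` scaffolds a `promptfooconfig.yaml` in the current directory. As a rough sketch of the declarative format (the prompt, provider id, and assertion values below are illustrative, not the generated defaults):

```yaml
# promptfooconfig.yaml — illustrative example, not the scaffolded default
description: French translation eval
prompts:
  - "Translate the following text to French: {{text}}"
providers:
  - openai:gpt-4o-mini # any supported provider id works here
tests:
  - vars:
      text: Hello, world
    assert:
      - type: icontains # case-insensitive substring check
        value: bonjour
```

Running `npx promptfoo eval` in the same directory then evaluates every prompt-provider-test combination and reports pass/fail per assertion.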
## What can you do with Promptfoo?
- **Test your prompts and models** with [automated evaluations](https://www.promptfoo.dev/docs/getting-started/)
- **Secure your LLM apps** with [red teaming](https://www.promptfoo.dev/docs/red-team/) and vulnerability scanning
- **Compare models** side-by-side (OpenAI, Anthropic, Azure, Bedrock, Ollama, and [more](https://www.promptfoo.dev/docs/providers/))
- **Automate checks** in [CI/CD](https://www.promptfoo.dev/docs/integrations/ci-cd/)
- **Share results** with your team
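The CI/CD check above can be wired up as, for example, a GitHub Actions workflow. This is a sketch under assumptions: the workflow filename, trigger, and `OPENAI_API_KEY` secret are placeholders for your own setup.

```yaml
# .github/workflows/prompt-eval.yml — hypothetical workflow
name: prompt-eval
on: [pull_request]
jobs:
  eval:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version: 20
      # Fail the build if any assertion in promptfooconfig.yaml fails
      - run: npx promptfoo@latest eval -c promptfooconfig.yaml
        env:
          OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
```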
Promptfoo includes a web viewer and a command-line interface for reviewing eval results, and it can generate [security vulnerability reports](https://www.promptfoo.dev/docs/red-team/).
## Why promptfoo?
- **Developer-first**: Fast, with features like live reload and caching
- **Private**: Runs 100% locally - your prompts never leave your machine
- **Flexible**: Works with any LLM API or programming language
- **Battle-tested**: Powers LLM apps serving 10M+ users in production
- **Data-driven**: Make decisions based on metrics, not gut feel
- **Open source**: MIT licensed, with an active community
## Learn More
- [Full Documentation](https://www.promptfoo.dev/docs/intro/)
- [Red Teaming Guide](https://www.promptfoo.dev/docs/red-team/)
- [Getting Started](https://www.promptfoo.dev/docs/getting-started/)
- [CLI Usage](https://www.promptfoo.dev/docs/usage/command-line/)
- [Node.js Package](https://www.promptfoo.dev/docs/usage/node-package/)
- [Supported Models](https://www.promptfoo.dev/docs/providers/)
## Contributing
We welcome contributions! Check out our [contributing guide](https://www.promptfoo.dev/docs/contributing/) to get started.
Join our [Discord community](https://discord.gg/promptfoo) for help and discussion.