https://github.com/Giskard-AI/giskard
🐢 Open-Source Evaluation & Testing for AI & LLM systems
- Host: GitHub
- URL: https://github.com/Giskard-AI/giskard
- Owner: Giskard-AI
- License: apache-2.0
- Created: 2022-03-06T21:45:37.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2025-04-09T15:13:59.000Z (about 1 year ago)
- Last Synced: 2025-04-09T15:32:59.935Z (about 1 year ago)
- Topics: agent-evaluation, ai-red-team, ai-security, ai-testing, fairness-ai, llm, llm-eval, llm-evaluation, llm-security, llmops, ml-testing, ml-validation, mlops, rag-evaluation, red-team-tools, responsible-ai, trustworthy-ai
- Language: Python
- Homepage: https://docs.giskard.ai
- Size: 175 MB
- Stars: 4,457
- Watchers: 34
- Forks: 316
- Open Issues: 33
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Codeowners: .github/CODEOWNERS
Awesome Lists containing this project
- awesome-mlops - Giskard
- awesome-llm - Giskard
- awesome-llmops - Giskard (Security / Observability)
- awesome-MLSecOps - Giskard - Open-source testing tool for LLM applications (Open Source Security Tools)
- AwesomeResponsibleAI - Giskard
- awesome-production-machine-learning - Giskard - Giskard is an open-source Python library that automatically detects performance, bias & security issues in AI applications. (Evaluation and Monitoring)
- StarryDivineSky - Giskard-AI/giskard
- Awesome-LLM - Giskard - Testing & evaluation library for LLM applications, in particular RAGs (LLM Evaluation:)
- awesome-ai-testing - Giskard - Testing framework for LLMs and ML models. (LLM and AI System Testing)
- Awesome-AI-Security - Giskard (Tools / Red-Teaming Harnesses & Automated Security Testing)
- awesome-safety-critical-ai - `Giskard-AI/giskard`
- awesome-ai-security - Giskard - _Open-Source Evaluation & Testing for AI & LLM systems_ (Attack Techniques & Red Teaming / LLM & GenAI Red Teaming)
- awesome-LLM-security - Giskard - Giskard is an open-source Python library that automatically detects performance, bias & security issues in AI applications. The library covers LLM-based applications such as RAG agents, all the way to traditional ML models for tabular data. (⚔️ LLM And GenAI Security Testing Tools)
- awesome-hacking-lists - Giskard-AI/giskard - 🐢 Open-Source Evaluation & Testing for AI & LLM systems (Python)
- awesome-llmops - Giskard
- awesome-ai-eval - **Giskard** - Testing framework for ML models with vulnerability scanning and LLM-specific detectors. (Platforms / Open Source Platforms)
- awesome-llm-tools - Giskard - Open-source testing framework for LLMs and ML models; automated red-teaming; evaluation (3. Prompt Optimization / Rust)
- awesome-local-llm - giskard - open-source evaluation & testing for AI & LLM systems (Tools / Testing, Evaluation and Observability)
- awesome-ai-offensive-security - Giskard - An automated red-teaming platform for LLM agents (chatbots, RAG pipelines). Performs dynamic multi-turn stress tests to uncover context-dependent vulnerabilities. (AI Red Teaming (Testing AI Targets))
- awesome-open-data-centric-ai - Giskard (Security and robustness)
- jimsghstars - Giskard-AI/giskard - 🐢 Open-Source Evaluation & Testing for AI & LLM systems (Python)
- awesome-production-llm - giskard - Open-Source Evaluation & Testing for LLMs and ML models (LLM Testing / Monitoring)