https://github.com/cisco-open/adversaryshield
Mitigate adversial attacks on LLMs via automatic deployment of predefined plugins.
https://github.com/cisco-open/adversaryshield
Last synced: about 2 months ago
JSON representation
Mitigate adversial attacks on LLMs via automatic deployment of predefined plugins.
- Host: GitHub
- URL: https://github.com/cisco-open/adversaryshield
- Owner: cisco-open
- Created: 2024-10-01T06:50:40.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2025-01-08T10:32:40.000Z (4 months ago)
- Last Synced: 2025-01-08T11:32:31.174Z (4 months ago)
- Language: Python
- Size: 1.89 MB
- Stars: 0
- Watchers: 0
- Forks: 2
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- Code of conduct: CODE_OF_CONDUCT.md
- Codeowners: .github/CODEOWNERS
- Security: SECURITY.md
Awesome Lists containing this project
README
# AdversaryShield
This project is being developed as a solution to mitigate adversial attacks on LLMs via automatic deployment of predefined plugins.
## Description
__AdversaryShield__ project is composed of three main services:
- __Tabularium__ provides the administrator with an interface to CRUD plugins and store into the DB;
- __Galea__ creates the containers to run the plugins based on manager configuration and helm charts;Other services used are:
- __\__: Used to store user test suites.## Service Diagram

## Getting started
### Dependencies
- Docker Engine (Docker Desktop)
- minikube
- Python### Deploying
ToDo: Run the __launch.sh__ script.
### Usage
Launch the application and use the Strategos provided interface to store the plugins (and their parameters) via the plugins view.
__WIP__: The run view will provide the user with the posibility of running a prompt through the plugins towards the desired LLM and have the outputs filtered in case of adversial attacks. Outputs with respective inputs will be tracked within the database for later review of mitigations.## Authors
- Charles Fleming ([email protected]), Project Manager
- Vasile Vartic ([email protected]), Frontend Developer
- Andrei Manea ([email protected]), Backend Developer## Version history
- alpha 0.1
- Init## License
Work in progress.