https://github.com/thu-ml/mla-trust
A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions through 34 interactive tasks
https://github.com/thu-ml/mla-trust
agent benchmark controllability multi-modal privacy safety toolbox trustworthy-ai truthfulness
Last synced: 2 months ago
JSON representation
A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions through 34 interactive tasks
- Host: GitHub
- URL: https://github.com/thu-ml/mla-trust
- Owner: thu-ml
- License: mit
- Created: 2025-06-19T06:45:35.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-06-19T07:21:34.000Z (4 months ago)
- Last Synced: 2025-06-19T07:50:19.051Z (4 months ago)
- Topics: agent, benchmark, controllability, multi-modal, privacy, safety, toolbox, trustworthy-ai, truthfulness
- Language: Python
- Homepage: https://mla-trust.github.io
- Size: 1.69 MB
- Stars: 17
- Watchers: 0
- Forks: 1
- Open Issues: 0