https://github.com/thu-ml/mla-trust

A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions through 34 interactive tasks
https://github.com/thu-ml/mla-trust

agent benchmark controllability multi-modal privacy safety toolbox trustworthy-ai truthfulness

Last synced: 4 months ago
JSON representation

A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions through 34 interactive tasks

Host: GitHub
URL: https://github.com/thu-ml/mla-trust
Owner: thu-ml
License: mit
Created: 2025-06-19T06:45:35.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2025-12-31T02:09:43.000Z (6 months ago)
Last Synced: 2026-01-04T03:13:32.877Z (6 months ago)
Topics: agent, benchmark, controllability, multi-modal, privacy, safety, toolbox, trustworthy-ai, truthfulness
Language: Python
Homepage: https://mla-trust.github.io
Size: 1.96 MB
Stars: 61
Watchers: 0
Forks: 4
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/thu-ml/mla-trust

Awesome Lists containing this project