https://github.com/Co1lin/CWEval
Simultaneous evaluation on both functionality and security of LLM-generated code.
https://github.com/Co1lin/CWEval
llm4code vulnerability
Last synced: 2 months ago
JSON representation
Simultaneous evaluation on both functionality and security of LLM-generated code.
- Host: GitHub
- URL: https://github.com/Co1lin/CWEval
- Owner: Co1lin
- License: apache-2.0
- Created: 2024-10-31T19:06:00.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-11-22T01:06:18.000Z (4 months ago)
- Last Synced: 2025-11-22T03:19:25.573Z (4 months ago)
- Topics: llm4code, vulnerability
- Language: Python
- Homepage: https://arise-lab.github.io/cweval-bench
- Size: 8.32 MB
- Stars: 28
- Watchers: 2
- Forks: 4
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-rainmana - Co1lin/CWEval - Simultaneous evaluation on both functionality and security of LLM-generated code. (Python)
- Awesome-AI-Security - CWEval - simultaneous functionality+security evaluation with secure/functional oracles; Dockerized runner. [arXiv](https://arxiv.org/abs/2501.08200) ([↑](#table-of-contents)Benchmarks <a name="benchmarking"></a> / **Code Security**)