https://github.com/avi350751/test-llm-with-deepeval
A hands-on exploration of Deepeval — an open-source framework for evaluating and red-teaming large language models (LLMs). This repository documents my journey of testing, benchmarking, and improving LLM reliability using custom prompts, metrics, and pipelines.
https://github.com/avi350751/test-llm-with-deepeval
deepeval evals llmtesting
Last synced: about 4 hours ago
JSON representation
A hands-on exploration of Deepeval — an open-source framework for evaluating and red-teaming large language models (LLMs). This repository documents my journey of testing, benchmarking, and improving LLM reliability using custom prompts, metrics, and pipelines.
- Host: GitHub
- URL: https://github.com/avi350751/test-llm-with-deepeval
- Owner: avi350751
- Created: 2025-10-26T20:02:03.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2025-11-02T17:10:37.000Z (7 months ago)
- Last Synced: 2026-06-08T02:34:03.247Z (about 4 hours ago)
- Topics: deepeval, evals, llmtesting
- Language: Jupyter Notebook
- Homepage:
- Size: 84 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files: