Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/danielrosehill/llm-evaluation-prompts

A few prompts that I am storing in a repo for the purpose of running controlled experiments comparing and benchmarking different LLMs for defined use-cases
https://github.com/danielrosehill/llm-evaluation-prompts

Last synced: 6 days ago
JSON representation

A few prompts that I am storing in a repo for the purpose of running controlled experiments comparing and benchmarking different LLMs for defined use-cases

Host: GitHub
URL: https://github.com/danielrosehill/llm-evaluation-prompts
Owner: danielrosehill
Created: 2024-10-13T14:51:06.000Z (about 1 month ago)
Default Branch: main
Last Pushed: 2024-11-11T16:28:13.000Z (7 days ago)
Last Synced: 2024-11-11T16:30:40.097Z (7 days ago)
Language: Python
Size: 169 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# My LLM Evaluation Prompt Repository

## Purpose
The purpose of this repository is to gather together a curated collection of prompts for conducting controlled experiments aimed at **comparing and evaluating the efficacy of different large language models (LLMs)**, including fine-tuned models, for specific use-cases. These prompts are designed to facilitate targeted testing and help determine which models are most effective in performing certain tasks.

## Structure

`Eval-Prompts` - Prompts I've earmarked for evaluation purposes.
`Old-Prompts` - Real prompts that I've previously run for generations (in some cases edited for PII-removal). Many of these are not that useful and repetitive. But they're reflective of the type of casual prompting I commonly use so I use them as "pipeline" candidates for future evaluations.

## Use Cases
The prompts included in this repository reflect some of the **specific use-cases** I have identified in my work with LLMs. These use-cases are unique to my context and needs, so the prompts may not necessarily be applicable or useful outside of this context. However, I am open-sourcing them here in case they may prove helpful to others looking to run similar experiments or tests.

## Experiment Results
Please note that in order to keep the data separate, I **will not be including the actual results of the experiments** conducted with these prompts in this repository. This repository is focused solely on the prompts themselves, serving as a foundation for evaluation and comparison.

## How to Use
Feel free to clone or download this repository, and use the prompts for your own LLM evaluations. Each prompt is stored in markdown format, with additional metadata that may help you contextualize or adapt it for your own experiments.

## Licensing

This repository is licensed under CC-BY-4.0 (Attribution 4.0 International)
[License](https://creativecommons.org/licenses/by/4.0/)

### Summary of the License
The Creative Commons Attribution 4.0 International (CC BY 4.0) license allows others to:
- **Share**: Copy and redistribute the material in any medium or format.
- **Adapt**: Remix, transform, and build upon the material for any purpose, even commercially.

The licensor cannot revoke these freedoms as long as you follow the license terms.

#### License Terms
- **Attribution**: You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- **No additional restrictions**: You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.

For the full legal code, please visit the [Creative Commons website](https://creativecommons.org/licenses/by/4.0/legalcode).