Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/paxnea/genai-cohere-testing
Notebooks for testing Cohere's Command-series LLMs.
https://github.com/paxnea/genai-cohere-testing
cohere cohere-ai command command-r generative-ai prompt-engineering
Last synced: 1 day ago
JSON representation
Notebooks for testing Cohere's Command-series LLMs.
- Host: GitHub
- URL: https://github.com/paxnea/genai-cohere-testing
- Owner: paxnea
- License: mit
- Created: 2024-05-10T18:35:33.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-05-10T19:29:06.000Z (8 months ago)
- Last Synced: 2025-01-02T00:09:49.627Z (1 day ago)
- Topics: cohere, cohere-ai, command, command-r, generative-ai, prompt-engineering
- Language: Jupyter Notebook
- Homepage:
- Size: 5.3 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# genAI-cohere-testing
Notebooks for testing Cohere's Command-series LLMs.## Length Testing
Testing Command and Command-R compliance with prompt-enforced length control of outputs.### Methods
A list of five synonym phrases for maximum limitation restrictions (i.e. phrases that
restricted Command’s output to x characters or less) was developed based on general use
popularity and is as follows:```['at most', 'no more than', 'less than', 'maximum', 'under']```
Then the following template was used for all prompts:
```What is gravity? Your explanation should be ____ 150 words.```
where ____ is replaced by each individual limitation phrasing.
### Results
API calls are sent to Command (1000 completions) and Command-R (100 completions)
models using default preamble and temperature parameters. The resulting histograms of word
counts are shown below, with highlighted bins corresponding to the acceptable completions.
| Command: 1000 completions | Command-R: 100 completions |
|---|---|
|![max_limit](https://github.com/paxnea/genAI-cohere-testing/assets/133911288/0dd0f38a-fc79-44af-b991-cf612de89e91)|![max_limit](https://github.com/paxnea/genAI-cohere-testing/assets/133911288/3bbc13b1-293f-4433-984c-6af9f5de43c2)|Further mini-tests of 100 completions each were also done on lists of five synonym phrases for
minimum, approximate, and exact limitation retrictions on Command and Command-R models.
Results can be found in their corresponding folders.