An open API service indexing awesome lists of open source software.

https://github.com/paradite/eval-data

Prompts and evaluation data for LLMs on real world coding and writing tasks
https://github.com/paradite/eval-data

ai benchmark eval evaluation llm prompt prompt-engineering

Last synced: 4 days ago
JSON representation

Prompts and evaluation data for LLMs on real world coding and writing tasks

Awesome Lists containing this project

README

          

# Eval data

Evaluation data for LLMs and prompts on real world coding tasks and writing tasks.

Built by [16x Prompt](https://prompt.16x.engineer/) and [16x Eval](https://eval.16x.engineer/).

## Coding Projects

### Next.js

[emoji-todo](/projects/emoji-todo/) - A simple Next.js TODO app with emojis.

### SQL

[sql](/projects/sql/) - SQL code snippets.

### Python

[python-script](/projects/python-script/) - Python script code snippets.

### Benchmark Visualization

[visualization](/projects/visualization/) - Coding a visualization of benchmark results.

### TypeScript Narrowing

[typescript-narrowing](/projects/typescript-narrowing/) - Coding TypeScript narrowing tests.

### Clean Markdown

[clean-markdown](/projects/clean-markdown/) - Clean markdown into plain text.

### Clean MDX

[clean-mdx](/projects/clean-mdx/) - Clean MDX into plain text.

### Tailwind CSS Z-Index

[tailwindcss-v3-z-index](/projects/tailwindcss-v3-z-index/) - Fix invalid z-index classes in Tailwind CSS v3.

## Writing Projects

### AI Timeline

[ai-timeline](/projects/ai-timeline/) - Writing an AI Timeline.

## Image Analysis Projects

### Kanji

[kanji](/projects/kanji/) - Image analysis and explanation related to kanji and Chinese characters.

### Water bottle

[water-bottle](/projects/water-bottle/) - Image analysis and explanation related to water bottle and physics.

## Model Evaluation Results

[model-eval-results](/model-eval-results/) - Raw results exported from [16x Eval](https://eval.16x.engineer/) for models evaluations.

## 16x Eval

I am building a local desktop app to evaluate models and prompts.

See [16x Eval website](https://eval.16x.engineer/) for more information.