awesome-rlhf
Lists of datasets, training, and evals for RLHF and similar
https://github.com/wassname/awesome-rlhf
- Askell et al
- OpenAssistant Conversations Dataset - 04-12
- Stanford human preferences - a dataset of instructions inferred from high-quality subreddits. 300k rows. 2023-02-23 [tweet](https://twitter.com/ethayarajh/status/1628442009500524544/photo/1)
- HH-RLHF - Anthropic RLHF
- allenai/natural-instructions
- hendrycks/ethics
- SHP
- paper
- `alpaca_data_cleaned.json`
- OIG-small-chip2
- shareGPT
- unnatural-instructions
- Open Instruction Generalist Dataset
- OIG
- GitHub instruction-tuning tag
- RLHF
- Pretraining Language Models with Human Preferences
- Hindsight Instruction Relabeling
- huggingface/evaluate
- EleutherAI/lm-evaluation-harness - has lots of datasets like GLUE and ETHICS already included, works with huggingface
- openai/evals: Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. - has lots of rare eval sets like sarcasm, works with langchain
- stanford-crfm/helm: Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).