Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

awesome-rlhf

Lists of datasets, training, and evals for RLHF and similar
https://github.com/wassname/awesome-rlhf

Askell et al.
OpenAssistant Conversations Dataset - 04-12
Stanford Human Preferences - a dataset of instructions inferred from high-quality subreddits. 300k rows. 2023-02-23 [tweet](https://twitter.com/ethayarajh/status/1628442009500524544/photo/1)
HH-RLHF - Anthropic RLHF
allenai/natural-instructions
hendrycks/ethics
SHP
paper
`alpaca_data_cleaned.json`
OIG-small-chip2
shareGPT
unnatural-instructions
Open Instruction Generalist Dataset
OIG
GitHub instruction-tuning tag
RLHF
Pretraining Language Models with Human Preferences
Hindsight Instruction Relabeling
huggingface/evaluate
EleutherAI/lm-evaluation-harness - has lots of datasets like GLUE and ETHICS already included, works with huggingface
openai/evals: Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. - has lots of rare eval sets like sarcasm, works with langchain
stanford-crfm/helm: Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).

Keywords

ml-safety (1), machine-ethics (1), gpt-3 (1), ethical-ai (1), ai-safety (1), transformer (1), language-model (1), evaluation-framework (1), machine-learning (1), evaluation (1)