Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/google-research-datasets/QAmeleon
QAmeleon introduces synthetic multilingual QA data generated with PaLM, a 540B-parameter large language model. The dataset was generated by prompt-tuning PaLM with only five examples per language. We use the synthetic data to fine-tune downstream QA models, which improves accuracy over English-only and translation-based baselines.
Last synced: 16 days ago
- Host: GitHub
- URL: https://github.com/google-research-datasets/QAmeleon
- Owner: google-research-datasets
- Created: 2023-07-05T16:04:02.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2023-08-15T17:19:06.000Z (10 months ago)
- Last Synced: 2024-02-23T15:36:00.704Z (4 months ago)
- Size: 2.93 KB
- Stars: 32
- Watchers: 3
- Forks: 5
- Open Issues: 2
Lists
- awesome-stars - google-research-datasets/QAmeleon - QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning PaLM with only five examples per language. We use the synthetic (Others)