google-research-datasets/QAmeleon

google-research-datasets

Fetched on 2026/03/01 20:06

QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning PaLM with only five examples per language. We use the synthetic data to finetune downstream QA models leading to improved accuracy in comparison to English-only and translation-based baselines. - View it on GitHub

Star

Rank

668542

google-research-datasets

google-research-datasets / QAmeleon