Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
google-research-datasets
Fetched on 2024/04/30 22:40
google-research-datasets
/
swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 languages, generated using PaLM 2 and summarize-then-ask prompting. -
View it on GitHub
https://arxiv.org/abs/2311.05800
Star
41
Rank
486986