google-research-datasets/paws

google-research-datasets

Fetched on 2026/03/01 20:06

This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification. - View it on GitHub

Star

563

Rank

73119

google-research-datasets

google-research-datasets / paws