Open clone of OpenAI's unreleased WebText dataset scraper. This version uses pushshift.io files instead of the API for speed. - View it on GitHub
Star
0
Rank
11399557