A collection of downstream datasets and models geared towards testing performance of LLM downstream tasks. - View it on GitHub
Star
1
Rank
5416266