microsoft/SmartPlay - Gitstar Ranking

Fetched on 2026/07/23 05:42

SmartPlay is a benchmark for Large Language Models (LLMs). It uses a variety of games to test important capabilities of LLMs as agents. SmartPlay is designed to be easy to use and to support future development of LLMs.

Star: 146

Rank: 230221
