Visually grounded planning benchmark for multimodal agents - View it on GitHub
Star
5
Rank
2477888