Visually grounded planning benchmark for multimodal agents