SO-Bench release for evaluating visual structured output capabilities of multimodal LLMs. - View it on GitHub
Star
6
Rank
2212993