Text-to-image generation is crucial in computer vision and NLP, but understanding complex text prompts remains challenging; to address this, we introduce the Logic-Rich Text-to-Image generation task and the Textual-Visual Logic dataset to evaluate models on intricate text inputs with rich relational information. -
View it on GitHub