Text-to-image generation is a crucial task in computer vision and NLP, but models still struggle to understand complex text prompts. To address this, we introduce the Logic-Rich Text-to-Image generation task and the Textual-Visual Logic dataset, which evaluate models on intricate text inputs with rich relational information.