google-research-datasets/screen_annotation

google-research-datasets

Fetched on 2025/01/09 12:01

The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and describe the UI elements present on the screen: their type, location, OCR text and a short description. It has been introduced in the paper `ScreenAI: A Vision-Language Model for UI and Infographics Understanding`. - View it on GitHub

Star

Rank

427689

google-research-datasets

google-research-datasets / screen_annotation