Screen Parsing: Towards Reverse Engineering of UI Models from Screenshots

09/17/2021
by   Jason Wu, et al.
0

Automated understanding of user interfaces (UIs) from their pixels can improve accessibility, enable task automation, and facilitate interface design without relying on developers to comprehensively provide metadata. A first step is to infer what UI elements exist on a screen, but current approaches are limited in how they infer how those elements are semantically grouped into structured interface definitions. In this paper, we motivate the problem of screen parsing, the task of predicting UI elements and their relationships from a screenshot. We describe our implementation of screen parsing and provide an effective training procedure that optimizes its performance. In an evaluation comparing the accuracy of the generated output, we find that our implementation significantly outperforms current systems (up to 23 example applications that are facilitated by screen parsing: (i) UI similarity search, (ii) accessibility enhancement, and (iii) code generation from UI screenshots.

READ FULL TEXT

page 1

page 6

page 9

page 10

page 13

research
01/20/2023

Screen Correspondence: Mapping Interchangeable Elements between UIs

Understanding user interface (UI) functionality is a useful yet challeng...
research
07/08/2019

parboiled2: a macro-based approach for effective generators of parsing expressions grammars in Scala

In today's computerized world, parsing is ubiquitous. Developers parse l...
research
07/25/2023

Holistic Exploration on Universal Decompositional Semantic Parsing: Architecture, Data Augmentation, and LLM Paradigm

In this paper, we conduct a holistic exploration of the Universal Decomp...
research
11/08/2022

Strictly Breadth-First AMR Parsing

AMR parsing is the task that maps a sentence to an AMR semantic graph au...
research
06/15/2021

Reverse Engineering of Generative Models: Inferring Model Hyperparameters from Generated Images

State-of-the-art (SOTA) Generative Models (GMs) can synthesize photo-rea...
research
06/17/2009

Personal applications, based on moveable / resizable elements

All the modern day applications have the interface, absolutely defined b...
research
05/01/2019

Context-Dependent Semantic Parsing over Temporally Structured Data

We describe a new semantic parsing setting that allows users to query th...

Please sign up or login with your details

Forgot password? Click here to reset