Mobile User Interface Element Detection Via Adaptively Prompt Tuning

05/16/2023
by   Zhangxuan Gu, et al.
0

Recent object detection approaches rely on pretrained vision-language models for image-text alignment. However, they fail to detect the Mobile User Interface (MUI) element since it contains additional OCR information, which describes its content and function but is often ignored. In this paper, we develop a new MUI element detection dataset named MUI-zh and propose an Adaptively Prompt Tuning (APT) module to take advantage of discriminating OCR information. APT is a lightweight and effective module to jointly optimize category prompts across different modalities. For every element, APT uniformly encodes its visual features and OCR descriptions to dynamically adjust the representation of frozen category prompts. We evaluate the effectiveness of our plug-and-play APT upon several existing CLIP-based detectors for both standard and open-vocabulary MUI element detection. Extensive experiments show that our method achieves considerable improvements on two datasets. The datasets is available at <github.com/antmachineintelligence/MUI-zh>.

READ FULL TEXT

page 4

page 8

research
09/18/2023

Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection

Point cloud-based open-vocabulary 3D object detection aims to detect 3D ...
research
03/10/2023

Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection

Open-vocabulary object detection aims to provide object detectors traine...
research
03/28/2022

Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model

Recently, vision-language pre-training shows great potential in open-voc...
research
08/12/2020

Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?

Detecting Graphical User Interface (GUI) elements in GUI images is a dom...
research
05/07/2023

Context-Aware Chart Element Detection

As a prerequisite of chart data extraction, the accurate detection of ch...
research
08/19/2018

Dynamic simulations in SixTrack

The DYNK module allows element settings in SixTrack to be changed on a t...

Please sign up or login with your details

Forgot password? Click here to reset