TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages

05/13/2022
by   Zihan Zhao, et al.
4

Recently, the structural reading comprehension (SRC) task on web pages has attracted increasing research interests. Although previous SRC work has leveraged extra information such as HTML tags or XPaths, the informative topology of web pages is not effectively exploited. In this work, we propose a Topological Information Enhanced model (TIE), which transforms the token-level task into a tag-level task by introducing a two-stage process (i.e. node locating and answer refining). Based on that, TIE integrates Graph Attention Network (GAT) and Pre-trained Language Model (PLM) to leverage the topological information of both logical structures and spatial structures. Experimental results demonstrate that our model outperforms strong baselines and achieves state-of-the-art performances on the web-based SRC benchmark WebSRC at the time of writing. The code of TIE will be publicly available at https://github.com/X-LANCE/TIE.

READ FULL TEXT

page 4

page 8

page 13

page 14

research
01/23/2021

WebSRC: A Dataset for Web-Based Structural Reading Comprehension

Web search is an essential way for human to obtain information, but it's...
research
11/09/2020

Synonym Knowledge Enhanced Reader for Chinese Idiom Reading Comprehension

Machine reading comprehension (MRC) is the task that asks a machine to a...
research
05/10/2021

ExpMRC: Explainability Evaluation for Machine Reading Comprehension

Achieving human-level performance on some of Machine Reading Comprehensi...
research
03/30/2021

XRJL-HKUST at SemEval-2021 Task 4: WordNet-Enhanced Dual Multi-head Co-Attention for Reading Comprehension of Abstract Meaning

This paper presents our submitted system to SemEval 2021 Task 4: Reading...
research
03/26/2022

Lite Unified Modeling for Discriminative Reading Comprehension

As a broad and major category in machine reading comprehension (MRC), th...
research
07/15/2021

Automatic Task Requirements Writing Evaluation via Machine Reading Comprehension

Task requirements (TRs) writing is an important question type in Key Eng...
research
12/22/2022

Generative Colorization of Structured Mobile Web Pages

Color is a critical design factor for web pages, affecting important fac...

Please sign up or login with your details

Forgot password? Click here to reset