Object-oriented Neural Programming (OONP) for Document Understanding

09/26/2017
by   Zhengdong Lu, et al.
0

We propose Object-oriented Neural Programming (OONP), a framework for semantically parsing documents in specific domains. Basically, OONP reads a document and parses it into a predesigned object-oriented data structure (referred to as ontology in this paper) that reflects the domain-specific semantics of the document. An OONP parser models semantic parsing as a decision process: a neural net-based Reader sequentially goes through the document, and during the process it builds and updates an intermediate ontology to summarize its partial understanding of the text it covers. OONP supports a rich family of operations (both symbolic and differentiable) for composing the ontology, and a big variety of forms (both symbolic and differentiable) for representing the state and the document. An OONP parser can be trained with supervision of different forms and strength, including supervised learning (SL) , reinforcement learning (RL) and hybrid of the two. Our experiments on both synthetic and real-world document parsing tasks have shown that OONP can learn to handle fairly complicated ontology with training data of modest sizes.

READ FULL TEXT
research
05/01/2020

Spatial Dependency Parsing for 2D Document Understanding

Information Extraction (IE) for document images is often approached as a...
research
12/25/2018

Building a Neural Semantic Parser from a Domain Ontology

Semantic parsing is the task of converting natural language utterances i...
research
07/24/2018

Understanding and representing the semantics of large structured documents

Understanding large, structured documents like scholarly articles, reque...
research
12/04/2016

Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision (Short Version)

Extending the success of deep neural networks to natural language unders...
research
05/25/2017

Learning Structured Text Representations

In this paper, we focus on learning structure-aware document representat...
research
04/15/2021

Low-Resource Task-Oriented Semantic Parsing via Intrinsic Modeling

Task-oriented semantic parsing models typically have high resource requi...
research
08/31/2022

GRILLBot: An Assistant for Real-World Tasks with Neural Semantic Parsing and Graph-Based Representations

GRILLBot is the winning system in the 2022 Alexa Prize TaskBot Challenge...

Please sign up or login with your details

Forgot password? Click here to reset