Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections

10/07/2022
by   Roberto Arroyo, et al.
0

Deep Learning (DL) is dominating the fields of Natural Language Processing (NLP) and Computer Vision (CV) in the recent times. However, DL commonly relies on the availability of large data annotations, so other alternative or complementary pattern-based techniques can help to improve results. In this paper, we build upon Key Information Extraction (KIE) in purchase documents using both DL and rule-based corrections. Our system initially trusts on Optical Character Recognition (OCR) and text understanding based on entity tagging to identify purchase facts of interest (e.g., product codes, descriptions, quantities, or prices). These facts are then linked to a same product group, which is recognized by means of line detection and some grouping heuristics. Once these DL approaches are processed, we contribute several mechanisms consisting of rule-based corrections for improving the baseline DL predictions. We prove the enhancements provided by these rule-based corrections over the baseline DL results in the presented experiments for purchase documents from public and NielsenIQ datasets.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 7

research
09/21/2023

Improving VTE Identification through Adaptive NLP Model Selection and Clinical Expert Rule-based Classifier from Radiology Reports

Rapid and accurate identification of Venous thromboembolism (VTE), a sev...
research
05/06/2023

Beyond Rule-based Named Entity Recognition and Relation Extraction for Process Model Generation from Natural Language Text

Automated generation of business process models from natural language te...
research
04/13/2022

Experimental Standards for Deep Learning Research: A Natural Language Processing Perspective

The field of Deep Learning (DL) has undergone explosive growth during th...
research
07/17/2018

Developing a Portable Natural Language Processing Based Phenotyping System

This paper presents a portable phenotyping system that is capable of int...
research
01/17/2023

An Empirical Study of Deep Learning Sentiment Detection Tools for Software Engineering in Cross-Platform Settings

Sentiment detection in software engineering (SE) has shown promise to su...
research
01/29/2022

Information Extraction through AI techniques: The KIDs use case at CONSOB

In this paper we report on the initial activities carried out within a c...
research
10/02/2017

DeepER -- Deep Entity Resolution

Entity Resolution (ER) is a fundamental problem with many applications. ...

Please sign up or login with your details

Forgot password? Click here to reset