GUIDO: A Hybrid Approach to Guideline Discovery Ordering from Natural Language Texts

07/19/2023
by   Nils Freyer, et al.
0

Extracting workflow nets from textual descriptions can be used to simplify guidelines or formalize textual descriptions of formal processes like business processes and algorithms. The task of manually extracting processes, however, requires domain expertise and effort. While automatic process model extraction is desirable, annotating texts with formalized process models is expensive. Therefore, there are only a few machine-learning-based extraction approaches. Rule-based approaches, in turn, require domain specificity to work well and can rarely distinguish relevant and irrelevant information in textual descriptions. In this paper, we present GUIDO, a hybrid approach to the process model extraction task that first, classifies sentences regarding their relevance to the process model, using a BERT-based sentence classifier, and second, extracts a process model from the sentences classified as relevant, using dependency parsing. The presented approach achieves significantly better results than a pure rule-based approach. GUIDO achieves an average behavioral similarity score of 0.93. Still, in comparison to purely machine-learning-based approaches, the annotation costs stay low.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2023

Beyond Rule-based Named Entity Recognition and Relation Extraction for Process Model Generation from Natural Language Text

Automated generation of business process models from natural language te...
research
03/09/2022

PET: A new Dataset for Process Extraction from Natural Language Text

Although there is a long tradition of work in NLP on extracting entities...
research
06/30/2016

SnapToGrid: From Statistical to Interpretable Models for Biomedical Information Extraction

We propose an approach for biomedical information extraction that marrie...
research
02/24/2020

A Hybrid Approach to Dependency Parsing: Combining Rules and Morphology with Deep Learning

Fully data-driven, deep learning-based models are usually designed as la...
research
09/24/2015

Description of the Odin Event Extraction Framework and Rule Language

This document describes the Odin framework, which is a domain-independen...
research
05/24/2021

Augmenting Modelers with Semantic Autocompletion of Processes

Business process modelers need to have expertise and knowledge of the do...
research
10/24/2021

Automated Extraction of Sentencing Decisions from Court Cases in the Hebrew Language

We present the task of Automated Punishment Extraction (APE) in sentenci...

Please sign up or login with your details

Forgot password? Click here to reset