Learning Restricted Regular Expressions with Interleaving

04/30/2019
by Chunmei Dong, et al.

The presence of a schema offers numerous advantages for XML documents. In practice, however, many XML documents are not accompanied by a schema, or the schema they come with is not valid. Relax NG is a popular and powerful schema language that supports an unconstrained interleaving operator. Focusing on the inference of Relax NG schemas, we propose a new subclass of regular expressions with interleaving and design a polynomial-time inference algorithm for it. We then conduct a series of experiments on large-scale real data and on three XML data corpora. The experimental results show that our subclass is more practical than previous ones, and that the regular expressions inferred by our algorithm are more precise.
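To make the interleaving operator concrete, the following is a minimal sketch of its standard definition on two fixed words: the interleaving of u and v is the set of all words that merge the symbols of u and v while preserving the relative order within each. The function names `interleave` and `in_interleaving` and the element names are illustrative assumptions; this is not the inference algorithm proposed in the paper.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def interleave(u: tuple, v: tuple) -> frozenset:
    """Return every word obtained by shuffling u and v.

    The relative order of symbols inside u and inside v is preserved.
    Hypothetical helper for illustration only.
    """
    if not u:
        return frozenset({v})
    if not v:
        return frozenset({u})
    # Either the next symbol comes from u, or it comes from v.
    from_u = {(u[0],) + w for w in interleave(u[1:], v)}
    from_v = {(v[0],) + w for w in interleave(u, v[1:])}
    return frozenset(from_u | from_v)


def in_interleaving(word, u, v) -> bool:
    """Check whether `word` belongs to the interleaving of u and v."""
    return tuple(word) in interleave(tuple(u), tuple(v))


if __name__ == "__main__":
    # Element names are made up for the example.
    for w in sorted(interleave(("title", "author"), ("year",))):
        print(" ".join(w))
```

In a schema such as Relax NG, this corresponds to child elements that may appear in any mutual order, for example a `year` element placed before, between, or after `title` and `author`, while `title` still precedes `author`.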


