Technical Report: Match-reference regular expressions and lenses

12/08/2022
by   Jeanne-Marie Musca, et al.
0

A lens is a single program that specifies two data transformations at once: one transformation converts data from source format to target format and a second transformation inverts the process. Over the past decade, researchers have developed many different kinds of lenses with different properties. One class of such languages operate over regular languages. In other words, these lenses convert strings drawn from one regular language to strings drawn from another regular language (and back again). In this paper, we define a more powerful language of lenses, which we call match-reference lenses, that is capable of translating between non-regular formats that contain repeated substrings, which is a primitive form of dependency. To define the non-regular formats themselves, we develop a new language, match-reference regular expressions, which are regular expressions that can bind variables to substrings and use those substrings repeatedly. These match-reference regular expressions are closely related to the familiar “back-references" that can be found in traditional regular expression packages, but are redesigned to adhere to conventional programming language lexical scoping conventions and to interact smoothly with lens language infrastructure. We define the semantics of match-reference regular expressions and match-reference lenses. We also define a new kind of automaton, the match-reference regex automaton system (MRRAS), for deciding string membership in the language match-reference regular expressions. We illustrate our definitions with a variety of examples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/07/2017

Beyond ωBS-regular Languages: ωT-regular Expressions and Counter-Check Automata

In the last years, various extensions of ω-regular languages have been p...
research
12/20/2020

Structure-Constrained Process Graphs for the Process Semantics of Regular Expressions

Milner (1984) introduced a process semantics for regular expressions as ...
research
02/05/2018

Deterministic Regular Expressions With Back-References

Most modern libraries for regular expression matching allow back-referen...
research
07/12/2023

A Program That Simplifies Regular Expressions (Tool paper)

This paper presents the main features of a system that aims to transform...
research
10/10/2018

Sound Regular Expression Semantics for Dynamic Symbolic Execution of JavaScript

Existing support for regular expressions in automated test generation or...
research
10/30/2022

gMeta: Template-based Regular Expression Generation over Noisy Examples

Regular expressions (regexes) are widely used in different fields of com...
research
02/15/2021

MatchKAT: An Algebraic Foundation For Match-Action

We present MatchKAT, an algebraic language for modeling match-action pac...

Please sign up or login with your details

Forgot password? Click here to reset