The Minimum Edit Arborescence Problem and Its Use in Compressing Graph Collections [Extended Version]

07/30/2021
by   Lucas Gnecco, et al.
0

The inference of minimum spanning arborescences within a set of objects is a general problem which translates into numerous application-specific unsupervised learning tasks. We introduce a unified and generic structure called edit arborescence that relies on edit paths between data in a collection, as well as the Min Edit Arborescence Problem, which asks for an edit arborescence that minimizes the sum of costs of its inner edit paths. Through the use of suitable cost functions, this generic framework allows to model a variety of problems. In particular, we show that by introducing encoding size preserving edit costs, it can be used as an efficient method for compressing collections of labeled graphs. Experiments on various graph datasets, with comparisons to standard compression tools, show the potential of our method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2023

EPIC: Graph Augmentation with Edit Path Interpolation via Learnable Cost

Graph-based models have become increasingly important in various domains...
research
11/03/2021

An Improved Algorithm for The k-Dyck Edit Distance Problem

A Dyck sequence is a sequence of opening and closing parentheses (of var...
research
10/01/2019

Compositional and Abstraction-Based Approach for Synthesis of Edit Functions for Opacity Enforcement

This paper develops a novel compositional and abstraction-based approach...
research
01/17/2018

Interactive in-base street model edit: how common GIS software and a database can serve as a custom Graphical User Interface

Our modern world produces an increasing quantity of data, and especially...
research
08/23/2016

Tracking Amendments to Legislation and Other Political Texts with a Novel Minimum-Edit-Distance Algorithm: DocuToads

Political scientists often find themselves tracking amendments to politi...
research
10/31/2018

Learning to Represent Edits

We introduce the problem of learning distributed representations of edit...
research
08/02/2021

Learning Domain-Specific Edit Operations from Model Repositories with Frequent Subgraph Mining

Model transformations play a fundamental role in model-driven software d...

Please sign up or login with your details

Forgot password? Click here to reset