Faster algorithms for learning to link, align sequences, and price two-part tariffs

04/07/2022
by   Maria-Florina Balcan, et al.

Data-driven algorithm configuration is a promising, learning-based approach for beyond worst-case analysis of algorithms with tunable parameters. An important open problem is the design of efficient data-driven algorithms for algorithm families with more than one parameter. In this work we provide algorithms for efficient (output-polynomial) multidimensional parameter tuning, i.e., for families with a small constant number of parameters, for three very different combinatorial problems: linkage-based clustering, dynamic programming for sequence alignment, and auction design for two-part tariff schemes. We extend the single-parameter clustering algorithm of Balcan et al. (2020, arXiv:1907.00533) to multiple parameters and to the sequence alignment problem by proposing an execution graph, which compactly represents all the states the algorithm could attain for all possible parameter values. A key problem-specific challenge is to efficiently compute how the partition of the parameter space (into regions with unique algorithmic states) changes with a single algorithmic step. We give algorithms that improve on the runtime of the best previously known results for linkage-based clustering, sequence alignment, and two-part tariff pricing.
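
To make the idea of a parameter space partitioned into regions more concrete, here is a minimal Python sketch for the two-part tariff setting. It is not the paper's algorithm. It assumes the usual model, which the abstract does not spell out: the seller charges an up-front fee p1 plus a per-unit price p2, and a buyer with valuation values[k-1] for k units buys the quantity that maximizes utility, or nothing if no quantity gives positive utility. The function names, the tie-breaking rule, and the sample valuations are hypothetical.

```python
from typing import Sequence


def buyer_choice(values: Sequence[float], p1: float, p2: float) -> int:
    """Return the number of units the buyer purchases (0 means no purchase).

    Assumed model: utility of buying k units is values[k-1] - p1 - k * p2.
    Ties are broken toward fewer units (an arbitrary choice for this sketch).
    """
    best_k, best_utility = 0, 0.0
    for k, value in enumerate(values, start=1):
        utility = value - p1 - k * p2
        if utility > best_utility:
            best_k, best_utility = k, utility
    return best_k


def revenue(values: Sequence[float], p1: float, p2: float) -> float:
    """Seller revenue from one buyer under the tariff (p1, p2)."""
    k = buyer_choice(values, p1, p2)
    return p1 + k * p2 if k > 0 else 0.0


# Hypothetical buyer: values 1, 2, and 3 units at 5, 8, and 9 respectively.
values = [5.0, 8.0, 9.0]
for p1, p2 in [(1.0, 2.0), (1.0, 3.5), (6.0, 1.0)]:
    print((p1, p2), buyer_choice(values, p1, p2), revenue(values, p1, p2))
```

Under this assumed model, the purchased quantity is constant inside each region of the (p1, p2) plane bounded by the lines where two quantities give equal utility or where utility reaches zero, so revenue is linear within each region. Tuning both parameters then amounts to enumerating those regions, which is the kind of partition the abstract refers to.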

research
11/14/2020

Data-driven Algorithm Design

Data driven algorithm design is an important aspect of modern data scien...
research
11/14/2016

Learning-Theoretic Foundations of Algorithm Configuration for Combinatorial Partitioning Problems

Max-cut, clustering, and many other partitioning problems that are of si...
research
09/19/2018

Data-Driven Clustering via Parameterized Lloyd's Families

Algorithms for clustering points in metric spaces is a long-studied area...
research
05/26/2019

Learning to Optimize Computational Resources: Frugal Training with Generalization Guarantees

Algorithms typically come with tunable parameters that have a considerab...
research
12/12/2019

Robust Data-driven Profile-based Pricing Schemes

To enable an efficient electricity market, a good pricing scheme is of v...
research
04/29/2023

Maximum Match Subsequence Alignment Algorithm Finely Grained (MMSAA FG)

Sequence alignment is common nowadays as it is used in many fields to de...
research
01/26/2021

Unsupervised clustering of series using dynamic programming and neural processes

Following the work of arXiv:2101.09512, we are interested in clustering ...
