Improved Diversity Maximization Algorithms for Matching and Pseudoforest

07/10/2023
by   Sepideh Mahabadi, et al.
0

In this work we consider the diversity maximization problem, where given a data set X of n elements, and a parameter k, the goal is to pick a subset of X of size k maximizing a certain diversity measure. [CH01] defined a variety of diversity measures based on pairwise distances between the points. A constant factor approximation algorithm was known for all those diversity measures except “remote-matching”, where only an O(log k) approximation was known. In this work we present an O(1) approximation for this remaining notion. Further, we consider these notions from the perpective of composable coresets. [IMMM14] provided composable coresets with a constant factor approximation for all but “remote-pseudoforest” and “remote-matching”, which again they only obtained a O(log k) approximation. Here we also close the gap up to constants and present a constant factor composable coreset algorithm for these two notions. For remote-matching, our coreset has size only O(k), and for remote-pseudoforest, our coreset has size O(k^1+ε) for any ε > 0, for an O(1/ε)-approximate coreset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/25/2018

Diversity maximization in doubling metrics

Diversity maximization is an important geometric optimization problem wi...
research
08/16/2022

Deletion Robust Non-Monotone Submodular Maximization over Matroids

Maximizing a submodular function is a fundamental task in machine learni...
research
05/16/2023

Diversity, Agreement, and Polarization in Elections

We consider the notions of agreement, diversity, and polarization in ord...
research
02/08/2020

A General Coreset-Based Approach to Diversity Maximization under Matroid Constraints

Diversity maximization is a fundamental problem in web search and data m...
research
02/25/2020

2-Dimensional Palindromes with k Mismatches

This paper extends the problem of 2-dimensional palindrome search into t...
research
06/15/2021

Learning-based Support Estimation in Sublinear Time

We consider the problem of estimating the number of distinct elements in...
research
08/29/2022

Fitting Metrics and Ultrametrics with Minimum Disagreements

Given x ∈ (ℝ_≥ 0)^[n]2 recording pairwise distances, the METRIC VIOLATIO...

Please sign up or login with your details

Forgot password? Click here to reset