Diversity maximization in doubling metrics

09/25/2018
by Alfonso Cevallos, et al.

Diversity maximization is an important geometric optimization problem with many applications in recommender systems, machine learning, and search engines, among others. A typical diversification problem is as follows: given a finite metric space (X, d) and a parameter k ∈ ℕ, find a subset of k elements of X that has maximum diversity. There are many functions that measure diversity. One of the most popular measures, called remote-clique, is the sum of the pairwise distances of the chosen elements. In this paper, we present novel results on three widely used diversity measures: remote-clique, remote-star, and remote-bipartition. Our main results are polynomial-time approximation schemes (PTAS) for these three diversification problems under the assumption that the metric space is doubling. This setting has been discussed in the recent literature; the existence of such a PTAS, however, was left open. Our results also hold in the setting where the distances are raised to a fixed power q ≥ 1, giving rise to more variants of diversity functions, similar in spirit to the variations of clustering problems depending on the power applied to the distances. Finally, we provide a proof of NP-hardness for remote-clique with squared distances in doubling metric spaces.
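
To make the remote-clique objective concrete, here is a minimal Python sketch. It is not the PTAS from the paper: it is an exhaustive search that only illustrates the problem statement on tiny instances. The function names (`remote_clique`, `brute_force_max_diversity`), the Euclidean distance, and the sample points are assumptions chosen for illustration.

```python
from itertools import combinations
import math


def remote_clique(subset, dist, q=1.0):
    """Sum of pairwise distances of the chosen elements, each raised to the power q."""
    return sum(dist(a, b) ** q for a, b in combinations(subset, 2))


def brute_force_max_diversity(X, k, dist, q=1.0):
    """Try every k-subset of X and keep one with maximum remote-clique value.

    Illustrative only: this enumerates all C(|X|, k) subsets, so it is
    exponential in k and unrelated to the approximation scheme in the paper.
    """
    best = max(combinations(X, k), key=lambda S: remote_clique(S, dist, q))
    return list(best), remote_clique(best, dist, q)


# Hypothetical sample data: four points on the real line, Euclidean distance.
X = [(0.0,), (1.0,), (2.0,), (10.0,)]

pair, pair_value = brute_force_max_diversity(X, 2, math.dist)
triple_sq, triple_sq_value = brute_force_max_diversity(X, 3, math.dist, q=2.0)
```

With q = 2 the objective uses squared distances, the variant for which the abstract states NP-hardness in doubling metric spaces, so an exhaustive search like this one is only practical for very small inputs.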


research
07/10/2023

Improved Diversity Maximization Algorithms for Matching and Pseudoforest

In this work we consider the diversity maximization problem, where given...
research
07/30/2022

Streaming Algorithms for Diversity Maximization with Fairness Constraints

Diversity maximization is a fundamental problem with wide applications i...
research
08/14/2020

A polynomial algorithm for the maximum clique

In this paper, we present a polynomial-time algorithm for the maximum cl...
research
02/08/2020

A General Coreset-Based Approach to Diversity Maximization under Matroid Constraints

Diversity maximization is a fundamental problem in web search and data m...
research
02/04/2018

Testing to distinguish measures on metric spaces

We study the problem of distinguishing between two distributions on a me...
research
02/15/2023

Fully dynamic clustering and diversity maximization in doubling metrics

We present approximation algorithms for some variants of center-based cl...
research
08/28/2020

Is the space complexity of planted clique recovery the same as that of detection?

We study the planted clique problem in which a clique of size k is plant...
