Approximating Happiness Maximizing Set Problems

02/06/2021
by   Phoomraphee Luenam, et al.
0

A Happiness Maximizing Set (HMS) is a useful concept in which a smaller subset of a database is selected while mostly preserving the best scores along every possible utility function. In this paper, we study the k-Happiness Maximizing Sets (k-HMS) and Average Happiness Maximizing Sets (AHMS) problems. Specifically, k-HMS selects r records from the database such that the minimum happiness ratio between the k-th best score in the database and the best score in the selected records for any possible utility function is maximized. Meanwhile, AHMS maximizes the average of this ratio within a distribution of utility functions. k-HMS and AHMS are equivalent to the more established k-Regret Minimizing Sets (k-RMS) and Average Regret Minimizing Sets (ARMS) problems, but allow for the derivation of stronger theoretical results and more natural approximation schemes. In this paper, we show that the problem of approximating k-HMS within any finite factor is NP-Hard when the dimensionality of the database is unconstrained and extend the result to an inapproximability proof of k-RMS. We then provide approximation algorithms for AHMS with better approximation ratios and time complexities than known algorithms for ARMS. Finally, we provide dataset reduction schemes which can be used to reduce the runtime of existing heuristic based algorithms, as well as to derive polynomial-time approximation schemes for both k-HMS and AHMS when dimensionality is fixed. Finally, we provide experimental validation showing that our AHMS algorithm achieves the same happiness as the existing Greedy Shrink FAM algorithm while running faster by over 2 orders of magnitude on even a small dataset of 17265 data points while our reduction scheme was able to reduce runtimes by up to 93 (from 4.2 hours to 16.7 minutes) while keeping happiness within 90% of the original on the largest tested settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2018

Finding Average Regret Ratio Minimizing Set in Database

Selecting a certain number of data points (or records) from a database w...
research
05/29/2020

A Fully Dynamic Algorithm for k-Regret Minimizing Sets

Selecting a small set of representatives from a large database is import...
research
06/26/2019

A Tight Analysis of Greedy Yields Subexponential Time Approximation for Uniform Decision Tree

Decision Tree is a classic formulation of active learning: given n hypot...
research
03/14/2020

Approximation Schemes for Subset Sum Ratio Problems

We consider the Subset Sum Ratio Problem (SSR), in which given a set of ...
research
05/31/2021

Optimal Algorithms for Multiwinner Elections and the Chamberlin-Courant Rule

We consider the algorithmic question of choosing a subset of candidates ...
research
08/13/2022

Happiness Maximizing Sets under Group Fairness Constraints (Technical Report)

Finding a happiness maximizing set (HMS) from a database, i.e., selectin...
research
07/19/2020

GRMR: Generalized Regret-Minimizing Representatives

Extracting a small subset of representative tuples from a large database...

Please sign up or login with your details

Forgot password? Click here to reset