ProtoBandit: Efficient Prototype Selection via Multi-Armed Bandits

10/04/2022
by Arghya Roy Chaudhuri, et al.

In this work, we propose a multi-armed bandit based framework for identifying a compact set of informative data instances (i.e., the prototypes) that best represents a given target set. Prototypical examples of a dataset offer interpretable insights into the underlying data distribution and support example-based reasoning in human decision making. A key challenge is the large-scale setting, in which similarities must be computed for almost all pairs of data points. We overcome this limitation by combining stochastic greedy search over the space of prototypical examples with a multi-armed bandit approach that reduces the number of similarity comparisons. A salient feature of the proposed approach is that the total number of similarity comparisons needed is independent of the size of the target set. Empirically, we observe that our proposed approach, ProtoBandit, reduces the total number of similarity computation calls by two to three orders of magnitude (100 to 1000 times) while obtaining solutions similar in quality to those from existing state-of-the-art approaches.
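The combination described in the abstract can be illustrated with a small Python sketch. This is not the authors' algorithm: the abstract does not state the objective or the bandit routine, so the sketch assumes a facility-location style coverage gain, non-negative similarities, uniform subsampling of candidates at each greedy step, and sequential halving as the bandit procedure. The function name `select_prototypes` and the parameters `n_arms`, `batch`, and `seed` are hypothetical.

```python
import random


def select_prototypes(candidates, targets, k, similarity,
                      n_arms=8, batch=16, seed=0):
    """Pick k candidate indices; returns (selected, n_similarity_calls).

    Assumptions (not from the paper): similarity is non-negative and the
    objective is the sum over targets of the best similarity to a prototype.
    """
    rng = random.Random(seed)
    calls = 0
    selected = []
    cover = {}  # target index -> best similarity to any selected prototype

    def sim(ci, ti):
        nonlocal calls
        calls += 1
        return similarity(candidates[ci], targets[ti])

    def gain(ci, ti):
        # Marginal coverage gain of candidate ci on one sampled target ti.
        base = cover.get(ti)
        if base is None:
            base = max((sim(p, ti) for p in selected), default=0.0)
            cover[ti] = base
        return max(0.0, sim(ci, ti) - base)

    for _ in range(k):
        remaining = [i for i in range(len(candidates)) if i not in selected]
        if not remaining:
            break
        # Stochastic greedy step: only a random subset of candidates competes.
        arms = rng.sample(remaining, min(n_arms, len(remaining)))
        totals = {a: 0.0 for a in arms}
        # Sequential halving: pull every surviving arm on a fresh batch of
        # sampled targets, then keep the better half.
        while len(arms) > 1:
            batch_targets = [rng.randrange(len(targets)) for _ in range(batch)]
            for a in arms:
                totals[a] += sum(gain(a, t) for t in batch_targets)
            arms.sort(key=lambda a: totals[a], reverse=True)
            arms = arms[:(len(arms) + 1) // 2]
        best = arms[0]
        selected.append(best)
        # Refresh cached coverage only for targets that were already sampled.
        for t in list(cover):
            cover[t] = max(cover[t], sim(best, t))
    return selected, calls
```

In this sketch the number of similarity calls is governed by `k`, `n_arms`, and `batch` rather than by `len(targets)`, because targets are only ever touched through sampling. A call such as `select_prototypes(points, points, 5, my_similarity)` would return five candidate indices together with the number of similarity evaluations spent, where `points` and `my_similarity` stand for the user's data and similarity function.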

research
06/29/2020

Multi-armed bandit approach to password guessing

The multi-armed bandit is a mathematical interpretation of the problem a...
research
12/23/2015

Satisficing in multi-armed bandit problems

Satisficing is a relaxation of maximizing and allows for less risky deci...
research
06/11/2023

Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering

In this work, we study multi-source test-time model adaptation from user...
research
06/05/2021

Differentially Private Multi-Armed Bandits in the Shuffle Model

We give an (ε,δ)-differentially private algorithm for the multi-armed ba...
research
06/11/2019

Ultra Fast Medoid Identification via Correlated Sequential Halving

The medoid of a set of n points is the point in the set that minimizes t...
research
06/11/2020

Bandit-PAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits

Clustering is a ubiquitous task in data science. Compared to the commonl...
research
11/09/2020

Adaptive Learning of Rank-One Models for Efficient Pairwise Sequence Alignment

Pairwise alignment of DNA sequencing data is a ubiquitous task in bioinf...
