Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering

06/11/2023
by   Hai Ye, et al.
0

In this work, we study multi-source test-time model adaptation from user feedback, where K distinct models are established for adaptation. To allow efficient adaptation, we cast the problem as a stochastic decision-making process, aiming to determine the best adapted model after adaptation. We discuss two frameworks: multi-armed bandit learning and multi-armed dueling bandits. Compared to multi-armed bandit learning, the dueling framework allows pairwise collaboration among K models, which is solved by a novel method named Co-UCB proposed in this work. Experiments on six datasets of extractive question answering (QA) show that the dueling framework using Co-UCB is more effective than other strong baselines for our studied problem.

READ FULL TEXT
research
04/20/2023

Optimal Activation of Halting Multi-Armed Bandit Models

We study new types of dynamic allocation problems the Halting Bandit mod...
research
05/10/2021

Sense-Bandits: AI-based Adaptation of Sensing Thresholds for Heterogeneous-technology Coexistence Over Unlicensed Bands

In this paper, we present Sense-Bandits, an AI-based framework for distr...
research
10/02/2015

A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

Adaptive and sequential experiment design is a well-studied area in nume...
research
04/26/2022

Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling

As two popular schools of machine learning, online learning and evolutio...
research
10/04/2022

ProtoBandit: Efficient Prototype Selection via Multi-Armed Bandits

In this work, we propose a multi-armed bandit based framework for identi...
research
08/03/2017

Reinforcement learning techniques for Outer Loop Link Adaptation in 4G/5G systems

Wireless systems perform rate adaptation to transmit at highest possible...
research
08/17/2019

A Batched Multi-Armed Bandit Approach to News Headline Testing

Optimizing news headlines is important for publishers and media sites. A...

Please sign up or login with your details

Forgot password? Click here to reset