Computing Whittle (and Gittins) Index in Subcubic Time

03/10/2022
by   Nicolas Gast, et al.

Whittle index is a generalization of Gittins index that provides very efficient allocation rules for restless multiarmed bandits. In this paper, we develop an algorithm to test the indexability and compute the Whittle indices of any finite-state Markovian bandit problem. This algorithm works in the discounted and non-discounted cases. As a byproduct, it can also be used to compute Gittins index. Our algorithm builds on three tools: (1) a careful characterization of Whittle index that allows one to compute recursively the $k$th smallest index from the $(k-1)$th smallest, and to test indexability, (2) the use of Sherman-Morrison formula to make this recursive computation efficient, and (3) a sporadic use of fast matrix inversion and multiplication to obtain a subcubic complexity. We show that an efficient use of the Sherman-Morrison formula leads to an algorithm that computes Whittle index in $(2/3)n^3 + o(n^3)$ arithmetic operations, where $n$ is the number of states of the arm. The careful use of fast matrix multiplication leads to the first subcubic algorithm to compute Whittle (or Gittins) index. By using the current fastest matrix multiplications, our algorithm runs in $O(n^{2.5286})$. We also conduct a series of experiments that demonstrate that our algorithm is very efficient in practice and can compute indices of Markov chains with several thousands of states in a few seconds.
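
To illustrate why the Sherman-Morrison formula helps here, the following is a minimal pure-Python sketch of a generic rank-one inverse update. It is not the authors' algorithm: the function name `sherman_morrison_update` and the toy matrices are assumptions made for illustration only. Given the inverse of a matrix A, it returns the inverse of A + u vᵀ in a quadratic number of operations, instead of re-inverting from scratch at cubic cost.

```python
def sherman_morrison_update(a_inv, u, v):
    """Return the inverse of (A + u v^T), given a_inv = A^{-1}.

    Uses the Sherman-Morrison identity:
        (A + u v^T)^{-1} = A^{-1} - (A^{-1} u)(v^T A^{-1}) / (1 + v^T A^{-1} u)
    Matrices are lists of rows; vectors are lists of floats.
    """
    n = len(a_inv)
    # Column vector A^{-1} u.
    a_inv_u = [sum(a_inv[i][j] * u[j] for j in range(n)) for i in range(n)]
    # Row vector v^T A^{-1}.
    v_a_inv = [sum(v[i] * a_inv[i][j] for i in range(n)) for j in range(n)]
    # Scalar denominator 1 + v^T A^{-1} u; zero means the update is singular.
    denom = 1.0 + sum(v[i] * a_inv_u[i] for i in range(n))
    if abs(denom) < 1e-12:
        raise ValueError("rank-one update makes the matrix singular")
    return [
        [a_inv[i][j] - a_inv_u[i] * v_a_inv[j] / denom for j in range(n)]
        for i in range(n)
    ]


# Toy example (hypothetical data): A = [[2, 0], [0, 4]], so A^{-1} is diagonal.
a_inv = [[0.5, 0.0], [0.0, 0.25]]
# Adding u v^T with u = e_1, v = e_2 turns A into [[2, 1], [0, 4]].
updated_inv = sherman_morrison_update(a_inv, [1.0, 0.0], [0.0, 1.0])
```

In this toy case the updated inverse is that of [[2, 1], [0, 4]]. A recursive scheme that changes one row or column of a matrix per step can reuse the previous inverse in this way, which is the general idea behind tool (2) in the abstract.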

research
01/08/2019

Fast generalized DFTs for all finite groups

For any finite group G, we give an arithmetic algorithm to compute gener...
research
01/03/2019

On Fast Matrix Inversion via Fast Matrix Multiplication

Volker Strassen first suggested an algorithm to multiply matrices with w...
research
07/20/2020

Solving Sparse Linear Systems Faster than Matrix Multiplication

Can linear systems be solved faster than matrix multiplication? While th...
research
10/05/2021

NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL

Whittle index policy is a powerful tool to obtain asymptotically optimal...
research
03/27/2019

The minimum value of the Colless index

The Colless index is one of the oldest and most widely used balance indi...
research
05/17/2019

Randomization of Approximate Bilinear Computation for Matrix Multiplication

We present a method for randomizing a formula for bilinear computation o...
research
09/13/2021

Computation of the nearest structured matrix triplet with common null space

We study computational methods for computing the distance to singularity...
