IEEE ISIT 2020 || Los Angeles, California, USA || 21-26 June 2020

Paper Detail

Paper ID

L.12.4

Paper Title

Detecting an Odd Restless Markov Arm with a Trembling Hand

Authors

PN Karthik, Rajesh Sundaresan, Indian Institute of Science, India

Session

L.12: Multi-Arm Bandits

Presentation

Lecture

Track

Statistics and Learning Theory

Manuscript

Click here to download the manuscript

Virtual Presentation

Click here to watch in the Virtual Symposium

Abstract

Consider a multi-armed bandit whose arms are independent Markov processes on a common underlying state space. The transition probability matrix of one of the arms (the odd arm) is different from the common transition probability matrix of all the other arms. The goal is to identify the odd arm as quickly as possible while keeping the probability of decision error small. We study the case of restless Markov observations and identify an asymptotic lower bound on the expected stopping time for a decision with vanishing error probability. We then propose a sequential test and show that the asymptotic behaviour of its expected stopping time comes arbitrarily close to that of the lower bound. Prior works dealt with iid arms and rested Markov arms, whereas our work deals with restless Markov arms.

Plan Ahead

IEEE ISIT 2021

2021 IEEE International Symposium on Information Theory

11-16 July 2021 | Melbourne, Victoria, Australia

Visit Website!

2020 IEEE International Symposium
on Information Theory

21-26 June 2020 • Los Angeles, California, USA

2020 IEEE International Symposium
on Information Theory

21-26 June 2020 • Los Angeles, California, USA

Technical Program

Paper Detail

IEEE ISIT 2021