扩充状态空间下不休止多臂机的Whittle指数

WHITTLE INDEX FOR RESTLESS BANDITS WITH EXPANDING STATE SPACES

作　　者：刘克勤 Liu Keqin(Department of Mathematics,Nanjing University,Nanjing 210093)

出　　处：《高等学校计算数学学报》2020年第4期372-384,共13页Numerical Mathematics A Journal of Chinese Universities

摘　　要：We investigate Whittle index policy for a restless bandit model whose statespace may be enlarged under passive actions.This model arises in many important ap-plications and extends the classical model introduced by Whittle in 1988.In the classicalmodel,one chooses a subset of arms to play at each time and accrue certain reward de-termined by the states of all arms and the subset of chosen arms.The state of each armevolves according to a Markov process whose parameters(transition matrix)may dependon whether or not the arm is selected.The objective is to maximize the time-average re-ward over long-term.Weber and Weiss in 1990 proved the asymptotic optimality of Whit-tle index under a sufficient condition for the classical model,where Whittle's indexabilitywas required.In this paper,we extend Whittle index to the general model as consideredhere.Our extension is based on policy continuation and tie-breaking ordering of Whittleindex when new states join the system.By requiring a positive recurrent sub-state-spaceand boundedness of immediate rewards,we show that randomization can achieve optimal-ity under Whittle's relaxed constraint.We further analyze the fluid dynamics of our modeland show that the asymptotic optimality of Whittle index under the strict constraint canalso be extended.

关键词：Restless multi-armed bandits Whittle index state expansion policy contin-uation optimality under relaxed constraints fluid model asymptotic optimality

分类号：O1[理学—数学]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

扩充状态空间下不休止多臂机的Whittle指数

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

扩充状态空间下不休止多臂机的Whittle指数

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索