3CHRISTOPHER M ANDERSON.Ambiguity aversion in multi-armed bandit problems[J].Theory and Decision,2012,72(1):15-33.
4GU M Z,LU X W.The expected asymptotical ratio for preemptive stochastic online problem[J].Theoretical Computer Science,2013,49 (5):96-112.
5ISAAS M SONIN.A generalized Gittins index for a Markov chain and its recursive calculation[J].Statistics and Probability Letters,2008,78(12):1526-1553.
6U DINESH KUMAR,HARITHA SARANGA.Optimal selection of obsolescence mitigation strategies using a restless bandit model[J].European Journal of Operational Research,2010,200(1):170-180.
7SI P B,JI H,YU F R.Optimal network selection in heterogeneous wireless multimedia networks[J].Wireless Networks,2010,16(5):1277-1288.
8GLAZEBROOK K D,GAVER D P,JACOBS P A.On a military scheduling problem[R].Monterey CA:Naval Postgraduate School,2001.
9BARKDOLL T C,GAVER D P,GLAZEBROOK K D,et al.Suppression of enemy air defense (SEAD) as an information duel[D].Monterey:Naval Postgraduate School Working Paper,2001.
10GLAZEBROOK K D,WASHBURN A.Shoot-Look-Shoot:A review and extension[J].Operations Research,2004,52(3):454-463.