多人不完备信息博弈的一种解法及改进

A solution of multi-player imperfect information game and improvements

导出

摘要多人不完备信息博弈是一类存在私有信息而出现信息不完备、不对称的多人博弈.以扑克牌游戏这类典型的多人不完备信息博弈为原型提出一般动态博弈模型GDGM.在该模型框架下,提出一种基于MU算法的多人不完备信息博弈算法MMU,并将MMU算法分别与经典博弈算法Paranoid和MCTS结合,消除该算法对经验值的依赖.最后实验从胜率和得分两个角度对算法进行评价.结果表明,结合了经典博弈算法Paranoid和MCTS算法的PN-MMU和MT-MMU算法可有效处理以扑克牌游戏为代表的多人不完备信息博弈问题,并且与PN-MMU相比,MT-MMU具有更好的博弈能力. Due to the private information in multi-player imperfect information game,the information of each game player is incomplete and asymmetric.General dynamic game model（GDGM） is proposed based on poker game,which is typical multi-player imperfect information game.Under the frame of GDGM,maxn-Monte Carlo sampling-UCT（MMU） algorithm for multi-player imperfect information game is presented based on Monte Carlo sampling-UCT（MU） algorithm,and further MMU is combined with Paranoid and Monte Carlo tree search（MCTS） respectively to eliminate its dependence on experience value.Finally,both algorithms are evaluated from the perspectives of winning rate and score by experiments.The experimental results show that the Paranoild algorithm MMU（PN-MMU） and Monte Carlo three search MMU（MT-MMU） algorithm combined with Paranoid and MCTS respectively can effectively deal with the problems of poker games.Compared with PN-MMU,MT-MMU has better performance of game.

作者徐涛赵慧伟吕宗磊

机构地区中国民航大学计算机科学与技术学院中国民航信息技术科研基地

出处《武汉大学学报（工学版）》 CAS CSCD 北大核心 2011年第6期792-796,805,共6页 Engineering Journal of Wuhan University

基金国家863计划课题(编号:2006AA12A106) 民航软科学研究项目(编号:MHRD201007)

关键词人工智能多人不完备信息博弈博弈模型 PN-MMU算法 MT-MMU算法 artificial Intelligence multi-player imperfect information game game model PN-MMU algorithm MT-MMU algorithm

分类号 TP182 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献16

1张利群.五道棋计算机博弈程序的设计与实现[J].计算机工程,2010,36(10):221-222. 被引量：5
2Russell S, Norvig P. Artificial Intelligence -A Modern Approach[M]. The Second Edition. London.. Prentice Hall, 2003..126-145.
3贾春福,钟安鸣,张炜,马勇.网络安全不完全信息动态博弈模型[J].计算机研究与发展,2006,43(z2):530-533. 被引量：11
4Xia ZY, Wang J. Analyze and guess type of piece in the computer game intelligent system[C]//Proceedings of the 2nd International Conference on Fuzzy Systems and Knowledge Discovery, 2005..1174-1183.
5王轩,许朝阳.时序差分学习在非完备信息机器博弈中的应用[C]//2007中国机器博弈学术研讨会.重庆:中国人工智能学会,2007:55-58.
6Chung M,Buro M,Schaffer J. Monte carlo planning in RTS games[C]//Proceedings of IEEE 2005 Symposi- um on Computationa! Intelligence and Games, 2005 : 1-8.
7He S J, Gao Y, Yang J J, et al. Creating challenge- able and satisfactory game opponent by the use of CI approaches[J]. International Journal of Advancements in Computing Technology, 2010,2(1)..67-70.
8Zhang J J, Wang X, Lin J. UCT algorithm in imper- fect information ulti-player military chess game[C]// Proceedings of Joint Conference on Information Sci- ences, 2008:1-9.
9Sturtevant N. An analysis of UCT in multi-player games[C]3//Proceedings o{ the 6th International Con- ference on Computers and Games, 2008 : 37-49.
10Sturtevant N. Multi-player games., algorithms and ap- proaches [ D]. California: University of California, 2003.

二级参考文献39

1徐心和,王骄.中国象棋计算机博弈关键技术分析[J].小型微型计算机系统,2006,27(6):961-969. 被引量：62
2冀俊忠,刘椿年,阎静.一种快速的贝叶斯网结构学习算法[J].计算机研究与发展,2007,44(3):412-419. 被引量：9
3[1]B V John.A conceptual model of hacker development and motivations.Journal of E-Business,2001,1(2):1-9
4[2]B Schneier.Attack trees:Modeling security threats.Dr Dobb's Journal of Software Tools,1999,24(12):21-29
5[3]M Rogers.Psychology of hackers:A new taxonomy available.http://ww.infowar.com,2001
6罗鉴江.民间棋类游戏[M].北京:农村读物出版社,2003.
7van den Herik H Jaap,Uiterwijk Jos W H M,van Rijswijck Jack.Games solved:Now and in the future[J].Artificial Intelligence,2001,134:277-311.
8Schaeffer J.A gamut of games[J].AI Magazine,2001,22(3):29-46.
9Ginsberg M L.GIB:Imperfect information in a computationally challenging game[J].Journal of Artificial Intelligence Research (JAIR),2001,14:303-358.
10Billings D,Burch N,et al.Approximating game-theoretic optimal strategies for full-scale poker[C]//Proc of IJCAI-03.San Francisco:Morgan Kaufmann,2003.

共引文献24

1刘益,闵兰.确定有限自动机的逻辑形式定义[J].西南师范大学学报（自然科学版）,2008,33(5):134-136. 被引量：5
2石乐义,贾春福,吕述望.服务跳变抗DoS机制的博弈理论分析[J].电子与信息学报,2009,31(1):228-232. 被引量：7
3闵兰,刘益.奇偶校验自动机的逻辑形式描述[J].西南师范大学学报（自然科学版）,2009,34(3):107-109.
4朱文倩,贺巧,昌春艳,雷红轩.基于整数加群模糊自动机及其在对策论中的应用[J].内江师范学院学报,2009,24(8):41-43. 被引量：1
5孟祥宏.信息安全攻防博弈研究[J].计算机技术与发展,2010,20(4):159-162. 被引量：4
6刘益.DFA最小化算法中状态等价判断方法[J].宜宾学院学报,2010,10(6):55-56.
7马骁,王轩,王晓龙.一类非完备信息博弈的信息模型[J].计算机研究与发展,2010,47(12):2100-2109. 被引量：5
8娄燕强,宋如顺,马永彩.基于RBF神经网络的攻防博弈模型[J].计算机应用与软件,2011,28(1):99-101. 被引量：1
9王桂平,张帅.基于双向广度优先搜索的魔力方块问题求解[J].计算机工程,2011,37(20):219-222. 被引量：3
10闵兰,刘益,陈晓敏.确定有限自动机推理的可计算逻辑分析[J].重庆邮电大学学报（自然科学版）,2011,23(6):761-764. 被引量：1

1Daffodil.极限24点[J].电脑爱好者,2001(11):81-83.
2蒋帅,朱相东.游戏的规律[J].小学生之友（智力探索版）（中旬）,2016,0(5):40-41.
3黄继源,杨庆华（指导）.有趣的“24点”[J].快乐作文（高年级版）,2012(6):21-21.
4蒋帅.游戏的规律[J].小学生导刊（高年级版）,2015,0(1):17-17.
5蒋帅.游戏的规律[J].数学大王（中高年级）（3-6年级）,2014(7):40-40.
6蒋帅.玩出了规律[J].小学生导读,2013(12):20-21.
7张斌,徐艳群.自适应遗传算法在象棋博弈系统中的应用[J].电脑编程技巧与维护,2012(16):122-123.
8赵玉勇,刘凤亮.扑克牌游戏DIY[J].软件,2002,23(6):52-55.
9王牌登场[J].微型计算机,2011(10):1-1.
10陆恒如.取胜的策略[J].小学生学习指导（高年级）,2009(12):28-29.

武汉大学学报（工学版）

2011年第6期

浏览历史

内容加载中请稍等...

多人不完备信息博弈的一种解法及改进

参考文献16

二级参考文献39

共引文献24

相关作者

相关机构

相关主题

浏览历史