Computer Engineering (计算机工程), 2012, Vol. 38, Issue 11: 163-166, 4. DOI: 10.3969/j.issn.1000-3428.2012.11.050
Study on Valuation Algorithm and Game Training in Computer Game
(Original Chinese title: 计算机博弈中估值算法与博弈训练的研究)
Abstract
Situation valuation is the most difficult problem in all kinds of computer game-playing programs. This paper presents a valuation method named BP-TD(λ), which combines the temporal difference algorithm with a back propagation neural network and solves the problem of adjusting the parameter values of the valuation function. On this basis, to improve the effect of game training, a strategy of setting different parameter values for the opening and middle-game phases is proposed. The game system RenjuTD is implemented with Renju as the application background. Experimental results show that the playing level of the program is significantly improved.
Keywords
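computer game / temporal difference learning / back propagation neural network / valuation algorithm / reinforcement learning / game training
Classification
Information Technology and Security Science
Cite this article
Lü Yanhui (吕艳辉), Gong Ruimin (宫瑞敏). Study on Valuation Algorithm and Game Training in Computer Game [J]. Computer Engineering, 2012, 38(11): 163-166, 4.
Funding
National Natural Science Foundation of China (60873010)
Program for New Century Excellent Talents in University (NCET-05-0288)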
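
The abstract describes combining temporal difference learning with a back propagation neural network to tune a valuation function, using different parameter values in the opening and middle-game phases. The following is a minimal sketch of that general idea using the standard TD(λ) update with eligibility traces; it is not the paper's RenjuTD implementation. The network shape, the feature vectors, the names `TinyValueNet`, `phase_params` and `td_lambda_episode`, and all numeric parameter values are assumptions made for illustration.

```python
import math
import random


class TinyValueNet:
    """Hypothetical one-hidden-layer network: feature vector -> value in (0, 1)."""

    def __init__(self, n_in, n_hidden, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.w2 = [rng.uniform(-0.1, 0.1) for _ in range(n_hidden)]

    def forward(self, x):
        # Hidden layer uses tanh, output layer uses a sigmoid.
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in self.w1]
        z = sum(w * hj for w, hj in zip(self.w2, h))
        return 1.0 / (1.0 + math.exp(-z)), h

    def gradients(self, x):
        # Back propagation: derivative of the output value w.r.t. every weight.
        v, h = self.forward(x)
        dz = v * (1.0 - v)
        g2 = [dz * hj for hj in h]
        g1 = [[dz * w2j * (1.0 - hj * hj) * xi for xi in x]
              for w2j, hj in zip(self.w2, h)]
        return v, g1, g2


def phase_params(move_index, opening_moves=8):
    """Return (alpha, lam) per game phase. The values are illustrative only."""
    if move_index < opening_moves:
        return 0.05, 0.9   # assumed opening-phase setting
    return 0.1, 0.7        # assumed middle-game setting


def td_lambda_episode(net, states, outcome):
    """Apply TD(lambda) updates for one finished game.

    states  : list of feature vectors, one per position seen by the learner
    outcome : final result in [0, 1], e.g. 1.0 for a win
    """
    n_hidden, n_in = len(net.w1), len(net.w1[0])
    e1 = [[0.0] * n_in for _ in range(n_hidden)]   # traces for w1
    e2 = [0.0] * n_hidden                          # traces for w2
    for t, x in enumerate(states):
        alpha, lam = phase_params(t)
        v, g1, g2 = net.gradients(x)
        # Decay the eligibility traces and add the current gradient.
        for j in range(n_hidden):
            e2[j] = lam * e2[j] + g2[j]
            for i in range(n_in):
                e1[j][i] = lam * e1[j][i] + g1[j][i]
        # Target is the next position's value, or the game result at the end.
        if t + 1 < len(states):
            target, _ = net.forward(states[t + 1])
        else:
            target = outcome
        delta = target - v
        for j in range(n_hidden):
            net.w2[j] += alpha * delta * e2[j]
            for i in range(n_in):
                net.w1[j][i] += alpha * delta * e1[j][i]


if __name__ == "__main__":
    net = TinyValueNet(n_in=3, n_hidden=4)
    game = [[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]  # made-up position features
    td_lambda_episode(net, game, outcome=1.0)
    print(net.forward(game[-1])[0])
```

In a real Renju program the feature vectors would encode board patterns and the training games would come from self-play or recorded games; both are outside the scope of this sketch.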