Study on Valuation Algorithm and Game Training in Computer Game

LÜ Yanhui (吕艳辉), GONG Ruimin (宫瑞敏)

Computer Engineering (计算机工程), 2012, Vol. 38, Issue 11: 163-166, 4. DOI: 10.3969/j.issn.1000-3428.2012.11.050

Study on Valuation Algorithm and Game Training in Computer Game

LÜ Yanhui¹, GONG Ruimin¹

Author information

1. School of Information Science and Engineering, Shenyang Ligong University, Shenyang 110159

Abstract

Position evaluation is the most difficult problem in computer game programs of all kinds. This paper presents a valuation method, BP-TD(λ), that combines the temporal difference algorithm with a back propagation neural network in order to solve the problem of adjusting the parameter values of the valuation function. On this basis, to improve the performance of game training, a strategy of setting different parameter values for the opening and middlegame phases is proposed. The game system RenjuTD is implemented with Renju as the application background. Experimental results show that the playing level of the program improves significantly.
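
The abstract does not spell out the update rule, the network architecture, or which parameters differ between game phases. As a rough illustration only, the sketch below shows how a TD(λ)-style update can be driven by backpropagated gradients of a small evaluation network. Everything in it is an assumption made for illustration: the names `TinyValueNet`, `td_lambda_episode`, and `phase_params`, the one-hidden-layer architecture, the choice of learning rate and λ as the phase-dependent parameters, and all numeric values. It is not the RenjuTD implementation.

```python
import math
import random


class TinyValueNet:
    """Hypothetical one-hidden-layer evaluator: V(x) in (0, 1)."""

    def __init__(self, n_in, n_hidden, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.w2 = [rng.uniform(-0.1, 0.1) for _ in range(n_hidden)]

    def forward(self, x):
        # tanh hidden layer, sigmoid output
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in self.w1]
        z = sum(w * hi for w, hi in zip(self.w2, h))
        return 1.0 / (1.0 + math.exp(-z)), h

    def gradient(self, x):
        """Return V(x) and dV/dw for both layers (plain backpropagation)."""
        v, h = self.forward(x)
        dz = v * (1.0 - v)  # derivative of the sigmoid output
        g2 = [dz * hj for hj in h]
        g1 = [[dz * w2j * (1.0 - hj * hj) * xi for xi in x]
              for w2j, hj in zip(self.w2, h)]
        return v, g1, g2


def phase_params(move_index, opening_moves=10):
    """Assumed schedule: (alpha, lam) differ by game phase; values are placeholders."""
    return (0.05, 0.5) if move_index < opening_moves else (0.1, 0.7)


def td_lambda_episode(net, states, outcome, params=phase_params):
    """One TD(lambda) pass over a finished game.

    states:  feature vectors of the successive positions
    outcome: final result encoded in [0, 1]
    """
    e1 = [[0.0] * len(row) for row in net.w1]  # eligibility traces, layer 1
    e2 = [0.0] * len(net.w2)                   # eligibility traces, layer 2
    for t, x in enumerate(states):
        alpha, lam = params(t)
        v, g1, g2 = net.gradient(x)
        # decay the old traces and add the current gradient
        e2 = [lam * e + g for e, g in zip(e2, g2)]
        e1 = [[lam * e + g for e, g in zip(er, gr)] for er, gr in zip(e1, g1)]
        # target is the next position's value, or the game result at the end
        if t + 1 < len(states):
            target = net.forward(states[t + 1])[0]
        else:
            target = outcome
        delta = target - v  # temporal difference error
        net.w2 = [w + alpha * delta * e for w, e in zip(net.w2, e2)]
        net.w1 = [[w + alpha * delta * e for w, e in zip(wr, er)]
                  for wr, er in zip(net.w1, e1)]
```

In a training loop, each self-play game would be converted to a list of feature vectors and passed to `td_lambda_episode` together with the encoded result. Passing a different `params` callable is one way to experiment with phase-specific settings.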

Key words

computer game; temporal difference learning; back propagation neural network; valuation algorithm; reinforcement learning; game training

Classification

Information Technology and Security Science

Cite this article

LÜ Yanhui, GONG Ruimin. Study on Valuation Algorithm and Game Training in Computer Game[J]. Computer Engineering, 2012, 38(11): 163-166, 4.

Funding

National Natural Science Foundation of China (60873010)

Program for New Century Excellent Talents in University (NCET-05-0288)

Computer Engineering

OA, CSCD, CSTPCD

ISSN: 1000-3428
