|国家科技期刊平台
首页|期刊导航|指挥控制与仿真|一种基于生成对抗模仿学习的作战决策方法

一种基于生成对抗模仿学习的作战决策方法OACSTPCD

A decision-making method based on generative adversarial imitation learning

中文摘要英文摘要

为研究有限作战指挥样本下的智能决策方法,针对作战决策经验难以表达和智能决策学习训练样本稀缺等问题,基于联合战役仿真推演环境,提出了一种基于生成对抗模仿学习的作战决策方法.该方法整合了作战决策经验表示与学习过程,在上层决策和底层动作分层的基础上,采用规则定义特定任务执行逻辑,并利用生成对抗模仿学习算法提升智能体场景泛化能力.在构设的典型对抗场景中,该方法达到了预期效果,算法训练收敛,智能体输出决策合理.实验结果初步表明,生成对抗模仿学习作为一种智能作战决策方法,具有进一步研究价值.

To study the intelligent decision making methods under limited decision samples,aiming at the problems that op-erational decision-making experience is difficult to express and the training samples for intelligent decision learning are limit-ed,based on the joint operational simulation and drill environment,a decision-making method based on generative adversari-al imitation learning is proposed.This method integrates the operational decision-making experience representation and learn-ing process.On the basis of high-level decision-making and low-level action,rule definitions are used to specify the logic of task execution,and generative adversarial imitation learning algorithms are utilized to improve the generalization ability of in-telligent agents in scenarios.This method achieved expected results in the constructed typical adversarial scenarios.The algo-rithm training converged and the decisions output by the intelligent agent are reasonable.Preliminary experimental results in-dicate that generative adversarial imitation learning,as an intelligent operational decision-making method,has value for fur-ther research.

李东;许霄;吴琳

国防大学联合作战学院, 北京 100091

智能决策作战决策基于规则的方法生成对抗模仿学习

intelligent decision-makingoperational decision-makingrule-based methodgenerative adversarial imitation learning

《指挥控制与仿真》 2024 (002)

基于分层强化学习的作战决策高效策略学习方法

18-23 / 6

国家自然科学基金(62006235)

10.3969/j.issn.1673-3819.2024.02.003

评论