一种基于生成对抗模仿学习的作战决策方法OACSTPCD
A decision-making method based on generative adversarial imitation learning
为研究有限作战指挥样本下的智能决策方法,针对作战决策经验难以表达和智能决策学习训练样本稀缺等问题,基于联合战役仿真推演环境,提出了一种基于生成对抗模仿学习的作战决策方法.该方法整合了作战决策经验表示与学习过程,在上层决策和底层动作分层的基础上,采用规则定义特定任务执行逻辑,并利用生成对抗模仿学习算法提升智能体场景泛化能力.在构设的典型对抗场景中,该方法达到了预期效果,算法训练收敛,智能体输出决策合理.实验结果初步表明,生成对抗模仿学习作为一种智能作战决策方法,具有进一步研究价值.
To study the intelligent decision making methods under limited decision samples,aiming at the problems that op-erational decision-making experience is difficult to express and the training samples for intelligent decision learning are limit-ed,based on the joint operational simulation and drill environment,a decision-making method based on generative adversari-al imitation learning is proposed.This method integrates the operational decision-making experience representation and learn-ing process.On the basis of high-level decision-making and low-level action,rule definitions are used to specify the logic of task execution,and generative adversarial imitation learning algorithms are utilized to improve the generalization ability of in-telligent agents in scenarios.This method achieved expected results in the constructed typical adversarial scenarios.The algo-rithm training converged and the decisions output by the intelligent agent are reasonable.Preliminary experimental results in-dicate that generative adversarial imitation learning,as an intelligent operational decision-making method,has value for fur-ther research.
李东;许霄;吴琳
国防大学联合作战学院, 北京 100091
智能决策作战决策基于规则的方法生成对抗模仿学习
intelligent decision-makingoperational decision-makingrule-based methodgenerative adversarial imitation learning
《指挥控制与仿真》 2024 (002)
18-23 / 6
国家自然科学基金(62006235)
评论