一种基于生成对抗模仿学习的作战决策方法OACSTPCD
A decision-making method based on generative adversarial imitation learning
为研究有限作战指挥样本下的智能决策方法,针对作战决策经验难以表达和智能决策学习训练样本稀缺等问题,基于联合战役仿真推演环境,提出了一种基于生成对抗模仿学习的作战决策方法.该方法整合了作战决策经验表示与学习过程,在上层决策和底层动作分层的基础上,采用规则定义特定任务执行逻辑,并利用生成对抗模仿学习算法提升智能体场景泛化能力.在构设的典型对抗场景中,该方法达到了预期效果,算法训练收敛,智能体输出决策合理.实验结果初步表明,生成对抗模仿学习作为一种智能…查看全部>>
To study the intelligent decision making methods under limited decision samples,aiming at the problems that op-erational decision-making experience is difficult to express and the training samples for intelligent decision learning are limit-ed,based on the joint operational simulation and drill environment,a decision-making method based on generative adversari-al imitation learning is proposed.This method integrates the operational decision-making experience…查看全部>>
李东;许霄;吴琳
国防大学联合作战学院, 北京 100091国防大学联合作战学院, 北京 100091国防大学联合作战学院, 北京 100091
智能决策作战决策基于规则的方法生成对抗模仿学习
intelligent decision-makingoperational decision-makingrule-based methodgenerative adversarial imitation learning
《指挥控制与仿真》 2024 (2)
基于分层强化学习的作战决策高效策略学习方法
18-23,6
国家自然科学基金(62006235)
评论