火力与指挥控制2023,Vol.48Issue(11):17-24,8.DOI:10.3969/j.issn.1002-0640.2023.11.003
智能蓝军作战行为决策模型优化技术研究
Research on Optimization Technology of Intelligent Blue Army Combat Behavior Decision-making Model
摘要
Abstract
To deal with such problems as poor adaptability and learning ability of blue army model,optimization technology of intelligent blue army combat behavior decision-making model integrating decision-making tree and PPO reinforcement learning is proposed,in the running process of decision tree,if intelligent agent falls into a decision-making dilemma.A network model based on PPO training algorithm is utilized to generate the optimal action,to ensure its continued smooth and efficient execution.Finally,a comparative experiment is carried out based on the miaosuan·zhisheng platform to verify the feasibility and effectiveness of the optimization technology.关键词
智能蓝军/决策树/PPO/强化学习Key words
intelligent blue army/decision tree/PPO/reinforcement learning分类
信息技术与安全科学引用本文复制引用
章乐贵,曹雷,陈希亮,汤伟,王军,张启阳..智能蓝军作战行为决策模型优化技术研究[J].火力与指挥控制,2023,48(11):17-24,8.基金项目
国家自然科学基金(61806221) (61806221)
国防科技重点实验室基金(6142101180304) (6142101180304)
国防科技创新特区163计划(1916311LZ00100301) (1916311LZ00100301)
"十三五"全军共用信息系统装备预研资助项目(×××××) (×××××)