首页|期刊导航|北京信息科技大学学报（自然科学版）|基于联邦深度强化学习的多无人机轨迹规划算法

基于联邦深度强化学习的多无人机轨迹规划算法

王鉴威李学华陈硕

北京信息科技大学学报（自然科学版）2023，Vol.38Issue(6)：1-10,10.

北京信息科技大学学报（自然科学版）2023，Vol.38Issue(6)：1-10,10.DOI:12.16508/j.cnki.11-5866/n.2023.06.001

基于联邦深度强化学习的多无人机轨迹规划算法

A multi-UAVs trajectory planning algorithm based on federated deep reinforcement learning

王鉴威 ¹李学华 ¹陈硕¹

作者信息

1. 北京信息科技大学现代测控技术教育部重点实验室,北京 100101
折叠

摘要

Abstract

For the multi-UAVs cooperative service ground user mobile edge computing service,a model was constructed with the objectives of multi-UAVs service ground user fairness and computation delay weighted sum maximization,and the scheduling of UAV trajectories and task offloading ratios were jointly optimized.A multi-UAVs trajectory planning algorithm based on federated deep reinforcement learning in mobile edge computing scenarios was proposed.The algorithm first deployed independent deep reinforcement learning models on each UAV,so that each UAV learned to acquire a locally optimal model based on its own acquired information.Secondly,a federated learning framework was introduced to enable multi-UAVs to collaboratively serve ground users through information aggregation,so that the service effect reached the global optimum.Simulation results show that the proposed scheme effectively improves the fairness and delay compared with the multi-intelligence deep reinforcement learning without information interaction.

关键词

无人机通信/移动边缘计算/深度强化学习/联邦学习/轨迹规划/公平性

Key words

UAV communication/mobile edge computing/deep reinforcement learning/federated learning/trajectory planning/fairness

分类

信息技术与安全科学

引用本文复制引用

王鉴威,李学华,陈硕..基于联邦深度强化学习的多无人机轨迹规划算法[J].北京信息科技大学学报（自然科学版）,2023,38(6):1-10,10.

基金项目

国家自然科学基金项目(61901043) （61901043）

北京信息科技大学"勤信人才"培育计划(QXTCPB202101) （QXTCPB202101）

北京市教委科研计划科技一般项目(KM202211232010) （KM202211232010）

北京信息科技大学学报（自然科学版）

ISSN：1674-6864

访问量0

下载量0

段落导航