首页|期刊导航|信号处理|基于通感一体的多无人机感知联合优化方法

基于通感一体的多无人机感知联合优化方法

王锦宇孙凤松王寅昊王海涛张先超

信号处理2026，Vol.42Issue(2)：131-147,17.

信号处理2026，Vol.42Issue(2)：131-147,17.DOI:10.12466/xhcl.2026.02.002

基于通感一体的多无人机感知联合优化方法

Joint Optimization for Multi-UAV Sensing Based on Integrated Sensing and Communication

王锦宇 ¹孙凤松 ¹王寅昊 ¹王海涛 ²张先超³

作者信息

1. 北京邮电大学信息与通信工程学院,北京 100876
2. 浙江大学工程师学院,浙江杭州 310015
3. 嘉兴大学全省多模态感知与智能系统重点实验室,浙江嘉兴 314001
折叠

摘要

Abstract

Low-Altitude Wireless Networks(LAWNs)serve as a critical infrastructure pillar for achieving extensive wide-area coverage and intelligent sensing capabilities.Nevertheless,the overall efficacy of these networks is severely constrained by rigid limitations,specifically the scarcity of spectrum resources and the restricted capabilities of onboard hardware.To mitigate these challenges,Integrated Sensing and Communication(ISAC)technology has emerged as a promising solution.By enabling the sharing of spectrum and hardware resources,ISAC not only enhances resource utili-zation efficiency but also facilitates a deeper integration of sensing and communication functions.Building upon ISAC,multi-Unmanned Aerial Vehicle(UAV)cooperative sensing offers a pathway to transcend the performance bottlenecks inherent to single-UAV systems,particularly regarding coverage scope and sensing precision.Consequently,this ap-proach represents a pivotal strategy for satisfying the stringent requirements of low-altitude supervision and regulation.However,the introduction of multi-UAV collaboration engenders complex spatial geometric constraints and co-channel interference.Furthermore,in three-dimensional dynamic environments,the strong coupling between communication and sensing tasks renders the joint optimization of UAV trajectories,transmit power,subcarrier allocation,and user as-sociation a highly intractable Mixed-Integer Non-Linear Programming(MINLP)problem.Additionally,classical Multi-Agent Reinforcement Learning(MARL)algorithms encounter significant bottlenecks when applied to large-scale net-works,including state space explosion,sluggish convergence speeds,and low efficiency in policy search.These limita-tions hinder their ability to meet the stringent real-time demands of dynamic cooperative sensing tasks.In response to these challenges,this paper proposes a Quantum-enhanced Hierarchical Multi-Agent Proximal Policy Optimization(Q-H-MAPPO)algorithm,designed to maximize the joint effectiveness of multi-UAV cooperative sensing and communica-tion.Initially,a joint optimization model tailored for cooperative sensing is constructed.The Cramér-Rao Lower Bound(CRLB)is adopted as the key performance metric for sensing to quantify the impact of multi-UAV geometric configura-tions on sensing accuracy.The objective is to minimize the positioning error while simultaneously guaranteeing the Qual-ity of Service(QoS)for communication tasks.Subsequently,a Centralized Training and Decentralized Execution(CTDE)framework is employed to design a Hierarchical Markov Decision Process(H-MDP).This hierarchical struc-ture facilitates task decomposition,thereby achieving effective decoupling of discrete and continuous variables.Further-more,the study introduces an amplitude encoding mechanism based on quantum variational circuits and designs a Graph Attention Mechanism inspired by the concept of quantum swap tests.By simulating the feature mapping capabilities of quantum states within a classical computing framework,the proposed method efficiently extracts non-linear cooperative relationships among multiple agents as well as critical channel state information.Simulation results indicate that the pro-posed Q-H-MAPPO algorithm demonstrates superior performance in multi-target,high-load dynamic scenarios.In a spe-cific scenario involving six targets,the algorithm reduces the sensing positioning CRLB to approximately 0.18 m,repre-senting a reduction of at least 7%compared with other benchmark methods.In a large-scale networking scenario involv-ing 15 UAVs,the system sum rate achieves approximately 31.5 Mbps,reflecting an improvement of approximately 21%to 44%over the aforementioned baseline methods.Moreover,when the network scales to 20 UAVs,the inference la-tency remains stable between 20 and 26 ms that is a reduction of approximately 81%to 87%compared with typical base-lines.The algorithm also exhibits the fastest convergence speed,achieving stability within merely 150 to 200 training episodes.These findings validate the significant advantages of Q-H-MAPPO in enhancing the cooperative sensing accu-racy,system throughput,and decision-making real-time performance within large-scale low-altitude networks.

关键词

通感一体化/多无人机协作感知/多智能体强化学习/量子计算/资源分配

Key words

integrated sensing and communication(ISAC)/multi-UAV collaboration sensing/multi-agent reinforce-ment learning(MARL)/quantum computing/resource allocation

分类

信息技术与安全科学

引用本文复制引用

王锦宇,孙凤松,王寅昊,王海涛,张先超..基于通感一体的多无人机感知联合优化方法[J].信号处理,2026,42(2):131-147,17.

基金项目

浙江省自然科学基金项目(LD24F020009) Zhejiang Provincial Natural Science Foundation of China(LD24F020009) （LD24F020009）

信号处理

ISSN：1003-0530

访问量0

下载量0

段落导航