计算机与数字工程 (Computer & Digital Engineering), 2025, Vol. 53, Issue (1): 21-25, 62, 6. DOI: 10.3969/j.issn.1672-9722.2025.01.005
基于确定性策略的卫星通信动态功率分配算法
An Algorithm of Dynamic Power Allocation Based on Deterministic Strategy for Satellite Communication
Abstract
With ground users' growing demand for data and the need to allocate resources among the multiple beams of a satellite, efficient and stable dynamic resource allocation has become a key constraint on the satellite communication industry. Compared with traditional algorithms, dynamic algorithms based on artificial intelligence bring multi-dimensional resource allocation and complex space-time constraints into multi-beam satellite communication. Based on an established multi-beam satellite communication model, a twin delayed deep deterministic policy gradient (TD3) algorithm based on a deterministic policy is proposed. In this algorithm, a value network is constructed to evaluate policies, a policy network is constructed to update the action policy, and a delayed update strategy and a noise-smoothed target policy are adopted. Simulation results show that the algorithm improves throughput by 5.2% compared with other reinforcement learning algorithms and traditional algorithms. In addition, by varying the number of neurons in the hidden layer, the hidden-layer parameters that make the network model most stable are found.
Keywords
power allocation / multi-beam satellite / deep reinforcement learning / deterministic policy
Classification
Information Technology and Security Science
Cite this article
兰松, 李晖, 徐永杰, 彭号杰. 基于确定性策略的卫星通信动态功率分配算法 [J]. 计算机与数字工程, 2025, 53(1): 21-25, 62, 6.
Funding
Supported by the National Natural Science Foundation of China (No. 61661018)
and the Youth Fund of the Jiangsu Provincial Basic Research Program (No. BK20210064).
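Illustrative note on the abstract: the three TD3 ingredients it names (twin value networks, a delayed policy update, and a noise-smoothed target policy) can be sketched in a few lines. This is a minimal sketch under stated assumptions, not the authors' implementation: the action is reduced to a single scalar in [0, 1] standing in for a normalized beam power, the networks are replaced by plain callables, all function and parameter names are hypothetical, and the default values (`sigma=0.2`, `noise_clip=0.5`, `policy_delay=2`) are commonly used TD3 defaults rather than settings reported in this paper.

```python
import random


def smoothed_target_action(mu_target, next_state, sigma=0.2, noise_clip=0.5,
                           a_low=0.0, a_high=1.0, rng=random):
    """Target policy smoothing: perturb the target action with clipped noise."""
    # Gaussian noise, clipped so the perturbation stays small.
    noise = max(-noise_clip, min(noise_clip, rng.gauss(0.0, sigma)))
    # Assumption: a scalar action bounded to [a_low, a_high] (normalized power).
    return max(a_low, min(a_high, mu_target(next_state) + noise))


def td3_target(reward, next_state, done, mu_target, q1_target, q2_target,
               gamma=0.99, **smoothing_kwargs):
    """Bootstrapped target using the smaller of the two target value estimates."""
    a_next = smoothed_target_action(mu_target, next_state, **smoothing_kwargs)
    # Taking the minimum of the twin critics limits value overestimation.
    q_min = min(q1_target(next_state, a_next), q2_target(next_state, a_next))
    return reward + gamma * (0.0 if done else 1.0) * q_min


def should_update_actor(step, policy_delay=2):
    """Delayed update: refresh the policy only every `policy_delay` critic steps."""
    return step % policy_delay == 0
```

In a full training loop, both value networks would be regressed toward `td3_target(...)` on every step, while the policy network and the target networks would be updated only on steps where `should_update_actor(step)` is true.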