河南理工大学学报(自然科学版)2025,Vol.44Issue(6):191-200,10.DOI:10.16186/j.cnki.1673-9787.2023020018
基于深度强化学习的车联网动态卸载成本优化
Dynamic offloading cost optimization of Internet of vehicles based on deep reinforcement learning
摘要
Abstract
Objectives The study aimed to solve the key problems of task offloading and resource allocation in the Internet of vehicles with imperfect channel,and reduce the computational cost.Methods Combined the imperfect channel characteristics to abstract the basic vehicle-connected task offload environment,jointly optimized the task offload ratio,power selection and server resource allocation,and established a long-term average cost minimization problem model for all users.Using a dynamic offloading optimization scheme based on deep reinforcement learning,and considering the continuity of solution variables,SP-DDPG(deep deterministic policy gradient with importance sampling and prioritized experience replay)algo-rithm was proposed to solve the problem model.Compared with some existing deep reinforcement learning methods,the performance of SP-DDPG algorithm under the influence of a single variable was studied,and two important indicators of average offloading cost and task discard number were calculated respectively.Results Compared with the complete task offloading algorithm F-DDPG and DDQN,the average task offloading cost was reduced by about 36.13%and 44.02%,and the number of dropped tasks was reduced by at least 4.38%and 9.76%respectively.Compared with the partial offloading algorithm DDPG,the aver-age offloading cost and the number of dropped tasks decreased by 13.34%and 3.17%.The experimental re-sults were averaged after multiple runs(tradeoff factor of delay and energy consumption ω=0.5,channel es-timation accuracy ρ=0.95),which had good reliability.Conclusions Compared with some conventional deep reinforcement learning algorithms,the proposed optimized deep deterministic policy gradient algo-rithm SP-DDPG had lower computational cost and better performance in the environment of vehicle network-ing with unstable and complex changes.关键词
车联网/部分卸载/资源分配/深度确定性策略梯度/不完美信道Key words
Internet of vehicles/partial offloading/resource allocation/deep deterministic policy gradient/imperfect channel分类
信息技术与安全科学引用本文复制引用
赵珊,贾宗璞,朱小丽,庞晓艳,谷坤源..基于深度强化学习的车联网动态卸载成本优化[J].河南理工大学学报(自然科学版),2025,44(6):191-200,10.基金项目
国家自然科学基金资助项目(62276092) (62276092)
河南省高校青年骨干教师资助计划项目(2019GGJS061) (2019GGJS061)