首页|期刊导航|河南理工大学学报（自然科学版）|基于深度强化学习的车联网动态卸载成本优化

基于深度强化学习的车联网动态卸载成本优化

赵珊贾宗璞朱小丽庞晓艳谷坤源

河南理工大学学报（自然科学版）2025，Vol.44Issue(6)：191-200,10.

河南理工大学学报（自然科学版）2025，Vol.44Issue(6)：191-200,10.DOI:10.16186/j.cnki.1673-9787.2023020018

基于深度强化学习的车联网动态卸载成本优化

Dynamic offloading cost optimization of Internet of vehicles based on deep reinforcement learning

赵珊 ¹贾宗璞 ¹朱小丽 ¹庞晓艳 ¹谷坤源¹

作者信息

1. 河南理工大学计算机学院,河南焦作 454000
折叠

摘要

Abstract

Objectives The study aimed to solve the key problems of task offloading and resource allocation in the Internet of vehicles with imperfect channel,and reduce the computational cost.Methods Combined the imperfect channel characteristics to abstract the basic vehicle-connected task offload environment,jointly optimized the task offload ratio,power selection and server resource allocation,and established a long-term average cost minimization problem model for all users.Using a dynamic offloading optimization scheme based on deep reinforcement learning,and considering the continuity of solution variables,SP-DDPG(deep deterministic policy gradient with importance sampling and prioritized experience replay)algo-rithm was proposed to solve the problem model.Compared with some existing deep reinforcement learning methods,the performance of SP-DDPG algorithm under the influence of a single variable was studied,and two important indicators of average offloading cost and task discard number were calculated respectively.Results Compared with the complete task offloading algorithm F-DDPG and DDQN,the average task offloading cost was reduced by about 36.13%and 44.02%,and the number of dropped tasks was reduced by at least 4.38%and 9.76%respectively.Compared with the partial offloading algorithm DDPG,the aver-age offloading cost and the number of dropped tasks decreased by 13.34%and 3.17%.The experimental re-sults were averaged after multiple runs(tradeoff factor of delay and energy consumption ω=0.5,channel es-timation accuracy ρ=0.95),which had good reliability.Conclusions Compared with some conventional deep reinforcement learning algorithms,the proposed optimized deep deterministic policy gradient algo-rithm SP-DDPG had lower computational cost and better performance in the environment of vehicle network-ing with unstable and complex changes.

关键词

车联网/部分卸载/资源分配/深度确定性策略梯度/不完美信道

Key words

Internet of vehicles/partial offloading/resource allocation/deep deterministic policy gradient/imperfect channel

分类

信息技术与安全科学

引用本文复制引用

赵珊,贾宗璞,朱小丽,庞晓艳,谷坤源..基于深度强化学习的车联网动态卸载成本优化[J].河南理工大学学报（自然科学版）,2025,44(6):191-200,10.

基金项目

国家自然科学基金资助项目(62276092) （62276092）

河南省高校青年骨干教师资助计划项目(2019GGJS061) （2019GGJS061）

河南理工大学学报（自然科学版）

OA北大核心

ISSN：1673-9787

访问量0

下载量0

段落导航