计算机应用与软件2024,Vol.41Issue(7):207-214,8.DOI:10.3969/j.issn.1000-386x.2024.07.031
容器集群GPU资源共享调度优化
OPTIMIZATION OF GPU RESOURCE SHARING SCHEDULING FOR CONTAINER CLUSTERS
摘要
Abstract
In a container cluster environment,the entire physical GPU resource can usually only be scheduled exclusively by a single container,and there is a lot of waste of resources.Existing GPU sharing scheduling schemes still have problems of scheduling failure,high resource overhead and lack of resource isolation.Improved GPU sharing used the LD_PRELOAD mechanism to effectively isolate GPU memory resources,and it optimized the original scheduling algorithm,so that the utilization of cluster video memory resources was greatly improved.The experimental results verify the effectiveness of the improved GPU Sharing in the realization of resource isolation.At the same time,the improved GPU sharing has only 1.008%extra overhead for executing applications on the physical machine,and the optimized scheduling algorithm has increased by 53.01%GPU memory utilization.关键词
GPU集群/GPU共享调度/容器/资源共享/GPU利用率Key words
GPU cluster/GPU shared scheduling/Container/Resource sharing/GPU utilization rate分类
信息技术与安全科学引用本文复制引用
罗恋,顾进广,李奇缘,高峰..容器集群GPU资源共享调度优化[J].计算机应用与软件,2024,41(7):207-214,8.基金项目
国家自然科学基金项目(61673304) (61673304)
国家社科基金重大计划项目(11&ZD189). (11&ZD189)