数据与计算发展前沿2025,Vol.7Issue(1):99-107,9.DOI:10.11871/jfdc.issn.2096-742X.2025.01.007
面向高性能计算环境的智能任务编排架构研究
Research on Intelligent Task Orchestration for High Performance Computing Environment
摘要
Abstract
[Objective]A large-scale scientific computing task often includes multiple computing jobs or a job group,and there are execution orders and dependencies between multiple computing jobs.Users need to wait for the previous job to complete before submitting the next one.In order to reduce the user waiting time,there is an urgent need for new ways of submitting jobs that al-lows users to submit multiple jobs with dependencies at the same time.[Methods]This paper proposes an intelligent task orchestration scheme for high-performance computing environ-ments,which can automatically resolve dependencies between jobs,intelligently orchestrate job submission sequences,monitor job status,and submit the subsequent job after the depend-ing job is completed.[Results]From the perspective of practical application effects,the intelli-gent task orchestration service can effectively simplify user operations.[Conclusions]The scheme proposed achieves a good application effect.关键词
高性能计算环境/作业组/作业依赖/智能任务编排Key words
high performance computing environment/job group/job dependency/intelligent task orchestration引用本文复制引用
吴璨,肖海力,王小宁,卢莎莎,和荣..面向高性能计算环境的智能任务编排架构研究[J].数据与计算发展前沿,2025,7(1):99-107,9.基金项目
国家重点研发计划(2023YFB3002302) (2023YFB3002302)
中国科学院计算机网络信息中心项目"面向国产异构超级计算机的智能任务编排架构研究" ()