
Fault-tolerant Scheduling Method for Cloud Computing Based on Reliability Decomposition

Yin Chao (尹超), Shi Xuhua (史旭华)

Computer Engineering, 2026, Vol. 52, Issue 5: 396-403, 8. DOI: 10.19678/j.issn.1000-3428.0069472

Yin Chao 1, Shi Xuhua 1

Author information

  • 1. School of Information Science and Engineering, Ningbo University, Ningbo 315211, Zhejiang, China

Abstract

Workflow is a commonly adopted execution paradigm in cloud computing environments. Reliability is a crucial Quality of Service (QoS) metric in the execution of cloud workflow tasks. Currently, methods that can satisfy the reliability requirements of workflow computation while also optimizing both time and cost are scarce. Neural network-based algorithms require substantial time to search for optimized parameter models when handling large-scale workflows, and the decomposition strategies of existing reliability-based algorithms require further improvement. To address these issues, this paper proposes a reliability decomposition-based fault-tolerant scheduling method. This heuristic method consists of the following steps: calculating task-scheduling priorities, determining reliability allocation weights, performing an initial decomposition of the overall reliability requirement, and selecting Virtual Machines (VMs) for task replicas. The core of this method lies in the optimization of two strategies, namely reliability decomposition and VM selection. The reliability decomposition strategy is designed based on the computational size of workflow tasks and their predecessor-successor dependencies, while the VM selection strategy operates based on a weighted function that balances relative task completion time and execution cost. Experiments are conducted using various workflow types, scales, and reliability requirements. The results indicate that the proposed method satisfies the specified reliability requirements. Moreover, it demonstrates superior comprehensive performance in balancing completion time and cost, outperforming three baseline algorithms: QFEC, QEEC+, and C_GM. This paper provides new solutions and insights for research on reliability decomposition and fault-tolerant scheduling in cloud workflow execution.
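
The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract gives no formulas, so everything here is an assumption made for illustration. The sketch assumes that the overall requirement is split multiplicatively (per-task targets whose product equals the overall requirement), that a task succeeds if at least one of its replicas succeeds, and that VMs are ranked by a weighted sum of normalized time and cost. The names `decompose_reliability`, `replicated_reliability`, `select_vms`, and the parameter `alpha` are all hypothetical.

```python
import math


def decompose_reliability(r_req, weights):
    """Split an overall requirement r_req into per-task targets.

    Assumption: target_i = r_req ** (w_i / sum(w)), so the product of all
    targets equals r_req. A larger weight gives a task a looser target.
    """
    total = sum(weights)
    return [r_req ** (w / total) for w in weights]


def replicated_reliability(replica_reliabilities):
    """Reliability of a task that succeeds if at least one replica succeeds."""
    fail = 1.0
    for r in replica_reliabilities:
        fail *= 1.0 - r
    return 1.0 - fail


def select_vms(target, vms, alpha=0.5):
    """Greedily add replicas until the task's reliability target is met.

    vms: list of dicts with keys 'name', 'time', 'cost', 'reliability'.
    alpha (hypothetical) weights normalized time against normalized cost.
    Returns the chosen VMs, or None if the target cannot be reached.
    """
    t_max = max(vm["time"] for vm in vms)
    c_max = max(vm["cost"] for vm in vms)
    ranked = sorted(
        vms,
        key=lambda vm: alpha * vm["time"] / t_max
        + (1 - alpha) * vm["cost"] / c_max,
    )
    chosen = []
    for vm in ranked:
        chosen.append(vm)
        achieved = replicated_reliability([v["reliability"] for v in chosen])
        if achieved >= target:
            return chosen
    return None


# Example: three tasks with illustrative weights, then VM choice for one task.
targets = decompose_reliability(0.9, [1, 2, 1])
example_vms = [
    {"name": "a", "time": 10, "cost": 1.0, "reliability": 0.90},
    {"name": "b", "time": 4, "cost": 2.0, "reliability": 0.95},
    {"name": "c", "time": 8, "cost": 1.6, "reliability": 0.85},
]
chosen = select_vms(0.99, example_vms)
```

With these made-up numbers, no single VM reaches the 0.99 target, so the sketch adds a second replica. How the paper actually derives its weights from task size and predecessor-successor dependencies is not stated in the abstract and is not modeled here.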

Key words

cloud computing / workflow / fault-tolerance / reliability / scheduling

Classification

Information Technology and Security Science

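Cite this article

Yin Chao, Shi Xuhua. Fault-tolerant Scheduling Method for Cloud Computing Based on Reliability Decomposition[J]. Computer Engineering, 2026, 52(5): 396-403, 8.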

Funding

National Natural Science Foundation of China (61773225)

Ningbo Key Research and Development Program and "Open Competition" (揭榜挂帅) Project (2023Z067)

Computer Engineering

ISSN 1000-3428
