| 注册
首页|期刊导航|计算机工程|申威处理器上数据流运行时系统的设计与实现

申威处理器上数据流运行时系统的设计与实现

张鹏飞 陈俊仕 郑重 沈沛祺 安虹 许乐

计算机工程2023,Vol.49Issue(12):46-54,9.
计算机工程2023,Vol.49Issue(12):46-54,9.DOI:10.19678/j.issn.1000-3428.0066860

申威处理器上数据流运行时系统的设计与实现

Design and Implementation of Data-Flow Runtime System on Shenwei Processor

张鹏飞 1陈俊仕 1郑重 1沈沛祺 1安虹 1许乐1

作者信息

  • 1. 中国科学技术大学 计算机科学与技术学院,合肥 230026
  • 折叠

摘要

Abstract

The domestic Sunway heterogeneous many-core computing platform uses athread heterogeneous programming method.Exploring fine-grain parallelism in athread is challenging,as its synchronization method struggles to maintain an effective workload balancing among tasks and computing cores.The data-flow parallel programming model solves these problems well because of its natural parallelism and point-to-point synchronization.swTasklet,a data-flow runtime designed on Sunway,is developed based on the Codelet program execution model.It avoids using synchronous operation on the slave cores owing to its refined Codelet function mapping of Codelet machine model to master-slave kernel.The scheduling of tasks is completed by the master core,ensuring compatibility of the runtime and libraries.In order to evaluate the efficiency of swTasklet,NPB LU and vector-vector addition are used as case studies.Furthermore,when implemented in swTasklet,LU achieves an average speedup of 8,and vector-vector addition achieves an average speedup of 30.On a large scale,the swTasklet implementation of the LU program is 16%faster than athread version,and the vector-vector addition is twice as fast as athread version.

关键词

申威异构处理器/数据流运行时系统/Codelet程序执行模型/并行编程模型/众核加速

Key words

Shenwei heterogeneous processor/data-flow runtime system/Codelet program execution model/parallel programming model/many-core acceleration

分类

信息技术与安全科学

引用本文复制引用

张鹏飞,陈俊仕,郑重,沈沛祺,安虹,许乐..申威处理器上数据流运行时系统的设计与实现[J].计算机工程,2023,49(12):46-54,9.

基金项目

国家自然科学基金(62102389). (62102389)

计算机工程

OA北大核心CSCDCSTPCD

1000-3428

访问量0
|
下载量0
段落导航相关论文