计算机工程2023,Vol.49Issue(12):46-54,9.DOI:10.19678/j.issn.1000-3428.0066860
申威处理器上数据流运行时系统的设计与实现
Design and Implementation of Data-Flow Runtime System on Shenwei Processor
摘要
Abstract
The domestic Sunway heterogeneous many-core computing platform uses athread heterogeneous programming method.Exploring fine-grain parallelism in athread is challenging,as its synchronization method struggles to maintain an effective workload balancing among tasks and computing cores.The data-flow parallel programming model solves these problems well because of its natural parallelism and point-to-point synchronization.swTasklet,a data-flow runtime designed on Sunway,is developed based on the Codelet program execution model.It avoids using synchronous operation on the slave cores owing to its refined Codelet function mapping of Codelet machine model to master-slave kernel.The scheduling of tasks is completed by the master core,ensuring compatibility of the runtime and libraries.In order to evaluate the efficiency of swTasklet,NPB LU and vector-vector addition are used as case studies.Furthermore,when implemented in swTasklet,LU achieves an average speedup of 8,and vector-vector addition achieves an average speedup of 30.On a large scale,the swTasklet implementation of the LU program is 16%faster than athread version,and the vector-vector addition is twice as fast as athread version.关键词
申威异构处理器/数据流运行时系统/Codelet程序执行模型/并行编程模型/众核加速Key words
Shenwei heterogeneous processor/data-flow runtime system/Codelet program execution model/parallel programming model/many-core acceleration分类
信息技术与安全科学引用本文复制引用
张鹏飞,陈俊仕,郑重,沈沛祺,安虹,许乐..申威处理器上数据流运行时系统的设计与实现[J].计算机工程,2023,49(12):46-54,9.基金项目
国家自然科学基金(62102389). (62102389)