Performance Research of a Class of Stencil Applied in Many-Core NUMA Architecture

Gao Lingyun (高凌云), Gou Wenjin (勾文进), Liu Xiazhen (刘夏真), Yuan Wu (袁武), Zhang Jian (张鉴), Lu Zhonghua (陆忠华)

Frontiers of Data and Computing (数据与计算发展前沿), 2023, Vol. 5, Issue 6: 58-66 (9 pages). DOI: 10.11871/jfdc.issn.2096-742X.2023.06.006

Performance Research of a Class of Stencil Applied in Many-Core NUMA Architecture

Gao Lingyun [1], Gou Wenjin [2], Liu Xiazhen [3], Yuan Wu [1], Zhang Jian [1], Lu Zhonghua [1]

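Author information

  • 1. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083; University of Chinese Academy of Sciences, Beijing 100049
  • 2. Huawei Technologies Co., Ltd., Hangzhou, Zhejiang 310053
  • 3. Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083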

Abstract

[Application Background] Stencil computation is a typical algorithm pattern in scientific computing such as CFD (Computational Fluid Dynamics), and its memory access performance has attracted wide attention. Owing to its good scalability, the NUMA architecture is widely used in ARM-based processors, represented by the Kunpeng 920. [Methods] Performance analysis tools and benchmark programs are used to test the memory access and communication subsystems of the Kunpeng platform. Hot spot analysis and performance tests are carried out on CCFD V3.0, a typical stencil application, and a Roofline model is established. [Results] Relying on its many-core NUMA architecture, the Kunpeng 920 processor achieves better single-node floating-point performance, peak memory bandwidth, and communication latency than the Intel Xeon E5-2680v2 and another domestic processor. On a single node, CCFD V3.0 runs about 2 to 3 times as fast on the Kunpeng platform as on the Intel platform, and 1.5 to 2 times as fast as on the domestic processor. [Conclusions] Programs are easy to port to the ARM-based Kunpeng platform, and its NUMA architecture has advantages for memory-intensive applications such as stencil computations.

Key words

Stencil/Kunpeng 920/performance evaluation/CFD

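Cite this article

Gao Lingyun, Gou Wenjin, Liu Xiazhen, Yuan Wu, Zhang Jian, Lu Zhonghua. Performance Research of a Class of Stencil Applied in Many-Core NUMA Architecture [J]. Frontiers of Data and Computing, 2023, 5(6): 58-66.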

Funding

National Key R&D Program of China, project "Research and Development of a CAE Cloud Service Platform for Complex Equipment" (2020YFB1709500)

Frontiers of Data and Computing (数据与计算发展前沿)

OA · CSCD · CSTPCD

ISSN 2096-742X
