
间断Galerkin有限元隐式算法GPU并行化研究

高缓钦 陈红全 贾雪松 徐圣冠

空气动力学学报 (Acta Aerodynamica Sinica), 2024, Vol. 42, Issue 2: 21-33, 13. DOI: 10.7638/kqdlxxb-2023.0033

GPU-parallelized implicit discontinuous Galerkin finite element algorithm

高缓钦 1, 陈红全 1, 贾雪松 1, 徐圣冠 2

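Author information

  • 1. College of Aerospace Engineering, Nanjing University of Aeronautics and Astronautics (南京航空航天大学); Key Laboratory of Unsteady Aerodynamics and Flow Control, Ministry of Industry and Information Technology, Nanjing 210016
  • 2. College of Aerospace Engineering, Nanjing University of Aeronautics and Astronautics (南京航空航天大学); Key Laboratory of Unsteady Aerodynamics and Flow Control, Ministry of Industry and Information Technology, Nanjing 210016; School of Mechanical and Power Engineering, Nanjing Tech University (南京工业大学), Nanjing 211816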

Abstract

To improve the parallel efficiency of the discontinuous Galerkin (DG) finite element method, a graphics processing unit (GPU) parallelized implicit DG algorithm is developed for solving the Euler equations with additional artificial viscosity terms. The classic Roe scheme is adopted for the numerical flux in the spatial discretization, and the implicit lower-upper symmetric Gauss-Seidel (LU-SGS) scheme is selected for time marching. The traditional LU-SGS algorithm has an inherent data dependency that causes race conditions between threads and destabilizes the numerical computation. To resolve it, a coloring method is presented for arbitrary meshes; it organizes the computational elements into color groups by painting neighboring elements with different colors. Algebraic operations on the elements in the same color group are independent and can therefore be parallelized easily. Based on this coloring technique, the traditional LU-SGS algorithm is modified so that it can be parallelized by performing the calculations color by color. By taking advantage of the local compactness of the DG finite element method, a GPU-parallelized implicit DG algorithm based on the modified LU-SGS algorithm is then implemented under the compute unified device architecture (CUDA) programming model. The time-marching procedure, which is the most time-consuming part of the algorithm, is computed on the GPU. The computational task is split into a set of small tasks, and element-based kernels are designed for these tasks with corresponding thread hierarchies and data structures. The resulting algorithm is verified by a set of typical two- and three-dimensional flow test cases and by performance analysis, which show that GPU speedups can be achieved with the implicit algorithm and that the obtained solutions agree well with experimental data or other computed results reported in the literature.
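The element coloring idea in the abstract can be illustrated with a small sketch. The Python code below is not the authors' implementation: it uses a plain greedy coloring as a stand-in, and the function names `color_elements` and `group_by_color` and the six-element patch are hypothetical, chosen only for illustration. It shows the one property the abstract relies on: elements that share a face never receive the same color, so each color group could be processed by independent threads.

```python
from collections import defaultdict


def color_elements(neighbors):
    """Greedy coloring (an assumed stand-in for the paper's method):
    each element gets the smallest color not used by an already
    colored face neighbor."""
    colors = {}
    for elem in sorted(neighbors):
        used = {colors[n] for n in neighbors[elem] if n in colors}
        c = 0
        while c in used:
            c += 1
        colors[elem] = c
    return colors


def group_by_color(colors):
    """Collect element indices into one list per color, ordered by color."""
    groups = defaultdict(list)
    for elem, c in colors.items():
        groups[c].append(elem)
    return [groups[c] for c in sorted(groups)]


# Hypothetical 2 x 3 patch of quadrilateral elements, numbered row by row:
#   0 1 2
#   3 4 5
# Each entry lists the face neighbors of that element.
neighbors = {
    0: [1, 3],
    1: [0, 2, 4],
    2: [1, 5],
    3: [0, 4],
    4: [1, 3, 5],
    5: [2, 4],
}

colors = color_elements(neighbors)
groups = group_by_color(colors)

# A color-by-color sweep would visit the groups one after another;
# inside a group no element reads a neighbor of the same color.
for color, elems in enumerate(groups):
    print(f"color {color}: elements {elems}")
```

In a GPU setting, each pass over one color group would correspond to one kernel launch, with the forward and backward LU-SGS sweeps visiting the groups in opposite orders; that mapping is an assumption drawn from the abstract's description, not a detail it states.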

Key words

discontinuous Galerkin / implicit LU-SGS scheme / GPU-parallelized / coloring method / Euler equations

Classification

Aerospace

Cite this article

高缓钦, 陈红全, 贾雪松, 徐圣冠. 间断Galerkin有限元隐式算法GPU并行化研究[J]. 空气动力学学报, 2024, 42(2): 21-33, 13.

Funding

National Natural Science Foundation of China (11972189, 12102185)

空气动力学学报

OA / 北大核心 (Peking University Core Journals) / CSTPCD

ISSN: 0258-1825
