计算机工程与科学2012,Vol.34Issue(8):147-153,7.DOI:10.3969/j.issn.1007-130X.2012.08.023
海洋环流数值模式POP的GPU并行优化
GPU Parallel Optimization of the Oceanic General Circulation Model POP
摘要
Abstract
POP is a global ocean circumfluence model, which is used widely to the ocean research and climate prediction. With the resolution increasing, the requirement of computing ability is geometrically increasing, which limits the development of the POP ocean model. Based on the analysis of the equations and numerical characteristics of the POP ocean model,this paper plants the POP ocean model to the GPU platform with the CUDA Fortran programming model,and adopts hybrid parallelism to implement coalesced access to the global memory, reduces the use of local memory,improves the data reusability with registers,and enlarges the code executing on the GPU to minimize the communication between CPU and GPU. Research and experiment show that the CUPOP running on one NVIDIA Tesla C2O7O card can a-chieve up to 8. 47 times and 1. 5 times respectively, compared with the serial program and six-MPI processes program running on the Intel Xeon X5675 CPU.关键词
CUDA GPU/parallel模式/GPUPOP/CUDA FortranKey words
CUDA GPU s parallel ocean program models GPUPOP/ CUDA Fortran分类
信息技术与安全科学引用本文复制引用
郭松,窦勇,雷元武..海洋环流数值模式POP的GPU并行优化[J].计算机工程与科学,2012,34(8):147-153,7.基金项目
自然科学基金杰青基金资助项目(61125201) (61125201)
教育部博士点基金资助项目(60911062) (60911062)