Special Issue Paper
Graphics processing unit optimizations for the dynamics of the HIRLAM weather forecast model
Article first published online: 22 NOV 2012
Copyright © 2012 John Wiley & Sons, Ltd.
Concurrency and Computation: Practice and Experience
Special Issue: High performance computing and simulation: architectures, systems, algorithms, technologies, services, and applications
Volume 25, Issue 10, pages 1376–1393, July 2013
How to Cite
Vu, V.T., Cats, G. and Wolters, L. (2013), Graphics processing unit optimizations for the dynamics of the HIRLAM weather forecast model. Concurrency Computat.: Pract. Exper., 25: 1376–1393. doi: 10.1002/cpe.2951
- Issue published online: 19 JUN 2013
- Article first published online: 22 NOV 2012
- Manuscript Accepted: 2 OCT 2012
- Manuscript Revised: 17 SEP 2012
- Manuscript Received: 12 JAN 2011
- Numerical Weather Prediction model;
Programmable graphics processing units (GPUs) nowadays offer tremendous computational resources for diverse applications. In this paper, we present the implementation of the dynamics routine of the HIRLAM weather forecast model on the NVIDIA GTX 480. The original Fortran code has been converted manually to C and CUDA. Empirically, it is determined what the optimal number of grid points per thread is, and what the best thread and block structures are. A significant amount of the elapsed time consists of transferring data between CPU and GPU. To reduce the impact of these transfer costs, we overlap calculation and transfer of data using multiple CUDA streams. We developed an algorithm that enables our code generator CTADEL to generate automatically the optimal CUDA streams program. Experiments are performed to find out if the applicability of GPUs is useful for Numerical Weather Prediction, in particular for the dynamics part. Copyright © 2012 John Wiley & Sons, Ltd.