Development of a GPU-based high-performance radiative transfer model for the Infrared Atmospheric Sounding Interferometer (IASI)

被引:48
|
作者
Huang, Bormin [1 ]
Mielikainen, Jarno [2 ]
Oh, Hyunjong [1 ]
Huang, Hung-Lung Allen [1 ]
机构
[1] Univ Wisconsin, Ctr Space Sci & Engn, Madison, WI 53706 USA
[2] Univ Eastern Finland, Dept Comp Sci, Kuopio, Finland
基金
芬兰科学院;
关键词
GPU; CUDA; Radiative transfer model; Infrared Atmospheric Sounding; Interferometer (IASI); ACCURATE TRANSMITTANCE MODEL; COMPUTATIONALLY FAST; ABSORBING GAS; PARALLEL; GRAPHICS;
D O I
10.1016/j.jcp.2010.09.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Satellite-observed radiance is a nonlinear functional of surface properties and atmospheric temperature and absorbing gas profiles as described by the radiative transfer equation (RTE). In the era of hyperspectral sounders with thousands of high-resolution channels, the computation of the radiative transfer model becomes more time-consuming. The radiative transfer model performance in operational numerical weather prediction systems still limits the number of channels we can use in hyperspectral sounders to only a few hundreds. To take the full advantage of such high-resolution infrared observations, a computationally efficient radiative transfer model is needed to facilitate satellite data assimilation. In recent years the programmable commodity graphics processing unit (GPU) has evolved into a highly parallel, multi-threaded, many-core processor with tremendous computational speed and very high memory bandwidth. The radiative transfer model is very suitable for the GPU implementation to take advantage of the hardware's efficiency and parallelism where radiances of many channels can be calculated in parallel in GPUs. In this paper, we develop a GPU-based high-performance radiative transfer model for the Infrared Atmospheric Sounding Interferometer (IASI) launched in 2006 onboard the first European meteorological polar-orbiting satellites, METOP-A. Each IASI spectrum has 8461 spectral channels. The IASI radiative transfer model consists of three modules. The first module for computing the regression predictors takes less than 0.004% of GPU time, while the second module for transmittance computation and the third module for radiance computation take approximately 92.5% and 7.5%, respectively. Our GPU-based IASI radiative transfer model is developed to run on a low-cost personal supercomputer with four GPUs with total 960 compute cores, delivering near 4 TFlops theoretical peak performance. By massively parallelizing the second and third modules, we reached 364x speedup for 1 GPU and 1455x speedup for all 4 GPUs, both with respect to the original GPU-based single-threaded Fortran code with the -O-2 compiling optimization. The significant 1455x speedup using a computer with four GPUs means that the proposed GPU-based high-performance forward model is able to compute one day's amount of 1,296,000 IASI spectra within nearly 10 min, whereas the original single GPU-based version will impractically take more than 10 days. This model runs over 80% of the theoretical memory bandwidth with asynchronous data transfer. A novel GPU-GPU pipeline implementation of the IASI radiative transfer model is proposed. The GPU-based high-performance IASI radiative transfer model is suitable for the assimilation of the IASI radiance observations into the operational numerical weather forecast model. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:2207 / 2221
页数:15
相关论文
共 42 条
  • [31] GPU-parallel performance of the community radiative transfer model (CRTM) with the optical depth in absorber space (ODAS)-based transmittance algorithm
    Mielikainen, Jarno
    Huang, Bormin
    Huang, Hung-Lung Allen
    Lee, Tsengdar
    HIGH-PERFORMANCE COMPUTING IN REMOTE SENSING II, 2012, 8539
  • [32] Tensor-CA: A high-performance cellular automata model for land use simulation based on vectorization and GPU
    Zhuang, Haoming
    Liu, Xiaoping
    Liang, Xun
    Yan, Yuchao
    He, Jinqiang
    Cai, Yiling
    Wu, Changjiang
    Zhang, Xinchang
    Zhang, Honghui
    TRANSACTIONS IN GIS, 2022, 26 (02) : 755 - 778
  • [33] A High-Performance Transfer Learning-Based Model for Microwave Structure Behavior Prediction
    Ma, Jiteng
    Dang, Shuping
    Watkins, Gavin
    Morris, Kevin
    Beach, Mark
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (12) : 4394 - 4398
  • [34] High-performance GPU-based rendering for real-time, rigid 2D/3D-image registration and motion prediction in radiation oncology
    Spoerk, Jakob
    Gendrin, Christelle
    Weber, Christoph
    Figl, Michael
    Pawiro, Supriyanto Ardjo
    Furtado, Hugo
    Fabri, Daniella
    Bloch, Christoph
    Bergmann, Helmar
    Groeller, Eduard
    Birkfellner, Wolfgang
    ZEITSCHRIFT FUR MEDIZINISCHE PHYSIK, 2012, 22 (01): : 13 - 20
  • [35] Development of a High-Performance, FPGA-Based Virtual Anemometer for Model-Based MPPT of Wind Generators
    La Tona, Giuseppe
    Luna, Massimiliano
    Di Piazza, Maria Carmela
    Pucci, Marcello
    Accetta, Angelo
    ELECTRONICS, 2020, 9 (01)
  • [36] Cocrystal engineering:towards high-performance near-infrared organic phototransistors based on donor-acceptor charge transfer cocrystals
    Fei Li
    Lei Zheng
    Yajing Sun
    Shuyu Li
    Lingjie Sun
    Fangxu Yang
    Weibing Dong
    Xiaotao Zhang
    Wenping Hu
    Science China(Chemistry), 2023, 66 (01) : 266 - 272
  • [37] Cocrystal engineering: towards high-performance near-infrared organic phototransistors based on donor-acceptor charge transfer cocrystals
    Li, Fei
    Zheng, Lei
    Sun, Yajing
    Li, Shuyu
    Sun, Lingjie
    Yang, Fangxu
    Dong, Weibing
    Zhang, Xiaotao
    Hu, Wenping
    SCIENCE CHINA-CHEMISTRY, 2023, 66 (01) : 266 - 272
  • [38] Cocrystal engineering: towards high-performance near-infrared organic phototransistors based on donor-acceptor charge transfer cocrystals
    Fei Li
    Lei Zheng
    Yajing Sun
    Shuyu Li
    Lingjie Sun
    Fangxu Yang
    Weibing Dong
    Xiaotao Zhang
    Wenping Hu
    Science China Chemistry, 2023, 66 : 266 - 272
  • [39] High-performance deep transfer learning model with batch normalization based on multiscale feature fusion for tomato plant disease identification and categorization
    Ramya, R.
    Kumar, P.
    ENVIRONMENTAL RESEARCH COMMUNICATIONS, 2023, 5 (12):
  • [40] Development and Calibration of a Phenomenological Material Model for Steel-Fiber-Reinforced High-Performance Concrete Based on Unit Cell Calculations
    Pise, Mangesh
    Brands, Dominik
    Schroeder, Joerg
    MATERIALS, 2024, 17 (10)