Data-Driven Equation Discovery of a Cloud Cover Parameterization

被引:7
|
作者
Grundner, Arthur [1 ,2 ]
Beucler, Tom [3 ]
Gentine, Pierre [2 ]
Eyring, Veronika [1 ,4 ]
机构
[1] Deutsch Zentrum Luft & Raumfahrt eV DLR, Inst Phys Atmosphare, Oberpfaffenhofen, Germany
[2] Columbia Univ, Ctr Learning Earth Artificial Intelligence & Phys, New York, NY 10027 USA
[3] Univ Lausanne, Inst Earth Surface Dynam, Lausanne, Switzerland
[4] Univ Bremen, Inst Environm Phys IUP, Bremen, Germany
基金
欧洲研究理事会;
关键词
symbolic regression; cloud fraction; cloud cover; parameterization; Pareto frontier; RELATIVE-HUMIDITY; ICON-A; STRATOCUMULUS; SIMULATIONS; CLIMATE; MODEL;
D O I
10.1029/2023MS003763
中图分类号
P4 [大气科学(气象学)];
学科分类号
0706 ; 070601 ;
摘要
A promising method for improving the representation of clouds in climate models, and hence climate projections, is to develop machine learning-based parameterizations using output from global storm-resolving models. While neural networks (NNs) can achieve state-of-the-art performance within their training distribution, they can make unreliable predictions outside of it. Additionally, they often require post-hoc tools for interpretation. To avoid these limitations, we combine symbolic regression, sequential feature selection, and physical constraints in a hierarchical modeling framework. This framework allows us to discover new equations diagnosing cloud cover from coarse-grained variables of global storm-resolving model simulations. These analytical equations are interpretable by construction and easily transferable to other grids or climate models. Our best equation balances performance and complexity, achieving a performance comparable to that of NNs (R-2 = 0.94) while remaining simple (with only 11 trainable parameters). It reproduces cloud cover distributions more accurately than the Xu-Randall scheme across all cloud regimes (Hellinger distances < 0.09), and matches NNs in condensate-rich regimes. When applied and fine-tuned to the ERA5 reanalysis, the equation exhibits superior transferability to new data compared to all other optimal cloud cover schemes. Our findings demonstrate the effectiveness of symbolic regression in discovering interpretable, physically-consistent, and nonlinear equations to parameterize cloud cover.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] DATA-DRIVEN APPROACHES TO EMPIRICAL DISCOVERY
    LANGLEY, P
    ZYTKOW, JM
    ARTIFICIAL INTELLIGENCE, 1989, 40 (1-3) : 283 - 312
  • [22] Data-driven abductive discovery in mineralogy
    Hazen, Robert M.
    AMERICAN MINERALOGIST, 2014, 99 (11-12) : 2165 - 2170
  • [23] Data-driven discovery of intrinsic dynamics
    Floryan, Daniel
    Graham, Michael D. D.
    NATURE MACHINE INTELLIGENCE, 2022, 4 (12) : 1113 - 1120
  • [24] Data-driven drug discovery by AI
    Yamanishi, Yoshihiro
    CANCER SCIENCE, 2022, 113 : 1376 - 1376
  • [25] Data-driven discovery of intrinsic dynamics
    Daniel Floryan
    Michael D. Graham
    Nature Machine Intelligence, 2022, 4 : 1113 - 1120
  • [26] Data-driven discovery of causal interactions
    Saisai Ma
    Lin Liu
    Jiuyong Li
    Thuc Duy Le
    International Journal of Data Science and Analytics, 2019, 8 : 285 - 297
  • [27] Data-driven Discovery of Modified Kortewegde Vries Equation, Kdv–Burger Equation and Huxley Equation by Deep Learning
    Yuexing Bai
    Temuer Chaolu
    Sudao Bilige
    Neural Processing Letters, 2022, 54 : 1549 - 1563
  • [28] Data-Driven Discovery of Soil Moisture Flow Governing Equation: A Sparse Regression Framework
    Song, Wenxiang
    Shi, Liangsheng
    Wang, Lijun
    Wang, Yanling
    Hu, Xiaolong
    WATER RESOURCES RESEARCH, 2022, 58 (08)
  • [29] Discovery of the data-driven differential equation-based models of continuous metocean process
    Maslyaev, Mikhail
    Hvatov, Alexander
    8TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE ON COMPUTATIONAL SCIENCE, YSC2019, 2019, 156 : 367 - 376
  • [30] Data-driven Discovery of Modified Kortewegde Vries Equation, Kdv-Burger Equation and Huxley Equation by Deep Learning
    Bai, Yuexing
    Chaolu, Temuer
    Bilige, Sudao
    NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1549 - 1563