Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design

被引:5
|
作者
Regenwetter, Lyle [1 ]
Srivastava, Akash [2 ]
Gutfreund, Dan [2 ]
Ahmed, Faez [1 ]
机构
[1] MIT, Dept Mech Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT IBM Watson AI Lab, 314 Main St, Cambridge, MA 02142 USA
关键词
Generative Models; Artificial Intelligence; Evaluation Metrics; Design Automation; Machine Learning; DIVERSITY; OPTIMIZATION;
D O I
10.1016/j.cad.2023.103609
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep generative models such as Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), Diffusion Models, and Transformers, have shown great promise in a variety of applications, including image and speech synthesis, natural language processing, and drug discovery. However, when applied to engineering design problems, evaluating the performance of these models can be challenging, as traditional statistical metrics based on likelihood may not fully capture the requirements of engineering applications. This paper doubles as a review and practical guide to evaluation metrics for deep generative models (DGMs) in engineering design. We first summarize the well-accepted 'classic' evaluation metrics for deep generative models grounded in machine learning theory. Using case studies, we then highlight why these metrics seldom translate well to design problems but see frequent use due to the lack of established alternatives. Next, we curate a set of design-specific metrics which have been proposed across different research communities and can be used for evaluating deep generative models. These metrics focus on unique requirements in design and engineering, such as constraint satisfaction, functional performance, novelty, and conditioning. Throughout our discussion, we apply the metrics to models trained on simple-to-visualize 2-dimensional example problems. Finally, we evaluate four deep generative models on a bicycle frame design problem and structural topology generation problem. In particular, we showcase the use of proposed metrics to quantify performance target achievement, design novelty, and geometric constraints. We publicly release the code for the datasets, models, and metrics used throughout the paper at https://decode. mit.edu/projects/metrics/.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] De novo design with deep generative models based on 3D similarity scoring (vol 44, 116308, 2021)
    Papadopoulos, Kostas
    Giblin, Kathryn A.
    Janet, Jon Paul
    Patronov, Atanas
    Engkvist, Ola
    BIOORGANIC & MEDICINAL CHEMISTRY, 2021, 46
  • [22] Design Guidelines for Prompt Engineering Text-to-Image Generative Models
    Liu, Vivian
    Chilton, Lydia B.
    PROCEEDINGS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI' 22), 2022,
  • [23] Inverse design with deep generative models: next step in materials discovery
    Lu, Shuaihua
    Zhou, Qionghua
    Chen, Xinyu
    Song, Zhilong
    Wang, Jinlan
    NATIONAL SCIENCE REVIEW, 2022, 9 (08)
  • [24] Generating Stochastic Structural Planes Using Statistical Models and Generative Deep Learning Models: A Comparative Investigation
    Meng, Han
    Xu, Nengxiong
    Zhu, Yunfu
    Mei, Gang
    MATHEMATICS, 2024, 12 (16)
  • [25] Material transformers: deep learning language models for generative materials design
    Fu, Nihang
    Wei, Lai
    Song, Yuqi
    Li, Qinyang
    Xin, Rui
    Omee, Sadman Sadeed
    Dong, Rongzhi
    Siriwardane, Edirisuriya M. Dilanga
    Hu, Jianjun
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (01):
  • [26] Inverse design of nanoporous crystalline reticular materials with deep generative models
    Yao, Zhenpeng
    Sanchez-Lengeling, Benjamin
    Bobbitt, N. Scott
    Bucior, Benjamin J.
    Kumar, Sai Govind Hari
    Collins, Sean P.
    Burns, Thomas
    Woo, Tom K.
    Farha, Omar K.
    Snurr, Randall Q.
    Aspuru-Guzik, Alan
    NATURE MACHINE INTELLIGENCE, 2021, 3 (01) : 76 - 86
  • [27] Molecular design in drug discovery: a comprehensive review of deep generative models
    Cheng, Yu
    Gong, Yongshun
    Liu, Yuansheng
    Song, Bosheng
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [28] Inverse design with deep generative models: next step in materials discovery
    Shuaihua Lu
    Qionghua Zhou
    Xinyu Chen
    Zhilong Song
    Jinlan Wang
    NationalScienceReview, 2022, 9 (08) : 15 - 17
  • [29] DeepCAD: A Deep Generative Network for Computer-Aided Design Models
    Wu, Rundi
    Xiao, Chang
    Zheng, Changxi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6752 - 6762
  • [30] Generative Design of Inorganic Compounds Using Deep Diffusion Language Models
    Dong, Rongzhi
    Fu, Nihang
    Siriwardane, Edirisuriya M. D.
    Hu, Jianjun
    JOURNAL OF PHYSICAL CHEMISTRY A, 2024, 128 (29): : 5980 - 5989