Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design

被引:5
|
作者
Regenwetter, Lyle [1 ]
Srivastava, Akash [2 ]
Gutfreund, Dan [2 ]
Ahmed, Faez [1 ]
机构
[1] MIT, Dept Mech Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT IBM Watson AI Lab, 314 Main St, Cambridge, MA 02142 USA
关键词
Generative Models; Artificial Intelligence; Evaluation Metrics; Design Automation; Machine Learning; DIVERSITY; OPTIMIZATION;
D O I
10.1016/j.cad.2023.103609
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep generative models such as Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), Diffusion Models, and Transformers, have shown great promise in a variety of applications, including image and speech synthesis, natural language processing, and drug discovery. However, when applied to engineering design problems, evaluating the performance of these models can be challenging, as traditional statistical metrics based on likelihood may not fully capture the requirements of engineering applications. This paper doubles as a review and practical guide to evaluation metrics for deep generative models (DGMs) in engineering design. We first summarize the well-accepted 'classic' evaluation metrics for deep generative models grounded in machine learning theory. Using case studies, we then highlight why these metrics seldom translate well to design problems but see frequent use due to the lack of established alternatives. Next, we curate a set of design-specific metrics which have been proposed across different research communities and can be used for evaluating deep generative models. These metrics focus on unique requirements in design and engineering, such as constraint satisfaction, functional performance, novelty, and conditioning. Throughout our discussion, we apply the metrics to models trained on simple-to-visualize 2-dimensional example problems. Finally, we evaluate four deep generative models on a bicycle frame design problem and structural topology generation problem. In particular, we showcase the use of proposed metrics to quantify performance target achievement, design novelty, and geometric constraints. We publicly release the code for the datasets, models, and metrics used throughout the paper at https://decode. mit.edu/projects/metrics/.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] A Design Reuse Oriented Feature Similarity Recognition Method for CAD Engineering Models
    Sun, Chang-Le
    Ning, Da-Yong
    Xiong, Wei
    Wang, Hai-Tao
    Pan, Ren-Yu
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015), 2015, : 311 - 320
  • [42] A scalable crystal representation for reverse engineering of novel inorganic materials using deep generative models
    Bajpai, Rochan
    Shukla, Atharva
    Kumar, Janish
    Tewari, Abhishek
    COMPUTATIONAL MATERIALS SCIENCE, 2023, 230
  • [43] HIERARCHICAL DEEP GENERATIVE MODELS FOR DESIGN UNDER FREE-FORM GEOMETRIC UNCERTAINTY
    Chen, Wei
    Lee, Doksoo
    Balogun, Oluwaseyi
    Chen, Wei
    PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 3B, 2022,
  • [44] Deep-learning generative models enable design of synthetic orthologs of a signaling protein
    Lian, Xinran
    Praljak, Niksa
    Ferguson, Andrew L.
    Ranganathan, Rama
    BIOPHYSICAL JOURNAL, 2023, 122 (03) : 311A - 311A
  • [45] The TrollLabs open hackathon dataset: Generative AI and large language models for prototyping in engineering design
    Ege, Daniel Nygard
    Ovrebo, Henrik H.
    Stubberud, Vegar
    Berg, Martin F.
    Elverum, Christer
    Steinert, Martin
    Vestad, Havard
    DATA IN BRIEF, 2024, 54
  • [46] To P or not to P, is that the question? Rethinking experimental design and data analysis to improve biological significance beyond the statistical significance
    Neppelenbroek, Karin Hermana
    Honorio, Heitor Marques
    Garlet, Gustavo Pompermaier
    JOURNAL OF APPLIED ORAL SCIENCE, 2019, 27
  • [47] Design and Synthesis of DDR1 Inhibitors with a Desired Pharmacophore Using Deep Generative Models
    Yoshimori, Atsushi
    Asawa, Yasunobu
    Kawasaki, Enzo
    Tasaka, Tomohiko
    Matsuda, Seiji
    Sekikawa, Toru
    Tanabe, Satoshi
    Neya, Masahiro
    Natsugari, Hideaki
    Kanai, Chisato
    CHEMMEDCHEM, 2021, 16 (06) : 955 - 958
  • [48] Physics-informed geometric operators to support surrogate, dimension reduction and generative models for engineering design
    Khan, Shahroz
    Masood, Zahid
    Usama, Muhammad
    Kostas, Konstantinos
    Kaklis, Panagiotis
    Chen, Wei
    ADVANCED ENGINEERING INFORMATICS, 2025, 63
  • [49] DESIGN TARGET ACHIEVEMENT INDEX: A DIFFERENTIABLE METRIC TO ENHANCE DEEP GENERATIVE MODELS IN MULTI-OBJECTIVE INVERSE DESIGN
    Regenwetter, Lyle
    Ahmed, Faez
    PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 3B, 2022,
  • [50] Generation Method for HVAC Systems Design Schemes in Office Buildings Based on Deep Graph Generative Models
    Wang, Hongxin
    Jin, Ruiying
    Xu, Peng
    Gu, Jiefan
    BUILDINGS, 2024, 14 (11)