Generative Model-Based Testing on Decision-Making Policies

被引:4
|
作者
Li, Zhuo [1 ]
Wu, Xiongfei [1 ]
Zhu, Derui [2 ]
Cheng, Mingfei [3 ]
Chen, Siyuan [1 ]
Zhang, Fuyuan [1 ]
Xie, Xiaofei [3 ]
Ma, Lei [4 ,5 ]
Zhao, Jianjun [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
[2] Tech Univ Munich, Munich, Germany
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Tokyo, Tokyo, Japan
[5] Univ Alberta, Edmonton, AB, Canada
基金
加拿大自然科学与工程研究理事会; 新加坡国家研究基金会;
关键词
generative model; testing; decision-making policies; COMPREHENSIVE SURVEY; REINFORCEMENT; SYSTEMS; GO;
D O I
10.1109/ASE56229.2023.00153
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reliability of decision-making policies is urgently important today as they have established the fundamentals of many critical applications, such as autonomous driving and robotics. To ensure reliability, there have been a number of research efforts on testing decision-making policies that solve Markov decision processes (MDPs). However, due to the deep neural network (DNN)-based inherit and infinite state space, developing scalable and effective testing frameworks for decision-making policies still remains open and challenging. In this paper, we present an effective testing framework for decision-making policies. The framework adopts a generative diffusion model-based test case generator that can easily adapt to different search spaces, ensuring the practicality and validity of test cases. Then, we propose a termination state novelty-based guidance to diversify agent behaviors and improve the test effectiveness. Finally, we evaluate the framework on five widely used benchmarks, including autonomous driving, aircraft collision avoidance, and gaming scenarios. The results demonstrate that our approach identifies more diverse and influential failure-triggering test cases compared to current state-of-the-art techniques. Moreover, we employ the detected failure cases to repair the evaluated models, achieving better robustness enhancement compared to the baseline method.
引用
收藏
页码:243 / 254
页数:12
相关论文
共 50 条
  • [31] A model-based sustainable productivity concept for the best decision-making in rough milling operations
    Pelayo, G. Urbikain
    Olvera-Trejo, D.
    Luo, M.
    Tang, K.
    de Lacalle, L. N. Lopez
    Elias-Zuniga, A.
    MEASUREMENT, 2021, 186 (186)
  • [32] The decision-making process in public policies
    Gonzalez Campo, Carlos Hernan
    Gomez Cardenas, Carlos Wladimir
    PROSPECTIVA, 2007, (12): : 75 - 104
  • [33] Generative Artificial Intelligence and Legal Decision-making
    Cardoso, Andre Guskow
    Chan, Elizabeth
    Quintao, Luisa
    Pereira, Cesar
    GLOBAL TRADE AND CUSTOMS JOURNAL, 2024, 19 (11-12): : 710 - 730
  • [34] A Multi-criteria Decision Making Framework for Real Time Model-Based Testing
    Abou Trab, Mohammad Saeed
    Alrouh, Bachar
    Counsell, Steve
    Hierons, Rob M.
    Ghinea, George
    TESTING - PRACTICE AND RESEARCH TECHNIQUES, 2010, 6303 : 194 - 197
  • [35] Investigating disorder-specific and transdiagnostic alterations in model-based and model-free decision-making
    Knolle, Franziska
    Sen, Pritha
    Culbreth, Adam
    Koch, Kathrin
    Schmitz-Koep, Benita
    Guersel, Deniz A.
    Wunderlich, Klaus
    Avram, Mihai
    Berberich, Goetz
    Sorg, Christian
    Brandl, Felix
    JOURNAL OF PSYCHIATRY & NEUROSCIENCE, 2024, 49 (06):
  • [36] DECISION-MAKING MODEL
    ARCHER, ER
    INDUSTRIAL ENGINEERING, 1975, 7 (04): : 27 - 29
  • [37] A Model of Decision-Making Based on Critical Thinking
    Ulucinar, Ufuk
    Aypay, Ahmet
    EGITIM VE BILIM-EDUCATION AND SCIENCE, 2016, 41 (185): : 251 - 268
  • [38] A Crime Decision-making Model Based on AHP
    Yan, Feixue
    Xia, Jing
    Shen, Guanqun
    Kang, Xusheng
    INTELLIGENT STRUCTURE AND VIBRATION CONTROL, PTS 1 AND 2, 2011, 50-51 : 885 - +
  • [39] Markup decision-making model based on ANN
    Yang, Lanrong
    Lu, Zhengding
    Zhang, Jinlong
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2002, 30 (05):
  • [40] A generative joint model for spike trains and saccades during perceptual decision-making
    Peter J. Cassey
    Garren Gaut
    Mark Steyvers
    Scott D. Brown
    Psychonomic Bulletin & Review, 2016, 23 : 1757 - 1778