Generative Model-Based Testing on Decision-Making Policies

Cited by: 4
Authors
Li, Zhuo [1]
Wu, Xiongfei [1]
Zhu, Derui [2]
Cheng, Mingfei [3]
Chen, Siyuan [1]
Zhang, Fuyuan [1]
Xie, Xiaofei [3]
Ma, Lei [4, 5]
Zhao, Jianjun [1]
Affiliations
[1] Kyushu Univ, Fukuoka, Japan
[2] Tech Univ Munich, Munich, Germany
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Tokyo, Tokyo, Japan
[5] Univ Alberta, Edmonton, AB, Canada
Funding
Natural Sciences and Engineering Research Council of Canada; National Research Foundation, Singapore;
Keywords
generative model; testing; decision-making policies; COMPREHENSIVE SURVEY; REINFORCEMENT; SYSTEMS; GO;
DOI
10.1109/ASE56229.2023.00153
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The reliability of decision-making policies is of urgent importance today, as they underpin many critical applications, such as autonomous driving and robotics. To ensure reliability, a number of research efforts have addressed the testing of decision-making policies that solve Markov decision processes (MDPs). However, because these policies are inherently deep neural network (DNN)-based and operate over infinite state spaces, developing scalable and effective testing frameworks for them remains an open challenge. In this paper, we present an effective testing framework for decision-making policies. The framework adopts a generative diffusion model-based test case generator that can easily adapt to different search spaces, ensuring the practicality and validity of test cases. We then propose termination state novelty-based guidance to diversify agent behaviors and improve test effectiveness. Finally, we evaluate the framework on five widely used benchmarks, including autonomous driving, aircraft collision avoidance, and gaming scenarios. The results demonstrate that our approach identifies more diverse and influential failure-triggering test cases than current state-of-the-art techniques. Moreover, we use the detected failure cases to repair the evaluated models, achieving a greater robustness improvement than the baseline method.
Pages: 243-254
Page count: 12