Generative Model-Based Testing on Decision-Making Policies

被引:4
|
作者
Li, Zhuo [1 ]
Wu, Xiongfei [1 ]
Zhu, Derui [2 ]
Cheng, Mingfei [3 ]
Chen, Siyuan [1 ]
Zhang, Fuyuan [1 ]
Xie, Xiaofei [3 ]
Ma, Lei [4 ,5 ]
Zhao, Jianjun [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
[2] Tech Univ Munich, Munich, Germany
[3] Singapore Management Univ, Singapore, Singapore
[4] Univ Tokyo, Tokyo, Japan
[5] Univ Alberta, Edmonton, AB, Canada
基金
加拿大自然科学与工程研究理事会; 新加坡国家研究基金会;
关键词
generative model; testing; decision-making policies; COMPREHENSIVE SURVEY; REINFORCEMENT; SYSTEMS; GO;
D O I
10.1109/ASE56229.2023.00153
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The reliability of decision-making policies is urgently important today as they have established the fundamentals of many critical applications, such as autonomous driving and robotics. To ensure reliability, there have been a number of research efforts on testing decision-making policies that solve Markov decision processes (MDPs). However, due to the deep neural network (DNN)-based inherit and infinite state space, developing scalable and effective testing frameworks for decision-making policies still remains open and challenging. In this paper, we present an effective testing framework for decision-making policies. The framework adopts a generative diffusion model-based test case generator that can easily adapt to different search spaces, ensuring the practicality and validity of test cases. Then, we propose a termination state novelty-based guidance to diversify agent behaviors and improve the test effectiveness. Finally, we evaluate the framework on five widely used benchmarks, including autonomous driving, aircraft collision avoidance, and gaming scenarios. The results demonstrate that our approach identifies more diverse and influential failure-triggering test cases compared to current state-of-the-art techniques. Moreover, we employ the detected failure cases to repair the evaluated models, achieving better robustness enhancement compared to the baseline method.
引用
收藏
页码:243 / 254
页数:12
相关论文
共 50 条
  • [42] TESTING A VOCATIONAL DECISION-MAKING MODEL IN AN EMPLOYMENT REHABILITATION SETTING
    PARKER, JL
    BULLETIN OF THE BRITISH PSYCHOLOGICAL SOCIETY, 1986, 39 : A62 - A63
  • [43] A generative joint model for spike trains and saccades during perceptual decision-making
    Cassey, Peter J.
    Gaut, Garren
    Steyvers, Mark
    Brown, Scott D.
    PSYCHONOMIC BULLETIN & REVIEW, 2016, 23 (06) : 1757 - 1778
  • [44] Testing the Expanded Sport Official's Decision-Making Model
    Kostrna, Jason
    Tenenbaum, Gershon
    JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2019, 41 : S73 - S74
  • [45] Human-Computer Interaction Design Testing Based on Decision-Making Process Model
    Huang, Baiqiao
    Zhang, Pengyi
    Wang, Chuan
    MAN-MACHINE-ENVIRONMENT SYSTEM ENGINEERING, MMESE 2018, 2019, 527 : 365 - 372
  • [46] Model-based decision making and model-free learning
    Drummond, Nicole
    Niv, Yael
    CURRENT BIOLOGY, 2020, 30 (15) : R860 - R865
  • [47] Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions
    Chen, Xiong-Hui
    Luo, Fan-Ming
    Yu, Yang
    Li, Qingyang
    Qin, Zhiwei
    Shang, Wenjie
    Ye, Jieping
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15260 - 15274
  • [48] Traffic Coordination at Road Intersections: Autonomous Decision-Making Algorithms Using Model-Based Heuristics
    de Campos, Gabriel Rodrigues
    Falcone, Paolo
    Hult, Robert
    Wymeersch, Henk
    Sjoberg, Jonas
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2017, 9 (01) : 8 - 21
  • [49] Model-based organizational decision making: A behavioral lens
    Luoma, Jukka
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 249 (03) : 816 - 826
  • [50] Model-based Decision Making with Imagination for Autonomous Parking
    Feng, Ziyue
    Chen, Shitao
    Chen, Yu
    Zheng, Nanning
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 2216 - 2223