Covering Number for Efficient Heuristic-Based POMDP Planning

被引:0
|
作者
Zhang, Zongzhang [1 ]
Hsu, David [1 ]
Lee, Wee Sun [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 117417, Singapore
基金
新加坡国家研究基金会;
关键词
ROBOTIC TASKS; APPROXIMATIONS; UNCERTAINTY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The difficulty of POMDP planning depends on the size of the search space involved. Heuristics are often used to reduce the search space size and improve computational efficiency; however, there are few theoretical bounds on their effectiveness. In this paper, we use the covering number to characterize the size of the search space reachable under heuristics and connect the complexity of POMDP planning to the effectiveness of heuristics. With insights from the theoretical analysis, we have developed a practical POMDP algorithm, Packing-Guided Value Iteration (PGVI). Empirically, PGVI is competitive with the state-of-the-art point-based POMDP algorithms on 65 small benchmark problems and outperforms them on 4 larger problems.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Design of an Interactive Scheduling Heuristic-Based Application
    Duay, Edmond
    Gondraneos, Gene Mark
    Indino-Pineda, Karisha Ann
    Seva, Rosemary
    INDUSTRIAL ENGINEERING AND APPLICATIONS-EUROPE, ICIEA-EU 2024, 2024, 507 : 95 - 106
  • [32] Heuristic-based Blockchain Assignment: An Empirical Study
    Chen, Jianyu
    Gai, Keke
    Jiang, Peng
    Zhu, Liehuang
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 916 - 923
  • [33] An Efficient Heuristic-based Role Mapping Framework for Secure and Fair Collaboration in SaaS Cloud
    Ghosh, Nirnay
    Chatterjee, Debangshu
    Ghosh, Soumya K.
    2014 INTERNATIONAL CONFERENCE ON CLOUD AND AUTONOMIC COMPUTING (ICCAC 2014), 2014, : 227 - 236
  • [34] Heuristic-based backtracking relaxation for propositional satisfiability
    Bhalla, Ateet
    Lynce, Ines
    De Sousa, Jose T.
    Marques-Silva, Joao
    JOURNAL OF AUTOMATED REASONING, 2005, 35 (1-3) : 3 - 24
  • [35] Heuristic-Based Recommendation for Metamodel - OCL Coevolution
    Batot, Edouard
    Kessentini, Wael
    Sahraoui, Houari
    Famelis, Michalis
    2017 ACM/IEEE 20TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS (MODELS 2017), 2017, : 210 - 220
  • [36] Policy Generator (PG): A Heuristic-Based Fuzzer
    Felix, Alejandro
    Tappenden, Andrew F.
    Miller, James
    PROCEEDINGS OF THE 49TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS 2016), 2016, : 5535 - 5544
  • [37] Heuristic-based backtracking relaxation for propositional satisfiability
    Bhalla, Ateet
    Lynce, Inês
    De Sousa, José T.
    Marques-Silva, João
    Journal of Automated Reasoning, 2005, 35 (1-3): : 3 - 24
  • [38] Heuristic-Based Backtracking Relaxation for Propositional Satisfiability
    Ateet Bhalla
    Inês Lynce
    José T. de Sousa
    João Marques-Silva
    Journal of Automated Reasoning, 2005, 35 : 3 - 24
  • [39] Seismic active control by a heuristic-based algorithm
    Tang, Y
    ENGINEERING MECHANICS: PROCEEDINGS OF THE 11TH CONFERENCE, VOLS 1 AND 2, 1996, : 232 - 235
  • [40] Heuristic-based Automatic Online Proctoring System
    Raj, Vishnu R. S.
    Narayanan, Athi S.
    Bijlani, Kamla
    15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2015), 2015, : 458 - 459