Belief Tree Search for Active Object Recognition

被引:0
|
作者
Malmir, Mohsen [1 ]
Cottrell, Garrison W. [1 ]
机构
[1] Univ Calif San Diego, Comp Sci & Engn Dept, 9500 Gilman Dr, San Diego, CA 92093 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active Object Recognition (AOR) has been approached as an unsupervised learning problem, in which optimal trajectories for object inspection are not known and to be discovered by reducing label uncertainty or training with reinforcement learning. Such approaches suffer from local optima and have no guarantees of the quality of their solution. In this paper, we treat AOR as a Partially Observable Markov Decision Process (POMDP) and find near-optimal values and corresponding action-values of training data using Belief Tree Search (BTS) on the AOR belief Markov Decision Process (MDP). AOR then reduces to the problem of knowledge transfer from these action-values to the test set. We train a Long Short Term Memory (LSTM) network on these values to predict the best next action on the training set rollouts and experimentally show that our method generalizes well to explore novel objects and novel views of familiar objects with high accuracy. We compare this supervised scheme against guided policy search, and show that the LSTM network reaches higher recognition accuracy compared to the guided policy search and guided Neurally Fitted Q-iteration. We further look into optimizing the observation function to increase the total collected reward during active recognition. In AOR, the observation function is known only approximately. We derive a gradient-based update for the observation function to increase the total expected reward. We show that by optimizing the observation function and retraining the supervised LSTM network, the AOR performance on the test set improves significantly.
引用
收藏
页码:4276 / 4283
页数:8
相关论文
共 50 条
  • [1] Object recognition by belief propagation
    Lu, Tongwei
    Sang, Nong
    Liu, Jizhong
    Gao, Xiaoying
    OPTICAL ENGINEERING, 2008, 47 (07)
  • [2] Active End-Effector Pose Selection for Tactile Object Recognition through Monte Carlo Tree Search
    Zhang, Mabel M.
    Atanasov, Nikolay
    Daniilidis, Kostas
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3258 - 3265
  • [3] Active Object Search
    Wu, Jie
    Chen, Tianshui
    Huang, Lishan
    Wu, Hefeng
    Li, Guanbin
    Tian, Ling
    Lin, Liang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 973 - 981
  • [4] Unified Optimization for Multiple Active Object Recognition Tasks with Feature Decision Tree
    Sun, Haibo
    Zhu, Feng
    Hao, Yingming
    Fu, Shuangfei
    Kong, Yanzi
    Xu, Chenglong
    Wang, Jianyu
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 103 (02)
  • [5] Unified Optimization for Multiple Active Object Recognition Tasks with Feature Decision Tree
    Haibo Sun
    Feng Zhu
    Yingming Hao
    Shuangfei Fu
    Yanzi Kong
    Chenglong Xu
    Jianyu Wang
    Journal of Intelligent & Robotic Systems, 2021, 103
  • [6] Selective Search for Object Recognition
    Uijlings, J. R. R.
    van de Sande, K. E. A.
    Gevers, T.
    Smeulders, A. W. M.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 104 (02) : 154 - 171
  • [7] Selective Search for Object Recognition
    J. R. R. Uijlings
    K. E. A. van de Sande
    T. Gevers
    A. W. M. Smeulders
    International Journal of Computer Vision, 2013, 104 : 154 - 171
  • [8] Object Recognition Base on Deep Belief Network
    Zhang, Yajun
    Liu, Zongtian
    Zhou, Wen
    Zhang, Yalan
    2015 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE), 2015, : 268 - 273
  • [10] Transinformation for active object recognition
    Schiele, B
    Crowley, JL
    SIXTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, 1998, : 249 - 254