Belief Tree Search for Active Object Recognition

Cited by: 0

Authors:

Malmir, Mohsen [1]

Cottrell, Garrison W. [1]

Affiliations:

[1] Univ Calif San Diego, Comp Sci & Engn Dept, 9500 Gilman Dr, San Diego, CA 92093 USA

Source:

2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2017

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Active Object Recognition (AOR) has been approached as an unsupervised learning problem, in which optimal trajectories for object inspection are not known and must be discovered by reducing label uncertainty or by training with reinforcement learning. Such approaches suffer from local optima and offer no guarantees on the quality of their solutions. In this paper, we treat AOR as a Partially Observable Markov Decision Process (POMDP) and find near-optimal values and corresponding action-values of training data using Belief Tree Search (BTS) on the AOR belief Markov Decision Process (MDP). AOR then reduces to the problem of knowledge transfer from these action-values to the test set. We train a Long Short-Term Memory (LSTM) network on these values to predict the best next action on the training set rollouts and experimentally show that our method generalizes well to explore novel objects and novel views of familiar objects with high accuracy. We compare this supervised scheme against guided policy search, and show that the LSTM network reaches higher recognition accuracy than both guided policy search and guided Neurally Fitted Q-iteration. We further look into optimizing the observation function to increase the total collected reward during active recognition. In AOR, the observation function is known only approximately. We derive a gradient-based update for the observation function to increase the total expected reward. We show that by optimizing the observation function and retraining the supervised LSTM network, the AOR performance on the test set improves significantly.


Pages: 4276-4283

Page count: 8
