ENHANCING MULTI-STEP ACTION PREDICTION FOR ACTIVE OBJECT DETECTION

被引:3
|
作者
Fang, Fen [1 ]
Xu, Qianli [1 ]
Gauthier, Nicolas [1 ]
Li, Liyuan [1 ]
Lim, Joo-Hwee [1 ,2 ]
机构
[1] ASTAR, Inst Infocomm Res, Singapore, Singapore
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
关键词
active object detection; reinforcement learning; view planning; deep q-learning network (DQN);
D O I
10.1109/ICIP42928.2021.9506078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active vision for robots is one promising solution to open world visual detection problems. A fundamental issue is view planning, i.e., predicting next best views to capture images of interest to reduce uncertainty. While multi-step action in a reinforcement learning (RL) setup can boost the efficiency of view planning, existing methods suffer from unstable detection outcome when the Q-values of multiple branches of action advantages (i.e., action range and action type) are combined naively. To tackle this issue, we propose a novel mechanism to disentangle action range from action type through a two-stage training strategy on a deep Q-network. It combines well-crafted loss functions with respect to action range and action type to enforce separated training of these two branches. We evaluate our method on two public datasets and show that it facilitates substantial gain in view planning efficiency, while enhancing detection accuracy.
引用
收藏
页码:2189 / 2193
页数:5
相关论文
共 50 条
  • [1] Multi-step prediction method for robust object tracking
    Firouznia, Marjan
    Faez, Karim
    Amindavar, Hamidreza
    Koupaei, Javad Alikhani
    Pantano, Pietro
    Bilotta, Eleonora
    DIGITAL SIGNAL PROCESSING, 2017, 70 : 94 - 104
  • [2] An active object detection model with multi-step prediction based on deep q-learning network and innovative training algorithm
    Wang, Jianyu
    Zhu, Feng
    Wang, Qun
    Cui, Yunge
    Sun, Haibo
    Zhao, Pengfei
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [3] Enhancing a Multi-Step Discharge Prediction with Deep Learning and a Response Time Parameter
    Thaisiam, Wandee
    Saelo, Warintra
    Wongchaisuwat, Papis
    WATER, 2022, 14 (18)
  • [4] Multi-step LSTM Prediction Model for Visibility Prediction
    Meng, Yunlong
    Qi, Fengliang
    Zuo, Heng
    Chen, Bo
    Yuan, Xian
    Xiao, Yao
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] Multi-step Prediction Algorithm for State Prediction Model
    Zhang, Zili
    Song, Hongwei
    SMART MATERIALS AND INTELLIGENT SYSTEMS, PTS 1 AND 2, 2011, 143-144 : 634 - 638
  • [6] Hybrid Multi-step Disfluency Detection
    Germesin, Sebastian
    Becker, Tilman
    Poller, Peter
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, PROCEEDINGS, 2008, 5237 : 185 - 195
  • [7] Multi-Step Ahead Prediction for Anomaly Detection of Geomagnetic Observation in HVDC Transmission
    Cai, Yin
    An, Zhaoliang
    Si, Guannan
    Chen, Jun
    Meng, Miaomiao
    Li, Shiying
    IEEE ACCESS, 2023, 11 : 145566 - 145578
  • [8] Enhancing multi-step quantum state tomography by PhaseLift
    Lu, Yiping
    Zhao, Qing
    ANNALS OF PHYSICS, 2017, 384 : 198 - 210
  • [9] Multi-step Ahead Visual Trajectory Prediction for Object Tracking using Echo State Networks
    Manibardo, Eric L.
    Lana, Ibai
    Del Ser, Javier
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 4782 - 4789
  • [10] A systematic survey on multi-step attack detection
    Navarro, Julio
    Deruyver, Aline
    Parrend, Pierre
    COMPUTERS & SECURITY, 2018, 76 : 214 - 249