Dynamic Attention-based Visual Odometry

Cited by: 20
Authors
Kuo, Xin-Yu [1 ]
Liu, Chien [1 ]
Lin, Kai-Chen [1 ]
Lee, Chun-Yi [1 ]
Affiliations
[1] Natl Tsing Hua Univ, Dept Comp Sci, Elsa Lab, Hsinchu, Taiwan
DOI
10.1109/CVPRW50498.2020.00026
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a dynamic attention-based visual odometry framework (DAVO), a learning-based VO method, for estimating the ego-motion of a monocular camera. DAVO dynamically adjusts the attention weights on different semantic categories for different motion scenarios based on optical flow maps. These weighted semantic categories can then be used to generate attention maps that highlight the relative importance of different semantic regions in input frames for pose estimation. In order to examine the proposed DAVO, we perform a number of experiments on the KITTI Visual Odometry and SLAM benchmark suite to quantitatively and qualitatively inspect the impacts of the dynamically adjusted weights on the accuracy of the evaluated trajectories. Moreover, we design a set of ablation analyses to justify each of our design choices, and validate the effectiveness as well as the advantages of DAVO. Our experiments on the KITTI dataset show that the proposed DAVO framework provides satisfactory performance in ego-motion estimation, and is able to deliver competitive performance when compared to contemporary VO methods.
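The abstract's idea of turning per-category weights into an attention map can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `attention_map` and `apply_attention`, the label layout, and the example weights are all hypothetical, and the weights are simply given as input here, whereas the abstract says DAVO adjusts them based on optical flow maps.

```python
def attention_map(seg_labels, category_weights):
    """Look up each pixel's category weight to form a per-pixel attention map.

    seg_labels: H x W nested list of integer category ids (hypothetical layout).
    category_weights: list with one weight per category id (assumed given).
    """
    num_categories = len(category_weights)
    rows = []
    for row in seg_labels:
        for c in row:
            if not 0 <= c < num_categories:
                raise ValueError(f"category id {c} has no weight")
        rows.append([float(category_weights[c]) for c in row])
    return rows


def apply_attention(frame, attn):
    """Scale every channel of every pixel by that pixel's attention value."""
    return [
        [[channel * a for channel in pixel] for pixel, a in zip(frame_row, attn_row)]
        for frame_row, attn_row in zip(frame, attn)
    ]


# Toy 2 x 3 segmentation with three made-up categories (ids 0, 1, 2).
seg = [[0, 0, 1], [2, 1, 1]]
weights = [0.2, 1.0, 0.5]  # illustrative values only, not from the paper
attn = attention_map(seg, weights)

# Toy 2 x 3 RGB frame of ones, so the weighted frame mirrors the attention map.
frame = [[[1.0, 1.0, 1.0] for _ in range(3)] for _ in range(2)]
weighted = apply_attention(frame, attn)
print(attn)
```

In this sketch, regions whose category carries a larger weight contribute more strongly to whatever pose estimator consumes the weighted frame; how DAVO actually computes and uses its attention maps is described in the paper itself.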
Pages: 160-169
Page count: 10