Generalizable Sequential Camera Pose Learning Using Surf Enhanced 3D CNN

被引:0
|
作者
Elmoogy, Ahmed [1 ]
Dong, Xiaodai [1 ]
Lu, Tao [1 ]
Westendorp, Robert [2 ]
Reddy, Kishore [2 ]
机构
[1] Univ Victoria, Elect & Comp Engn, Victoria, BC, Canada
[2] Fortinet, Burnaby, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/VTC2020-Fall49728.2020.9348447
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image based localization is a key block of visual simultaneous localization and mapping (SLAM) system where image data is used to localize the camera relative to an arbitrary reference frame. Although finding the location from one image or between two images is well studied in the literature, few works study the problem of finding the pose of multiple images in videos of different frame lengths. Here, we propose two different architectures to address this problem, one using a combination of 2D convolutional neural network (CNN) and recurrent neural networks (RNN) and the other using 3D CNN. We demonstrate that 3D CNN is better for pose estimation problem than CNN-RNN by visualizing the learned features per layer of both architectures and the accuracy performance. Further, instead of using RGB images as input to the networks, we use SURF descriptors to reduce the image dimension of 480x640x3 by more than 48 folds, making the training time much faster and the learning model less complex. Both architectures show competitive performance in comparison to the state of the art on indoor localization dataset with the ability to generalize to test scenes that are completely different from the training scenes.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Feature Evaluation and Management for Camera Pose Tracking on 3D Models
    Schumann, Martin
    Hoppenheit, Jan
    Mueller, Stefan
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, : 562 - 569
  • [42] 3D Face pose estimation and tracking from a monocular camera
    Ji, Q
    IMAGE AND VISION COMPUTING, 2002, 20 (07) : 499 - 511
  • [43] 3D face pose tracking from an uncalibrated monocular camera
    Zhu, ZW
    Ji, Q
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 400 - 403
  • [44] 3D Object-Camera and 3D Face-Camera Pose Estimation for Quadcopter Control: Application to Remote Labs
    Khattar, Fawzi
    Dornaika, Fadi
    Larroque, Benoit
    Luthon, Franck
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2018, 2018, 11182 : 99 - 111
  • [45] Excavator 3D pose estimation using deep learning and hybrid datasets
    Assadzadeh, Amin
    Arashpour, Mehrdad
    Li, Heng
    Hosseini, Reza
    Elghaish, Faris
    Baduge, Shanaka
    ADVANCED ENGINEERING INFORMATICS, 2023, 55
  • [46] Generalizable deep learning approach for 3D particle imaging using holographic microscopy (HM)
    Kumar, M. shyam
    Hong, Jiarong
    OPTICS EXPRESS, 2024, 32 (27): : 48159 - 48173
  • [47] Empowering Efficient Spatio-Temporal Learning with a 3D CNN for Pose-Based Action Recognition
    Ren, Ziliang
    Xiao, Xiongjiang
    Nie, Huabei
    SENSORS, 2024, 24 (23)
  • [48] Learning to Refine 3D Human Pose Sequences
    Mei, Jieru
    Chen, Xingyu
    Wang, Chunyu
    Yuille, Alan
    Lan, Xuguang
    Zeng, Wenjun
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 358 - 366
  • [49] Sequential 3D Human Pose Estimation Using Adaptive Point Cloud Sampling Strategy
    Zhang, Zihao
    Hu, Lei
    Deng, Xiaoming
    Xia, Shihong
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1330 - 1337
  • [50] 3D Human Pose Estimation With Adversarial Learning
    Meng, Wenming
    Hu, Tao
    Shuai, Li
    2019 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV), 2019, : 93 - 99