Generalizable Sequential Camera Pose Learning Using Surf Enhanced 3D CNN

被引：0

作者：

Elmoogy, Ahmed ^{[1
]}

Dong, Xiaodai ^{[1
]}

Lu, Tao ^{[1
]}

Westendorp, Robert ^{[2
]}

Reddy, Kishore ^{[2
]}

机构：

[1] Univ Victoria, Elect & Comp Engn, Victoria, BC, Canada

[2] Fortinet, Burnaby, BC, Canada

来源：

2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL) | 2020年

基金：

加拿大自然科学与工程研究理事会;

关键词：

D O I：

10.1109/VTC2020-Fall49728.2020.9348447

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Image based localization is a key block of visual simultaneous localization and mapping (SLAM) system where image data is used to localize the camera relative to an arbitrary reference frame. Although finding the location from one image or between two images is well studied in the literature, few works study the problem of finding the pose of multiple images in videos of different frame lengths. Here, we propose two different architectures to address this problem, one using a combination of 2D convolutional neural network (CNN) and recurrent neural networks (RNN) and the other using 3D CNN. We demonstrate that 3D CNN is better for pose estimation problem than CNN-RNN by visualizing the learned features per layer of both architectures and the accuracy performance. Further, instead of using RGB images as input to the networks, we use SURF descriptors to reduce the image dimension of 480x640x3 by more than 48 folds, making the training time much faster and the learning model less complex. Both architectures show competitive performance in comparison to the state of the art on indoor localization dataset with the ability to generalize to test scenes that are completely different from the training scenes.

引用

页数：6

共 50 条

[41] Feature Evaluation and Management for Camera Pose Tracking on 3D Models
Schumann, Martin
Hoppenheit, Jan
Mueller, Stefan
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 3, 2014, : 562 - 569
[42] 3D Face pose estimation and tracking from a monocular camera
Ji, Q
IMAGE AND VISION COMPUTING, 2002, 20 (07) : 499 - 511
[43] 3D face pose tracking from an uncalibrated monocular camera
Zhu, ZW
Ji, Q
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 400 - 403
[44] 3D Object-Camera and 3D Face-Camera Pose Estimation for Quadcopter Control: Application to Remote Labs
Khattar, Fawzi
Dornaika, Fadi
Larroque, Benoit
Luthon, Franck
ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2018, 2018, 11182 : 99 - 111
[45] Excavator 3D pose estimation using deep learning and hybrid datasets
Assadzadeh, Amin
Arashpour, Mehrdad
Li, Heng
Hosseini, Reza
Elghaish, Faris
Baduge, Shanaka
ADVANCED ENGINEERING INFORMATICS, 2023, 55
[46] Generalizable deep learning approach for 3D particle imaging using holographic microscopy (HM)
Kumar, M. shyam
Hong, Jiarong
OPTICS EXPRESS, 2024, 32 (27): : 48159 - 48173
[47] Empowering Efficient Spatio-Temporal Learning with a 3D CNN for Pose-Based Action Recognition
Ren, Ziliang
Xiao, Xiongjiang
Nie, Huabei
SENSORS, 2024, 24 (23)
[48] Learning to Refine 3D Human Pose Sequences
Mei, Jieru
Chen, Xingyu
Wang, Chunyu
Yuille, Alan
Lan, Xuguang
Zeng, Wenjun
2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 358 - 366
[49] Sequential 3D Human Pose Estimation Using Adaptive Point Cloud Sampling Strategy
Zhang, Zihao
Hu, Lei
Deng, Xiaoming
Xia, Shihong
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1330 - 1337
[50] 3D Human Pose Estimation With Adversarial Learning
Meng, Wenming
Hu, Tao
Shuai, Li
2019 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV), 2019, : 93 - 99

← 1 2 3 4 5 →