Saliency Driven Monocular Depth Estimation Based on Multi-scale Graph Convolutional Network

被引：0

作者：

Wu, Dunquan ^{[1
]}

Chen, Chenglizhao ^{[1
,2
,3
]}

机构：

[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao, Peoples R China

[2] Shandong Prov Key Lab Distributed Comp Software N, Jinan, Peoples R China

[3] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX | 2024年 / 14433卷

基金：

中国国家自然科学基金;

关键词：

Monocular Depth Estimation; Multi-scale Graph Convolutional Network; Saliency Detection;

D O I：

10.1007/978-981-99-8546-3_36

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Monocular depth estimation is a fundamental and crucial task in computer vision that enables scene understanding from a single image. This paper proposes a novel approach for saliency-driven monocular depth estimation based on a multi-scale Graph Convolutional Network (GCN). Our method utilizes saliency information to guide the depth estimation process and employs a multi-scale GCN to capture local and global contextual cues. The proposed framework constructs a graph structure using RGB images to represent the relationships between image regions. We designed a multi-scale feature fusion module called DS Fusion, by applying GCN at multiple scales, our method effectively integrates depth features and saliency features to predict accurate depth maps. Extensive experiments conducted on KITTI and NYU datasets demonstrate the superior performance of our approach compared to state-of-the-art techniques. Additionally, we perform indepth analysis of the network architecture and discuss the impact of saliency cues on depth estimation accuracy. Our proposed method showcases the potential of combining saliency information and GCN in monocular depth estimation, contributing to the progress of scene understanding and depth perception from a single image.

引用

页码：445 / 456

页数：12

共 50 条

[31] Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation
Xu, Dan
Ricci, Elisa
Ouyang, Wanli
Wang, Xiaogang
Sebe, Nicu
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 161 - 169
[32] Promoting Monocular Depth Estimation by Multi-Scale Residual Laplacian Pyramid Fusion
Zhang, Anmei
Ma, Yunchao
Liu, Jiangyu
Sun, Jian
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 205 - 209
[33] Monocular Depth Estimation Algorithm Integrating Parallel Transformer and Multi-Scale Features
Wang, Weiqiang
Tan, Chao
Yan, Yunbing
ELECTRONICS, 2023, 12 (22)
[34] Depth-induced Multi-scale Recurrent Attention Network for Saliency Detection
Piao, Yongri
Ji, Wei
Li, Jingjing
Zhang, Miao
Lu, Huchuan
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7253 - 7262
[35] Multi-scale graph feature extraction network for panoramic image saliency detection
Zhang, Ripei
Chen, Chunyi
Peng, Jun
VISUAL COMPUTER, 2024, 40 (02): : 953 - 970
[36] Human Action Recognition Based on Multi-Scale Feature Augmented Graph Convolutional Network
Lv, Wangyang
Zhou, Yinghua
6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 112 - 118
[37] Multi-scale graph feature extraction network for panoramic image saliency detection
Ripei Zhang
Chunyi Chen
Jun Peng
The Visual Computer, 2024, 40 (2) : 953 - 970
[38] Occluded Gait Emotion Recognition Based on Multi-Scale Suppression Graph Convolutional Network
Zou, Yuxiang
He, Ning
Sun, Jiwu
Huang, Xunrui
Wang, Wenhua
CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (01): : 1255 - 1276
[39] Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition
Jang, Sungjun
Lee, Heansung
Kim, Woo Jin
Lee, Jungho
Woo, Sungmin
Lee, Sangyoun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7244 - 7258
[40] Multi-scale graph diffusion convolutional network for multi-view learning
Wang, Shiping
Li, Jiacheng
Chen, Yuhong
Wu, Zhihao
Huang, Aiping
Zhang, Le
ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (06)

← 1 2 3 4 5 →