Saliency Driven Monocular Depth Estimation Based on Multi-scale Graph Convolutional Network

被引:0
|
作者
Wu, Dunquan [1 ]
Chen, Chenglizhao [1 ,2 ,3 ]
机构
[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao, Peoples R China
[2] Shandong Prov Key Lab Distributed Comp Software N, Jinan, Peoples R China
[3] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular Depth Estimation; Multi-scale Graph Convolutional Network; Saliency Detection;
D O I
10.1007/978-981-99-8546-3_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular depth estimation is a fundamental and crucial task in computer vision that enables scene understanding from a single image. This paper proposes a novel approach for saliency-driven monocular depth estimation based on a multi-scale Graph Convolutional Network (GCN). Our method utilizes saliency information to guide the depth estimation process and employs a multi-scale GCN to capture local and global contextual cues. The proposed framework constructs a graph structure using RGB images to represent the relationships between image regions. We designed a multi-scale feature fusion module called DS Fusion, by applying GCN at multiple scales, our method effectively integrates depth features and saliency features to predict accurate depth maps. Extensive experiments conducted on KITTI and NYU datasets demonstrate the superior performance of our approach compared to state-of-the-art techniques. Additionally, we perform indepth analysis of the network architecture and discuss the impact of saliency cues on depth estimation accuracy. Our proposed method showcases the potential of combining saliency information and GCN in monocular depth estimation, contributing to the progress of scene understanding and depth perception from a single image.
引用
收藏
页码:445 / 456
页数:12
相关论文
共 50 条
  • [31] Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation
    Xu, Dan
    Ricci, Elisa
    Ouyang, Wanli
    Wang, Xiaogang
    Sebe, Nicu
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 161 - 169
  • [32] Promoting Monocular Depth Estimation by Multi-Scale Residual Laplacian Pyramid Fusion
    Zhang, Anmei
    Ma, Yunchao
    Liu, Jiangyu
    Sun, Jian
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 205 - 209
  • [33] Monocular Depth Estimation Algorithm Integrating Parallel Transformer and Multi-Scale Features
    Wang, Weiqiang
    Tan, Chao
    Yan, Yunbing
    ELECTRONICS, 2023, 12 (22)
  • [34] Depth-induced Multi-scale Recurrent Attention Network for Saliency Detection
    Piao, Yongri
    Ji, Wei
    Li, Jingjing
    Zhang, Miao
    Lu, Huchuan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7253 - 7262
  • [35] Multi-scale graph feature extraction network for panoramic image saliency detection
    Zhang, Ripei
    Chen, Chunyi
    Peng, Jun
    VISUAL COMPUTER, 2024, 40 (02): : 953 - 970
  • [36] Human Action Recognition Based on Multi-Scale Feature Augmented Graph Convolutional Network
    Lv, Wangyang
    Zhou, Yinghua
    6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 112 - 118
  • [37] Multi-scale graph feature extraction network for panoramic image saliency detection
    Ripei Zhang
    Chunyi Chen
    Jun Peng
    The Visual Computer, 2024, 40 (2) : 953 - 970
  • [38] Occluded Gait Emotion Recognition Based on Multi-Scale Suppression Graph Convolutional Network
    Zou, Yuxiang
    He, Ning
    Sun, Jiwu
    Huang, Xunrui
    Wang, Wenhua
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (01): : 1255 - 1276
  • [39] Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition
    Jang, Sungjun
    Lee, Heansung
    Kim, Woo Jin
    Lee, Jungho
    Woo, Sungmin
    Lee, Sangyoun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7244 - 7258
  • [40] Multi-scale graph diffusion convolutional network for multi-view learning
    Wang, Shiping
    Li, Jiacheng
    Chen, Yuhong
    Wu, Zhihao
    Huang, Aiping
    Zhang, Le
    ARTIFICIAL INTELLIGENCE REVIEW, 2025, 58 (06)