Saliency Driven Monocular Depth Estimation Based on Multi-scale Graph Convolutional Network

被引:0
|
作者
Wu, Dunquan [1 ]
Chen, Chenglizhao [1 ,2 ,3 ]
机构
[1] Qingdao Univ, Coll Comp Sci & Technol, Qingdao, Peoples R China
[2] Shandong Prov Key Lab Distributed Comp Software N, Jinan, Peoples R China
[3] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular Depth Estimation; Multi-scale Graph Convolutional Network; Saliency Detection;
D O I
10.1007/978-981-99-8546-3_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular depth estimation is a fundamental and crucial task in computer vision that enables scene understanding from a single image. This paper proposes a novel approach for saliency-driven monocular depth estimation based on a multi-scale Graph Convolutional Network (GCN). Our method utilizes saliency information to guide the depth estimation process and employs a multi-scale GCN to capture local and global contextual cues. The proposed framework constructs a graph structure using RGB images to represent the relationships between image regions. We designed a multi-scale feature fusion module called DS Fusion, by applying GCN at multiple scales, our method effectively integrates depth features and saliency features to predict accurate depth maps. Extensive experiments conducted on KITTI and NYU datasets demonstrate the superior performance of our approach compared to state-of-the-art techniques. Additionally, we perform indepth analysis of the network architecture and discuss the impact of saliency cues on depth estimation accuracy. Our proposed method showcases the potential of combining saliency information and GCN in monocular depth estimation, contributing to the progress of scene understanding and depth perception from a single image.
引用
收藏
页码:445 / 456
页数:12
相关论文
共 50 条
  • [41] Anisotropic Multi-Scale Graph Convolutional Network for Dense Shape Correspondence
    Farazi, Mohammad
    Zhu, Wenhui
    Yang, Zhangsihao
    Wang, Yalin
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 3145 - 3154
  • [42] Multi-scale Graph Convolutional Network for understanding human action in videos
    Wang, Houlin
    Zhang, Shihui
    Tian, Qing
    Wang, Lei
    Luo, Bingchun
    Han, Xueqiang
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [43] Multi-Scale Convolutional Neural Network for Temporal Knowledge Graph Completion
    Liu, Wei
    Wang, Peijie
    Zhang, Zhihui
    Liu, Qiong
    COGNITIVE COMPUTATION, 2023, 15 (03) : 1016 - 1022
  • [44] MULTI-SCALE GRAPH CONVOLUTIONAL INTERACTION NETWORK FOR SALIENT OBJECT DETECTION
    Che, Wenqi
    Sun, Luoyi
    Xie, Zhifeng
    Ding, Youdong
    Han, Kaili
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 679 - 683
  • [45] Multi-scale attention graph convolutional recurrent network for traffic forecasting
    Xiong, Liyan
    Hu, Zhuyi
    Yuan, Xinhua
    Ding, Weihua
    Huang, Xiaohui
    Lan, Yuanchun
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3277 - 3291
  • [46] Multi-Scale Convolutional Neural Network for Temporal Knowledge Graph Completion
    Wei Liu
    Peijie Wang
    Zhihui Zhang
    Qiong Liu
    Cognitive Computation, 2023, 15 : 1016 - 1022
  • [47] Multi-Scale Sparse Graph Convolutional Network For the Assessment of Parkinsonian Gait
    Guo, Rui
    Shao, Xiangxin
    Zhang, Chencheng
    Qian, Xiaohua
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1583 - 1594
  • [48] Dense monocular depth estimation for stereoscopic vision based on pyramid transformer and multi-scale feature fusion
    Xia, Zhongyi
    Wu, Tianzhao
    Wang, Zhuoyan
    Zhou, Man
    Wu, Boqi
    Chan, C. Y.
    Kong, Ling Bing
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [49] Dense monocular depth estimation for stereoscopic vision based on pyramid transformer and multi-scale feature fusion
    Zhongyi Xia
    Tianzhao Wu
    Zhuoyan Wang
    Man Zhou
    Boqi Wu
    C. Y. Chan
    Ling Bing Kong
    Scientific Reports, 14
  • [50] Multi-Scale Spatial Attention-Guided Monocular Depth Estimation With Semantic Enhancement
    Xu, Xianfa
    Chen, Zhe
    Yin, Fuliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8811 - 8822