Multi-Scale Binocular Stereo Matching Based on Semantic Association

被引:0
|
作者
Jin ZHENG [1 ,2 ]
Botao JIANG [2 ]
Wei PENG [2 ]
Qiaohui ZHANG [2 ]
机构
[1] State Key Laboratory of Virtual Reality Techonology and Systems, Beihang University
[2] School of Computer Science and Engineering, Beihang
关键词
D O I
暂无
中图分类号
TP391.41 []; TP18 [人工智能理论];
学科分类号
080203 ; 081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the low accuracy of existing binocular stereo matching and depth estimation methods, this paper proposes a multi-scale binocular stereo matching network based on semantic association. A semantic association module is designed to construct the contextual semantic association relationship among the pixels through semantic category and attention mechanism. The disparity of those regions where the disparity is easily estimated can be used to assist the disparity estimation of relatively difficult regions, so as to improve the accuracy of disparity estimation of the whole image. Simultaneously, a multi-scale cost volume computation module is proposed. Unlike the existing methods, which use a single cost volume, the proposed multi-scale cost volume computation module designs multiple cost volumes for features of different scales. The semantic association feature and multi-scale cost volume are aggregated, which fuses the high-level semantic information and the low-level local detailed information to enhance the feature representation for accurate stereo matching. We demonstrate the effectiveness of the proposed solutions on the KITTI2015 binocular stereo matching dataset, and our model achieves comparable or higher matching performance, compared to other seven classic binocular stereo matching algorithms.
引用
收藏
页码:1010 / 1022
页数:13
相关论文
共 50 条
  • [31] STEREO MATCHING BETWEEN IMAGES OF LARGE SLOPE BASED ON MULTI-SCALE DIRECTIONAL WAVELET TRANSFORM
    Xi-an, Zhao
    Zhi-xue, Chen
    Guang, Zhu
    Jing-guo, Lv
    Chang-feng, Jing
    2010 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2010, : 499 - 501
  • [32] Multi-scale 3D scene flow from binocular stereo sequences
    Li, Rui
    Sclaroff, Stan
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (01) : 75 - 90
  • [33] Cascaded Multi-scale and Multi-dimension Convolutional Neural Network for Stereo Matching
    Lu, Haihua
    Xu, Hai
    Zhang, Li
    Ma, Yanbo
    Zhao, Yong
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [34] Algorithm of binocular stereo matching based on AEDNet
    Yang, Ge
    Liao, Yuting
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (03): : 24 - 28
  • [35] MSDC-Net: Multi-Scale Dense and Contextual Networks for Stereo Matching
    Rao, Zhibo
    He, Mingyi
    Dai, Yuchao
    Zhu, Zhidong
    Li, Bo
    He, Renjie
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 578 - 583
  • [36] Multi-scale inputs and context-aware aggregation network for stereo matching
    Shi, Liqing
    Xiong, Taiping
    Cui, Gengshen
    Pan, Minghua
    Cheng, Nuo
    Wu, Xiangjie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 75171 - 75194
  • [37] Lightweight multi-scale convolutional neural network for real time stereo matching
    Xue, Yanbing
    Zhang, Doudou
    Li, Leida
    Li, Shiyin
    Wang, Yuxin
    IMAGE AND VISION COMPUTING, 2022, 124
  • [38] Stereo Matching Using Multi-Level Cost volume and Multi-Scale Feature Constancy
    Liang, Zhengfa
    Guo, Yulan
    Feng, Yiliu
    Chen, Wei
    Qiao, Linbo
    Zhou, Li
    Zhang, Jianfeng
    Liu, Hengzhu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 300 - 315
  • [39] Multi-Scale Keypoint Matching
    Lotfian, Sina
    Foroosh, Hassan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5168 - 5175
  • [40] Asynchronous Event-Based Binocular Stereo Matching
    Rogister, Paul
    Benosman, Ryad
    Ieng, Sio-Hoi
    Lichtsteiner, Patrick
    Delbruck, Tobi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (02) : 347 - 353