Robust Scale-Aware Stereo Matching Network

被引:5
|
作者
Okae J. [1 ]
Li B. [1 ]
Du J. [1 ]
Hu Y. [1 ]
机构
[1] School of Automation Science and Engineering, South China University of Technology, Guangzhou
来源
关键词
Computer stereo vision; deep learning; disparity maps fusion; multiscale processing; stereo matching;
D O I
10.1109/TAI.2021.3115401
中图分类号
学科分类号
摘要
Recently, deep convolutional neural networks (CNNs) have emerged as powerful tools for the correspondence problem in stereo matching task. However, the existence of multiscale objects and inevitable ill-conditioned regions, such as textureless regions, in real-world scene images continue to challenge current CNN architectures. In this article, we present a robust scale-aware stereo matching network, which aims to predict multiscale disparity maps and fuse them to achieve a more accurate disparity map. To this end, powerful feature representations are extracted from stereo images and are concatenated into a 4-D feature volume. The feature volume is then fed into a series of connected encoder-decoder cost aggregation structures for the construction of multiscale cost volumes. Following this, we regress multiscale disparity maps from the multiscale cost volumes and feed them into a fusion module to predict final disparity map. However, uncertainty estimations at each scale and complex disparity relationships among neighboring pixels pose a challenge on the disparity fusion. To overcome this challenge, we design a robust learning-based scale-aware disparity map fusion model, which seeks to map multiscale disparity maps onto the ground truth disparity map by leveraging their complementary strengths. Experimental results show that the proposed network is more robust and outperforms recent methods on standard stereo evaluation benchmarks. © 2020 IEEE.
引用
收藏
页码:244 / 253
页数:9
相关论文
共 50 条
  • [41] Scale-Aware RPN for Vehicle Detection
    Ding, Lu
    Wang, Yong
    Laganiere, Robert
    Luo, Xinbin
    Fu, Shan
    ADVANCES IN VISUAL COMPUTING, ISVC 2018, 2018, 11241 : 487 - 499
  • [42] Multi scale-aware attention for pyramid convolution network on finger vein recognition
    Zhang, Huijie
    Sun, Weizhen
    Lv, Ling
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [43] Real-time crowd counting via lightweight scale-aware network
    Zhu, Fushun
    Yan, Hua
    Chen, Xinyue
    Li, Tong
    NEUROCOMPUTING, 2022, 472 : 54 - 67
  • [44] Learning Multi-Modal Scale-Aware Attentions for Efficient and Robust Road Segmentation
    Zhou, Yunjiao
    Yang, Jianfei
    Cao, Haozhi
    Zeng, Zhaoyang
    Zou, Han
    Xie, Lihua
    UNMANNED SYSTEMS, 2024, 12 (02) : 201 - 213
  • [45] Counting in congested crowd scenes with hierarchical scale-aware encoder–decoder network
    Han, Run
    Qi, Ran
    Lu, Xuequan
    Huang, Lei
    Lyu, Lei
    Expert Systems with Applications, 2024, 238
  • [46] Attention to Scale: Scale-aware Semantic Image Segmentation
    Chen, Liang-Chieh
    Yang, Yi
    Wang, Jiang
    Xu, Wei
    Yuille, Alan L.
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649
  • [47] Scale-Aware Regional Collective Feature Enhancement Network for Scene Object Detection
    Li, Yiyao
    Liu, Jin
    Gao, Zhenyu
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6289 - 6310
  • [48] SSA Net: Small Scale-Aware Enhancement Network for Human Pose Estimation
    Li, Shaohua
    Zhang, Haixiang
    Ma, Hanjie
    Feng, Jie
    Jiang, Mingfeng
    SENSORS, 2023, 23 (17)
  • [49] ES-Net: Efficient Scale-Aware Network for Tiny Defect Detection
    Yu, Xuyi
    Lyu, Wentao
    Zhou, Di
    Wang, Chengqun
    Xu, Weiqiang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [50] Bidirectional scale-aware upsampling network for arbitrary-scale video super-resolution
    Luo, Laigan
    Yi, Benshun
    Wang, Zhongyuan
    He, Zheng
    Zhu, Chao
    IMAGE AND VISION COMPUTING, 2024, 148