Automatic Network Architecture Search for RGB-D Semantic Segmentation

被引:2
|
作者
Wang, Wenna [1 ]
Zhuo, Tao [2 ]
Zhang, Xiuwei [1 ]
Sun, Mingjun [1 ]
Yin, Hanlin [1 ]
Xing, Yinghui [1 ]
Zhang, Yanning [1 ]
机构
[1] Northwestern Polytech Univ, Xian, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Shandong Artificial Intelligence Inst, Jinan, Peoples R China
基金
中国国家自然科学基金;
关键词
RGB-D semantic segmentation; NAS; grid-like network-level search space; hierarchical cell-level search space; search strategy;
D O I
10.1145/3581783.3612288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent RGB-D semantic segmentation networks are usually manually designed. However, due to limited human efforts and time costs, their performance might be inferior for complex scenarios. To address this issue, we propose the first Neural Architecture Search (NAS) method that designs the network automatically. Specifically, the target network consists of an encoder and a decoder. The encoder is designed with two independent branches, where each branch specializes in extracting features from RGB and depth images, respectively. The decoder fuses the features and generates the final segmentation result. Besides, for automatic network design, we design a grid-like network-level search space combined with a hierarchical cell-level search space. By further developing an effective gradient-based search strategy, the network structure with hierarchical cell architectures is discovered. Extensive results on two datasets show that the proposed method outperforms the state-of-the-art approaches, which achieves a mIoU score of 55.1% on the NYU-Depth v2 dataset and 50.3% on the SUN-RGBD dataset.
引用
收藏
页码:3777 / 3786
页数:10
相关论文
共 50 条
  • [1] RGB-D SEMANTIC SEGMENTATION: A REVIEW
    Hu, Yaosi
    Chen, Zhenzhong
    Lin, Weiyao
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [2] Semantic Progressive Guidance Network for RGB-D Mirror Segmentation
    Li, Chao
    Zhou, Wujie
    Zhou, Xi
    Yan, Weiqing
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2780 - 2784
  • [3] Cascaded Feature Network for Semantic Segmentation of RGB-D Images
    Lin, Di
    Chen, Guangyong
    Daniel Cohen-Or
    Heng, Pheng-Ann
    Huang, Hui
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1320 - 1328
  • [4] Cascaded Feature Network for Semantic Segmentation of RGB-D Images
    Lin, Di
    Chen, Guangyong
    Cohen-Or, Daniel
    Heng, Pheng-Ann
    Huang, Hui
    Proceedings of the IEEE International Conference on Computer Vision, 2017, 2017-October : 1320 - 1328
  • [5] Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
    Yang, Jun
    Bai, Lizhi
    Sun, Yaoru
    Tian, Chunqi
    Mao, Maoyu
    Wang, Guorun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1481 - 1492
  • [6] A Fusion Network for Semantic Segmentation Using RGB-D Data
    Yuan, Jiahui
    Zhang, Kun
    Xia, Yifan
    Qi, Lin
    Dong, Junyu
    NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [7] Correction to: Cascading context enhancement network for RGB-D semantic segmentation
    Xu Tang
    Zejun Zhang
    Yan Meng
    Jianxiao Xie
    Changbing Tang
    Weichuan Zhang
    Multimedia Tools and Applications, 2025, 84 (9) : 6005 - 6005
  • [8] Attention-based fusion network for RGB-D semantic segmentation
    Zhong, Li
    Guo, Chi
    Zhan, Jiao
    Deng, JingYi
    NEUROCOMPUTING, 2024, 608
  • [9] GANet: geometry-aware network for RGB-D semantic segmentation
    Tian, Chunqi
    Xu, Weirong
    Bai, Lizhi
    Yang, Jun
    Xu, Yanjun
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [10] DCANet: Differential convolution attention network for RGB-D semantic segmentation
    Bai, Lizhi
    Yang, Jun
    Tian, Chunqi
    Sun, Yaoru
    Mao, Maoyu
    Xu, Yanjun
    Xu, Weirong
    PATTERN RECOGNITION, 2025, 162