Semantic Segmentation Method Based on Multiscale Feature Alignment and Aggregation

被引:1
|
作者
Xu Zhaozhong [1 ]
Peng Li [1 ,2 ]
Dai Feifei [3 ]
机构
[1] Jiangnan Univ, Sch IoT Engn, Engn Res Ctr Internet Things Technol Applicat, Wuxi 214122, Jiangsu, Peoples R China
[2] Wuxi Taihu Coll, Jiangsu Prov Internet Things Applicat Technol Key, Wuxi 214122, Jiangsu, Peoples R China
[3] Taizhou Prod Qual & Safety Monitoring Inst, Taizhou 318000, Zhejiang, Peoples R China
关键词
machine vision; image semantic segmentation; feature alignment; multiscale feature; attention mechanism;
D O I
10.3788/LOP212814
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
During semantic segmentation of images, a convolutional neural network easily misplaces the high-level features with low-level features after down-sampling and padding operations. To solve the mismatch problem between high- and low-level features and better aggregate the multiscale feature information, this paper proposes a semantic segmentation method with a multiscale feature alignment aggregation (MFAA) module. The MFAA module adopts a learnable interpolation strategy to learn pixel transform migration, thereby alleviating the feature-misalignment problem of feature aggregation at different scales. The module includes an attention mechanism that improves the decoder's ability to recover the important details. Using multiple MFAA modules, the semantic information of high-level features, and the spatial information of low-level features, this method aligns and aggregates the high- and low-level features to refine the semantic segmentation effect. The proposed network structure was validated on PASCAL VOC 2012. Using a ResNet- 50 backbone network, the mean intersection-over-union reached 78. 4% on the validation set. Experimentally, the proposed method achieved better evaluation indices than several mainstream segmentation methods and effectively improved the image segmentation effect.
引用
收藏
页数:8
相关论文
共 22 条
  • [1] Design of Augmented Reality Head-up Display System Based on Image Semantic Segmentation
    An Zhe
    Xu Xiping
    Yang Jinhua
    Qiao Yang
    Liu Yang
    [J]. ACTA OPTICA SINICA, 2018, 38 (07)
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] Chen LC, 2016, Arxiv, DOI arXiv:1412.7062
  • [4] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
  • [5] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [6] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [7] An Efficient Approach to Semantic Segmentation
    Csurka, Gabriela
    Perronnin, Florent
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 95 (02) : 198 - 212
  • [8] Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1026 - 1034
  • [9] Huang ZL, 2021, Arxiv, DOI arXiv:2003.00872
  • [10] Jaderberg M, 2015, ADV NEUR IN, V28