Semantic Segmentation Method Based on Multiscale Feature Alignment and Aggregation

Cited by: 1

Authors:

Xu Zhaozhong [1]

Peng Li [1,2]

Dai Feifei [3]

Affiliations:

[1] Jiangnan Univ, Sch IoT Engn, Engn Res Ctr Internet Things Technol Applicat, Wuxi 214122, Jiangsu, Peoples R China

[2] Wuxi Taihu Coll, Jiangsu Prov Internet Things Applicat Technol Key, Wuxi 214122, Jiangsu, Peoples R China

[3] Taizhou Prod Qual & Safety Monitoring Inst, Taizhou 318000, Zhejiang, Peoples R China

Source:

LASER & OPTOELECTRONICS PROGRESS | 2023 / Vol. 60 / No. 02

Keywords:

machine vision; image semantic segmentation; feature alignment; multiscale feature; attention mechanism;

DOI:

10.3788/LOP212814

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

During semantic segmentation of images, down-sampling and padding operations in a convolutional neural network easily cause high-level features to become spatially misaligned with low-level features. To resolve this mismatch and better aggregate multiscale feature information, this paper proposes a semantic segmentation method built on a multiscale feature alignment aggregation (MFAA) module. The MFAA module adopts a learnable interpolation strategy that learns pixel transformation offsets, thereby alleviating feature misalignment when features at different scales are aggregated. The module also includes an attention mechanism that improves the decoder's ability to recover important details. Using multiple MFAA modules, the method aligns and aggregates the semantic information of high-level features with the spatial information of low-level features to refine the segmentation result. The proposed network was validated on PASCAL VOC 2012: with a ResNet-50 backbone, the mean intersection-over-union (mIoU) reached 78.4% on the validation set. In the experiments, the proposed method achieved better evaluation metrics than several mainstream segmentation methods and effectively improved segmentation quality.
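For readers unfamiliar with the evaluation metric quoted in the abstract, the following is a minimal sketch of mean intersection-over-union on flattened label lists. It is not the authors' evaluation code: the function name `mean_iou`, the toy labels, and the choice to average only over classes that actually occur are assumptions made for illustration. Benchmark evaluations typically accumulate counts over the whole validation set and may exclude unlabeled pixels, which this sketch does not do.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over classes that occur in pred or target (illustrative helper)."""
    inter = [0] * num_classes  # pixels where prediction and label agree on class c
    union = [0] * num_classes  # pixels where prediction or label is class c
    for p, t in zip(pred, target):
        if p == t:
            inter[p] += 1
            union[p] += 1
        else:
            union[p] += 1
            union[t] += 1
    # Skip classes absent from both prediction and label to avoid dividing by zero.
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious)


# Toy example with three classes and six pixels (hypothetical data).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")
```

In this toy case the per-class IoU values are 1/3, 2/3, and 1/2, so the mean is 0.5.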


Pages: 8
