Improved Lightweight Semantic Segmentation Algorithm Based on DeepLabv3+ Network

Cited by: 1

Authors:

Yao Yan [1]

Hu Likun [1]

Guo Jun [1]

Affiliations:

[1] Guangxi Univ, Sch Elect Engn, Nanning 530004, Guangxi, Peoples R China

Source:

LASER & OPTOELECTRONICS PROGRESS | 2022 / Vol. 59 / No. 04

Keywords:

image processing; DeepLabv3+ model; MobileNetv3; lightweight; atrous spatial pyramid pooling;

DOI:

10.3788/LOP202259.0410015

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Semantic segmentation models in deep learning have a large number of parameters and are time-consuming to run, which makes them unsuitable for deployment on mobile terminals. To solve this problem, a lightweight semantic segmentation algorithm based on an improved DeepLabv3+ network is proposed. First, MobileNetv3 replaces the backbone network of the original DeepLabv3+ semantic segmentation model for feature extraction, reducing model complexity and increasing running speed. Second, the standard convolution in the atrous spatial pyramid pooling module is replaced by depthwise separable convolution to improve the efficiency of model training. Finally, an attention mechanism module and the group normalization method are introduced to improve segmentation accuracy. The proposed algorithm achieves a mean intersection over union (mIoU) of 72.94% on the validation set of the Cityscapes semantic segmentation dataset. Experimental results show that, compared with common segmentation algorithms such as SegNet, Fast-SCNN, and ENet, the proposed algorithm improves the segmentation effect while reducing the number of model parameters.
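
To make the parameter saving from depthwise separable convolution concrete, the following is a minimal sketch that counts the weights of a standard convolution and of its depthwise separable counterpart. The kernel size and channel counts are hypothetical values chosen for illustration only; they are not taken from the paper, and the function names are likewise assumptions.

```python
def standard_conv_params(k, c_in, c_out):
    """Weights in a k x k standard convolution (bias omitted)."""
    return k * k * c_in * c_out


def separable_conv_params(k, c_in, c_out):
    """Weights in a depthwise k x k conv plus a 1 x 1 pointwise conv (bias omitted)."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


# Hypothetical layer size, chosen only for illustration (not from the paper).
k, c_in, c_out = 3, 256, 256
std = standard_conv_params(k, c_in, c_out)
sep = separable_conv_params(k, c_in, c_out)
print(std, sep, round(std / sep, 2))  # 589824 67840 8.69
```

Under these assumed sizes the separable layer needs roughly one ninth of the weights. The dilation rate used in atrous convolution does not change the count, since it only spaces out the kernel taps.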


Pages: 8

