Multi-scale Autoencoders in Autoencoder for Semantic Image Segmentation

被引:0
|
作者
Yusiong, John Paul T. [1 ,2 ]
Naval, Prospero C., Jr. [1 ]
机构
[1] Univ Philippines, Coll Engn, Dept Comp Sci, Comp Vis & Machine Intelligence Grp, Quezon City, Philippines
[2] Univ Philippines, Visayas Tacloban Coll, Div Nat Sci & Math, Tacloban City, Leyte, Philippines
关键词
Autoencoders in autoencoder; Semantic image segmentation; Stacked autoencoders;
D O I
10.1007/978-3-030-14799-0_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic image segmentation is essential for scene understanding. Several state-of-the-art deep learning-based approaches achieved remarkable results by increasing the network depth to improve performance. Using this principle, we introduce a novel encoder-decoder network architecture for semantic image segmentation of outdoor scenes called SAsiANet. SAsiANet utilizes multi-scale cascaded autoencoders at the decoder section of an autoencoder to achieve high accuracy pixel-wise prediction and involves exploiting features across multiple scales when upsampling the output of the encoder to obtain better spatial and contextual information effectively. The proposed network architecture is trained using the cross-entropy loss function but without incorporating any class balancing technique to the loss function. Our experimental results on two challenging outdoor scenes: the CamVid urban scenes dataset and the Freiburg forest dataset demonstrate that SAsiANet provides an effective way of producing accurate segmentation maps since it achieved state-of-the-art results on the test set of both datasets, 72.40% mIoU, and 89.90% mIoU, respectively.
引用
收藏
页码:587 / 599
页数:13
相关论文
共 50 条
  • [21] REPRESENTATION OF IMAGE CONTENT WITH MULTI-SCALE SEGMENTATION
    Zhang, Jing
    Zhao, Ya-Xin
    Li, Da
    Chen, Zhi-Hua
    Yuan, Yu-Bo
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 1552 - 1555
  • [22] Multi-Scale and Multi-Match for Few-Shot Plant Disease Image Semantic Segmentation
    Yang, Wenji
    Hu, Wenchao
    Xie, Liping
    Yang, Zhenji
    AGRONOMY-BASEL, 2022, 12 (11):
  • [23] Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation
    Pissas, Theodoros
    Ravasio, Claudio S.
    Da Cruz, Lyndon
    Bergeles, Christos
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 413 - 429
  • [24] Multi-scale Field Distillation for Multi-task Semantic Segmentation
    Dong, Aimei
    Liu, Sidi
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT II, 2023, 14255 : 508 - 519
  • [25] GMAlignNet: multi-scale lightweight brain tumor image segmentation with enhanced semantic information consistency
    Song, Jianli
    Lu, Xiaoqi
    Gu, Yu
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (11):
  • [26] Multi-scale Feature Extraction and Fusion Net: Research on UAVs Image Semantic Segmentation Technology
    Li X.
    Su D.
    Chang D.
    Liu J.
    Wang L.
    Tian Z.
    Wang S.
    Sun W.
    Journal of ICT Standardization, 2023, 11 (01): : 97 - 116
  • [27] SEMI-SUPERVISED MEDICAL IMAGE SEMANTIC SEGMENTATION WITH MULTI-SCALE GRAPH CUT LOSS
    Sun, Junxiao
    Zhang, Yan
    Zhu, Jian
    Wu, Jiasong
    Kong, Youyong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 624 - 628
  • [28] Multi-scale contextual semantic enhancement network for 3D medical image segmentation
    Xia, Tingjian
    Huang, Guoheng
    Pun, Chi-Man
    Zhang, Weiwen
    Li, Jiajian
    Ling, Wing-Kuen
    Lin, Chao
    Yang, Qi
    PHYSICS IN MEDICINE AND BIOLOGY, 2022, 67 (22):
  • [29] Remote sensing image semantic segmentation network based on multi-scale feature enhancement fusion
    Wang, Feiting
    Zhang, Yuan
    Hu, Qiongqiong
    Zhu, Yu
    GEOCARTO INTERNATIONAL, 2024, 39 (01)
  • [30] Boundary-Guided Lightweight Semantic Segmentation With Multi-Scale Semantic Context
    Zhou, Quan
    Wang, Linjie
    Gao, Guangwei
    Kang, Bin
    Ou, Weihua
    Lu, Huimin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7887 - 7900