SAFENet: Semantic-Aware Feature Enhancement Network for unsupervised cross-domain road scene segmentation

被引:0
|
作者
Ren, Dexin [1 ,2 ]
Li, Minxian [1 ,2 ]
Wang, Shidong [3 ]
Ren, Mingwu [1 ,2 ]
Zhang, Haofeng [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, State Key Lab Intelligent Mfg Adv Construct Machin, Nanjing 210094, Peoples R China
[3] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, England
基金
中国国家自然科学基金;
关键词
Unsupervised domain adaptation; Semantic segmentation; Semantic-Aware Feature Enhancement; Adaptive instance normalization; Knowledge transfer;
D O I
10.1016/j.imavis.2024.105318
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised cross-domain road scene segmentation has attracted substantial interest because of its capability to perform segmentation on new and unlabeled domains, thereby reducing the dependence on expensive manual annotations. This is achieved by leveraging networks trained on labeled source domains to classify images on unlabeled target domains. Conventional techniques usually use adversarial networks to align inputs from the source and the target in either of their domains. However, these approaches often fall short in effectively integrating information from both domains due to Alignment in each space usually leads to bias problems during feature learning. To overcome these limitations and enhance cross-domain interaction while mitigating overfitting to the source domain, we introduce a novel framework called Semantic-Aware Feature Enhancement Network (SAFENet) for Unsupervised Cross-domain Road Scene Segmentation. SAFENet incorporates the Semantic-Aware Enhancement (SAE) module to amplify the importance of class information in segmentation tasks and uses the semantic space as anew domain to guide the alignment of the source and target domains. Additionally, we integrate Adaptive Instance Normalization with Momentum (AdaINM) techniques, which convert the source domain image style to the target domain image style, thereby reducing the adverse effects of source domain overfitting on target domain segmentation performance. Moreover, SAFENet employs a Knowledge Transfer (KT) module to optimize network architecture, enhancing computational efficiency during testing while maintaining the robust inference capabilities developed during training. To further improve the segmentation performance, we further employ Curriculum Learning, a self- training mechanism that uses pseudo-labels derived from the target domain to iteratively refine the network. Comprehensive experiments on three well-known datasets, "Synthia -> Cityscapes"and "GTA5 -> Cityscapes", demonstrate the superior performance of our method. In-depth examinations and ablation studies verify the efficacy of each module within the proposed method.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Semantic-Aware Trajectory Compression with Urban Road Network
    Ta, Na
    Li, Guoliang
    Chen, Bole
    Feng, Jianhua
    WEB-AGE INFORMATION MANAGEMENT, PT I, 2016, 9658 : 124 - 136
  • [22] ASGSA: global semantic-aware network for action segmentation
    Bian Q.
    Zhang C.
    Ren K.
    Yue T.
    Zhang Y.
    Neural Computing and Applications, 2024, 36 (22) : 13629 - 13645
  • [23] Semantic-Aware Dehazing Network With Adaptive Feature Fusion
    Zhang, Shengdong
    Ren, Wenqi
    Tan, Xin
    Wang, Zhi-Jie
    Liu, Yong
    Zhang, Jingang
    Zhang, Xiaoqin
    Cao, Xiaochun
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (01) : 454 - 467
  • [24] Unsupervised domain adaptation alignment method for cross-domain semantic segmentation of remote sensing images
    Shen Z.
    Ni H.
    Guan H.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2023, 52 (12): : 1 - 2
  • [25] Semantic-Aware Generative Adversarial Nets for Unsupervised Domain Adaptation in Chest X-Ray Segmentation
    Chen, Cheng
    Dou, Qi
    Chen, Hao
    Heng, Pheng-Ann
    MACHINE LEARNING IN MEDICAL IMAGING: 9TH INTERNATIONAL WORKSHOP, MLMI 2018, 2018, 11046 : 143 - 151
  • [26] A Semantic-Aware Detail Adaptive Network for Image Enhancement
    Fan, Linlin
    Wei, Xuekai
    Zhou, Mingliang
    Yan, Jielu
    Pu, Huayan
    Luo, Jun
    Li, Zhengguo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1787 - 1800
  • [27] Towards Scene Understanding: Unsupervised Monocular Depth Estimation with Semantic-aware Representation
    Chen, Po-Yi
    Liu, Alexander H.
    Liu, Yen-Cheng
    Wang, Yu-Chiang Frank
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2619 - 2627
  • [28] A Cross-Domain Coupling Network for Semantic Segmentation of Remote Sensing Images
    Li, Xin
    Xu, Feng
    Tao, Feifei
    Tong, Yao
    Gao, Hongmin
    Liu, Fan
    Chen, Ziqi
    Lyu, Xin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [29] CAFA: Cross-Modal Attentive Feature Alignment for Cross-Domain Urban Scene Segmentation
    Liu, Peng
    Ge, Yanqi
    Duan, Lixin
    Li, Wen
    Lv, Fengmao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (10) : 11666 - 11675
  • [30] Single Cross-domain Semantic Guidance Network for Multimodal Unsupervised Image Translation
    Lan, Jiaying
    Cheng, Lianglun
    Huang, Guoheng
    Pun, Chi-Man
    Yuan, Xiaochen
    Lai, Shangyu
    Liu, HongRui
    Ling, Wing-Kuen
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 165 - 177