ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes

被引:167
|
作者
Chen, Yuhua [1 ]
Li, Wen [1 ]
Van Gool, Luc [1 ,2 ]
机构
[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[2] Katholieke Univ Leuven, ESAT PSI, VISICS, Leuven, Belgium
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploiting synthetic data to learn deep models has attracted increasing attention in recent years. However, the intrinsic domain difference between synthetic and real images usually causes a significant performance drop when applying the learned model to real world scenarios. This is mainly due to two reasons: 1) the model overfits to synthetic images, making the convolutional filters incompetent to extract informative representation for real images; 2) there is a distribution difference between synthetic and real data, which is also known as the domain adaptation problem. To this end, we propose a new reality oriented adaptation approach for urban scene semantic segmentation by learning from synthetic data. First, we propose a target guided distillation approach to learn the real image style, which is achieved by training the segmentation model to imitate a pretrained real style model using real images. Second, we further take advantage of the intrinsic spatial structure presented in urban scene images, and propose a spatial aware adaptation scheme to effectively align the distribution of two domains. These two modules can be readily integrated with existing state-of-the-art semantic segmentation networks to improve their generalizability when adapting from synthetic to real urban scenes. We evaluate the proposed method on Cityscapes dataset by adapting from GTAV and SYNTHIA datasets, where the results demonstrate the effectiveness of our method.
引用
收藏
页码:7892 / 7901
页数:10
相关论文
共 50 条
  • [41] Unsupervised Semantic Segmentation of Urban Scenes via Cross-Modal Distillation
    Vobecky, Antonin
    Hurych, David
    Simeoni, Oriane
    Gidaris, Spyros
    Bursuc, Andrei
    Perez, Patrick
    Sivic, Josef
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [42] JAUNet: A U-Shape Network with Jump Attention for Semantic Segmentation of Road Scenes
    Fan, Zhiyong
    Liu, Kailai
    Hou, Jianmin
    Yan, Fei
    Zang, Qiang
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [43] Sensor Fusion of Intensity and Depth Cues using the ChiNet for Semantic Segmentation of Road Scenes
    John, V
    Nithilan, M. K.
    Mita, S.
    Tehrani, H.
    Konishi, M.
    Ishimaru, K.
    Oishi, T.
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 585 - 590
  • [44] DenseASPP for Semantic Segmentation in Street Scenes
    Yang, Maoke
    Yu, Kun
    Zhang, Chi
    Li, Zhiwei
    Yang, Kuiyuan
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3684 - 3692
  • [45] Segmentation for Outdoor Urban Scenes
    Park, Jaehyun
    Choi, Sunglok
    Yu, Wonpil
    2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 745 - 748
  • [46] Evolutionary segmentation of road traffic scenes
    Park, SH
    Lee, JK
    Kim, HJ
    PROCEEDINGS OF 1997 IEEE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION (ICEC '97), 1997, : 397 - 400
  • [47] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Xuegang Hu
    Liyuan Jing
    Uroosa Sehar
    Applied Intelligence, 2022, 52 : 580 - 594
  • [48] ADAPTING SEMANTIC SEGMENTATION OF URBAN SCENES VIA MASK-AWARE GATED DISCRIMINATOR
    Lin, Yong-Xiang
    Tan, Daniel Stanley
    Cheng, Wen-Huang
    Hua, Kai-Lung
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 218 - 223
  • [49] BSDNet: Balanced Sample Distribution Network for Real-Time Semantic Segmentation of Road Scenes
    Ye, Lv
    Zeng, Jianxu
    Yang, Yue
    Chimaobi, Ashara Emmanuel
    Sekenya, Nyaradzo Mercy
    IEEE ACCESS, 2021, 9 : 84034 - 84044
  • [50] DRMNet: more efficient bilateral networks for real-time semantic segmentation of road scenes
    Zhang, Wenming
    Zhang, Shaotong
    Li, Yaqian
    Li, Haibin
    Song, Tao
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (06)