ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes

被引:167
|
作者
Chen, Yuhua [1 ]
Li, Wen [1 ]
Van Gool, Luc [1 ,2 ]
机构
[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[2] Katholieke Univ Leuven, ESAT PSI, VISICS, Leuven, Belgium
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploiting synthetic data to learn deep models has attracted increasing attention in recent years. However, the intrinsic domain difference between synthetic and real images usually causes a significant performance drop when applying the learned model to real world scenarios. This is mainly due to two reasons: 1) the model overfits to synthetic images, making the convolutional filters incompetent to extract informative representation for real images; 2) there is a distribution difference between synthetic and real data, which is also known as the domain adaptation problem. To this end, we propose a new reality oriented adaptation approach for urban scene semantic segmentation by learning from synthetic data. First, we propose a target guided distillation approach to learn the real image style, which is achieved by training the segmentation model to imitate a pretrained real style model using real images. Second, we further take advantage of the intrinsic spatial structure presented in urban scene images, and propose a spatial aware adaptation scheme to effectively align the distribution of two domains. These two modules can be readily integrated with existing state-of-the-art semantic segmentation networks to improve their generalizability when adapting from synthetic to real urban scenes. We evaluate the proposed method on Cityscapes dataset by adapting from GTAV and SYNTHIA datasets, where the results demonstrate the effectiveness of our method.
引用
收藏
页码:7892 / 7901
页数:10
相关论文
共 50 条
  • [21] Robust semantic segmentation method of urban scenes in snowy environment
    Yin, Hanqi
    Yin, Guisheng
    Sun, Yiming
    Zhang, Liguo
    Tian, Ye
    MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [22] A Simple Weight Recall for Semantic Segmentation: Application to Urban Scenes
    Li, Xuhong
    Davoine, Franck
    Grandvalet, Yves
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1007 - 1012
  • [23] Unsupervised Domain Extension for Nighttime Semantic Segmentation in Urban Scenes
    Scherer, Sebastian
    Schoen, Robin
    Ludwig, Katja
    Lienhart, Rainer
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS (DELTA), 2021, : 38 - 47
  • [24] Semantic Segmentation of Urban Scenes Using Dense Depth Maps
    Zhang, Chenxi
    Wang, Liang
    Yang, Ruigang
    COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 : 708 - 721
  • [25] Semantic segmentation of urban scenes by learning local class interactions
    Volpi, Michele
    Ferrari, Vittorio
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [26] Failure Detection for Semantic Segmentation on Road Scenes Using Deep Learning
    Song, Junho
    Ahn, Woojin
    Park, Sangkyoo
    Lim, Myotaeg
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 22
  • [27] Deep Learning-Based Frameworks for Semantic Segmentation of Road Scenes
    Alokasi, Haneen
    Ahmad, Muhammad Bilal
    ELECTRONICS, 2022, 11 (12)
  • [28] An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation
    Yang, Jihan
    Xu, Ruijia
    Li, Ruiyu
    Qi, Xiaojuan
    Shen, Xiaoyong
    Li, Guanbin
    Lin, Liang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12613 - 12620
  • [29] ScribbleNet: Efficient interactive annotation of urban city scenes for semantic segmentation
    Sambaturu, Bhavani
    Gupta, Ashutosh
    Jawahar, C. V.
    Arora, Chetan
    PATTERN RECOGNITION, 2023, 133
  • [30] ScaleNet: Scale Invariant Network for Semantic Segmentation in Urban Driving Scenes
    Ansari, Mohammad Dawud
    Krauss, Stephan
    Wasenmueller, Oliver
    Stricker, Didier
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 399 - 404