ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes

被引:167
|
作者
Chen, Yuhua [1 ]
Li, Wen [1 ]
Van Gool, Luc [1 ,2 ]
机构
[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[2] Katholieke Univ Leuven, ESAT PSI, VISICS, Leuven, Belgium
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00823
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploiting synthetic data to learn deep models has attracted increasing attention in recent years. However, the intrinsic domain difference between synthetic and real images usually causes a significant performance drop when applying the learned model to real world scenarios. This is mainly due to two reasons: 1) the model overfits to synthetic images, making the convolutional filters incompetent to extract informative representation for real images; 2) there is a distribution difference between synthetic and real data, which is also known as the domain adaptation problem. To this end, we propose a new reality oriented adaptation approach for urban scene semantic segmentation by learning from synthetic data. First, we propose a target guided distillation approach to learn the real image style, which is achieved by training the segmentation model to imitate a pretrained real style model using real images. Second, we further take advantage of the intrinsic spatial structure presented in urban scene images, and propose a spatial aware adaptation scheme to effectively align the distribution of two domains. These two modules can be readily integrated with existing state-of-the-art semantic segmentation networks to improve their generalizability when adapting from synthetic to real urban scenes. We evaluate the proposed method on Cityscapes dataset by adapting from GTAV and SYNTHIA datasets, where the results demonstrate the effectiveness of our method.
引用
收藏
页码:7892 / 7901
页数:10
相关论文
共 50 条
  • [31] UrbanLF: A Comprehensive Light Field Dataset for Semantic Segmentation of Urban Scenes
    Sheng, Hao
    Cong, Ruixuan
    Yang, Da
    Chen, Rongshan
    Wang, Sizhe
    Cui, Zhenglong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7880 - 7893
  • [32] Weakly supervised multi-class semantic video segmentation for road scenes
    Awan, Mehwish
    Shin, Jitae
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 230
  • [33] Parallel Complement Network for Real-Time Semantic Segmentation of Road Scenes
    Lv, Qingxuan
    Sun, Xin
    Chen, Changrui
    Dong, Junyu
    Zhou, Huiyu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 4432 - 4444
  • [34] Attribute-aware Semantic Segmentation of Road Scenes for Understanding Pedestrian Orientations
    Sulistiyo, M. D.
    Kawanishi, Y.
    Deguchi, D.
    Hirayama, T.
    Ide, I.
    Zheng, J. Y.
    Murase, H.
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 2698 - 2703
  • [35] The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes
    Ros, German
    Sellart, Laura
    Materzynska, Joanna
    Vazquez, David
    Lopez, Antonio M.
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3234 - 3243
  • [36] Semantic Segmentation of Urban Scenes with a Location Prior Map Using Lidar Measurements
    Wang, Jeonghyeon
    Kim, Jinwhan
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 661 - 666
  • [37] RTFNet: RGB-Thermal Fusion Network for Semantic Segmentation of Urban Scenes
    Sun, Yuxiang
    Zuo, Weixun
    Liu, Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03): : 2576 - 2583
  • [38] Small Object Augmentation of Urban Scenes for Real-Time Semantic Segmentation
    Yang, Zhengeng
    Yu, Hongshan
    Feng, Mingtao
    Sun, Wei
    Lin, Xuefei
    Sun, Mingui
    Mao, Zhi-Hong
    Mian, Ajmal
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5175 - 5190
  • [39] CNN based Semantic Segmentation for Urban Traffic Scenes using Fisheye Camera
    Deng, Liuyuan
    Yang, Ming
    Qian, Yeqiang
    Wang, Chunxiang
    Wang, Bing
    2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 231 - 236
  • [40] FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion
    Sun, Yuxiang
    Zuo, Weixun
    Yun, Peng
    Wang, Hengli
    Liu, Ming
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 18 (03) : 1000 - 1011