Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing

被引:1
|
作者
Cho, Kyusik [1 ]
Lee, Suhyeon [1 ]
Seong, Hongje [1 ]
Kim, Euntai [1 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea
关键词
D O I
10.1109/WACV56688.2023.00056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The network trained for domain adaptation is prone to bias toward the easy-to-transfer classes. Since the ground truth label on the target domain is unavailable during training, the bias problem leads to skewed predictions, forgetting to predict hard-to-transfer classes. To address this problem, we propose Cross-domain Moving Object Mixing (CMOM) that cuts several objects, including hard-to-transfer classes, in the source domain video clip and pastes them into the target domain video clip. Unlike image-level domain adaptation, the temporal context should be maintained to mix moving objects in two different videos. Therefore, we design CMOM to mix with consecutive video frames, so that unrealistic movements are not occurring. We additionally propose Feature Alignment with Temporal Context (FATC) to enhance target domain feature discriminability. FATC exploits the robust source domain features, which are trained with ground truth labels, to learn discriminative target domain features in an unsupervised manner by filtering unreliable predictions with temporal consensus. We demonstrate the effectiveness of the proposed approaches through extensive experiments. In particular, our model reaches mIoU of 53.81% on VIPER. Cityscapes-Seq benchmark and mIoU of 56.31% on SYNTHIA-Seq. Cityscapes-Seq benchmark, surpassing the state-of-the-art methods by large margins.
引用
收藏
页码:489 / 498
页数:10
相关论文
共 50 条
  • [21] Cross-Domain Semantic Segmentation on Inconsistent Taxonomy Using VLMs
    Lim, Jeongkee
    Kim, Yusung
    COMPUTER VISION - ECCV 2024, PT LXV, 2025, 15123 : 18 - 35
  • [22] Cross-Domain Transfer via Semantic Skill Imitation
    Pertsch, Karl
    Desai, Ruta
    Kumar, Vikash
    Meier, Franziska
    Lim, Joseph J.
    Batra, Dhruv
    Rai, Akshara
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 690 - 700
  • [23] Robust moving video object segmentation in the MPEG compressed domain
    Yu, XD
    Duan, LY
    Tian, Q
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 933 - 936
  • [24] Graph Adaptive Semantic Transfer for Cross-domain Sentiment Classification
    Zhang, Kai
    Liu, Qi
    Huang, Zhenya
    Cheng, Mingyue
    Zhang, Kun
    Zhang, Mengdi
    Wu, Wei
    Chen, Enhong
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1566 - 1576
  • [25] Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation
    Chen, Yiyang
    Zhao, Shanshan
    Ding, Changxing
    Tang, Liyao
    Wang, Chaoyue
    Tao, Dacheng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3866 - 3875
  • [26] Cross-Domain Semantic Segmentation of Urban Scenes via Multi-Level Feature Alignment
    Zhang, Bin
    Zhao, Shengjie
    Zhang, Rongqing
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1912 - 1917
  • [27] Progressive cross-domain knowledge distillation for efficient unsupervised domain adaptive object detection
    Li, Wei
    Li, Lingqiao
    Yang, Huihua
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [28] Preserving Label-Related Domain-Specific Information for Cross-Domain Semantic Segmentation
    Liao, Muxin
    Tian, Shishun
    Zhang, Yuhang
    Hua, Guoguang
    Zou, Wenbin
    Li, Xia
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 14917 - 14931
  • [29] Uncertainty-aware consistency regularization for cross-domain semantic segmentation
    Zhou, Qianyu
    Feng, Zhengyang
    Gu, Qiqi
    Cheng, Guangliang
    Lu, Xuequan
    Shi, Jianping
    Ma, Lizhuang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 221
  • [30] Unsupervised domain adaptation alignment method for cross-domain semantic segmentation of remote sensing images
    Shen Z.
    Ni H.
    Guan H.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2023, 52 (12): : 1 - 2