Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing

Cited by: 1
Authors
Cho, Kyusik [1]
Lee, Suhyeon [1]
Seong, Hongje [1]
Kim, Euntai [1]
Affiliations
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea
Keywords
DOI
10.1109/WACV56688.2023.00056
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A network trained for domain adaptation is prone to bias toward easy-to-transfer classes. Since ground truth labels on the target domain are unavailable during training, this bias leads to skewed predictions in which hard-to-transfer classes are no longer predicted. To address this problem, we propose Cross-domain Moving Object Mixing (CMOM), which cuts several objects, including those of hard-to-transfer classes, from a source domain video clip and pastes them into a target domain video clip. Unlike image-level domain adaptation, mixing moving objects from two different videos requires the temporal context to be maintained. Therefore, we design CMOM to mix across consecutive video frames, so that unrealistic movements do not occur. We additionally propose Feature Alignment with Temporal Context (FATC) to enhance target domain feature discriminability. FATC exploits the robust source domain features, which are trained with ground truth labels, to learn discriminative target domain features in an unsupervised manner by filtering unreliable predictions with temporal consensus. We demonstrate the effectiveness of the proposed approaches through extensive experiments. In particular, our model reaches an mIoU of 53.81% on the VIPER → Cityscapes-Seq benchmark and an mIoU of 56.31% on the SYNTHIA-Seq → Cityscapes-Seq benchmark, surpassing the state-of-the-art methods by large margins.
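The cut-and-paste step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `cmom_mix`, the array layouts, the class id 5, and the use of a constant label map as a stand-in for target pseudo-labels are all assumptions made for illustration. The one property it tries to show is that the same set of classes is pasted in every frame of the clip, so a pasted object keeps its source motion.

```python
import numpy as np


def cmom_mix(src_frames, src_labels, tgt_frames, tgt_labels, class_ids):
    """Paste pixels of the selected classes from a source clip onto a target clip.

    Hypothetical helper, not taken from the paper's code.
    src_frames, tgt_frames: (T, H, W, 3) image arrays for T consecutive frames.
    src_labels, tgt_labels: (T, H, W) integer class maps.
    class_ids: classes to cut from the source; the same list is used for all frames.
    """
    # Per-frame mask: True where the source pixel belongs to a selected class.
    mask = np.isin(src_labels, class_ids)
    # Take source pixels and labels inside the mask, target ones elsewhere.
    mixed_frames = np.where(mask[..., None], src_frames, tgt_frames)
    mixed_labels = np.where(mask, src_labels, tgt_labels)
    return mixed_frames, mixed_labels, mask


# Toy two-frame clips (assumed data): a 2x2 object of class 5 moves one pixel right.
T, H, W = 2, 4, 4
src_frames = np.full((T, H, W, 3), 200, dtype=np.uint8)
tgt_frames = np.full((T, H, W, 3), 50, dtype=np.uint8)
src_labels = np.zeros((T, H, W), dtype=np.int64)
src_labels[0, 1:3, 0:2] = 5
src_labels[1, 1:3, 1:3] = 5
# Stand-in for target labels; the target domain has no ground truth in training.
tgt_labels = np.full((T, H, W), 1, dtype=np.int64)

mixed_frames, mixed_labels, mask = cmom_mix(
    src_frames, src_labels, tgt_frames, tgt_labels, class_ids=[5]
)
```

Because the mask is computed independently per frame from the source labels, the pasted region follows the object's position in each source frame rather than staying fixed, which is the temporal consistency the abstract asks for.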
Pages: 489 - 498
Number of pages: 10
Related papers
50 in total
  • [31] Exploit Domain-Robust Optical Flow in Domain Adaptive Video Semantic Segmentation
    Gao, Yuan
    Wang, Zilei
    Zhuang, Jiafan
    Zhang, Yixin
    Li, Junjie
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 641 - 649
  • [32] Uncertainty-aware consistency regularization for cross-domain semantic segmentation
    Zhou, Qianyu
    Feng, Zhengyang
    Gu, Qiqi
    Cheng, Guangliang
    Lu, Xuequan
    Shi, Jianping
    Ma, Lizhuang
    Computer Vision and Image Understanding, 2022, 221
  • [33] A Cross-Domain Coupling Network for Semantic Segmentation of Remote Sensing Images
    Li, Xin
    Xu, Feng
    Tao, Feifei
    Tong, Yao
    Gao, Hongmin
    Liu, Fan
    Chen, Ziqi
    Lyu, Xin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [34] Confidence-and-Refinement Adaptation Model for Cross-Domain Semantic Segmentation
    Zhang, Xiaohong
    Chen, Yi
    Shen, Ziyi
    Shen, Yuming
    Zhang, Haofeng
    Zhang, Yudong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9529 - 9542
  • [35] Cross-Domain Palmprint Recognition via Regularized Adversarial Domain Adaptive Hashing
    Du, Xuefeng
    Zhong, Dexing
    Shao, Huikai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2372 - 2385
  • [36] Cross-Domain and Cross-Modal Knowledge Distillation in Domain Adaptation for 3D Semantic Segmentation
    Li, Miaoyu
    Zhang, Yachao
    Xie, Yuan
    Gao, Zuodong
    Li, Cuihua
    Zhang, Zhizhong
    Qu, Yanyun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3829 - 3837
  • [37] Semantic-aware short path adversarial training for cross-domain semantic segmentation
    Shan, Yuhu
    Chew, Chee Meng
    Lu, Wen Feng
    NEUROCOMPUTING, 2020, 380 : 125 - 132
  • [38] DAT: DOMAIN ADAPTIVE TRANSFORMER FOR DOMAIN ADAPTIVE SEMANTIC SEGMENTATION
    Park, Jinyoung
    Son, Minseok
    Lee, Sumin
    Kim, Changick
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 4183 - 4187
  • [39] Domain transfer via cross-domain analogy
    Klenk, Matthew
    Forbus, Ken
    COGNITIVE SYSTEMS RESEARCH, 2009, 10 (03) : 240 - 250
  • [40] Adapting Object Detectors via Selective Cross-Domain Alignment
    Zhu, Xinge
    Pang, Jiangmiao
    Yang, Ceyuan
    Shi, Jianping
    Lin, Dahua
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 687 - 696