Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

被引:0
|
作者
Wei, Pengfei [1 ]
Kong, Lingdong [2 ]
Qu, Xinghua [1 ]
Ren, Yi [1 ]
Xu, Zhiqiang [3 ]
机构
[1] ByteDance, AI Lab, Beijing, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] MBZUAI, Abu Dhabi, U Arab Emirates
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised video domain adaptation is a practical yet challenging task. In this work, for the first time, we tackle it from a disentanglement view. Our key idea is to handle the spatial and temporal domain divergence separately through disentanglement. Specifically, we consider the generation of cross-domain videos from two sets of latent factors, one encoding the static information and another encoding the dynamic information. A Transfer Sequential VAE (TranSVAE) framework is then developed to model such generation. To better serve for adaptation, we propose several objectives to constrain the latent factors. With these constraints, the spatial divergence can be readily removed by disentangling the static domain-specific information out, and the temporal divergence is further reduced from both frame- and video-levels through adversarial learning. Extensive experiments on the UCF-HMDB, Jester, and Epic-Kitchens datasets verify the effectiveness and superiority of TranSVAE compared with several state-of-the-art approaches.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Improving Hazy Image Recognition by Unsupervised Domain Adaptation
    Yuan, Zhiyu
    Li, Yuhang
    Yang, Jianfei
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 311 - 316
  • [32] Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
    Sohn, Kihyuk
    Liu, Sifei
    Zhong, Guangyu
    Yu, Xiang
    Yang, Ming-Hsuan
    Chandraker, Manmohan
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5917 - 5925
  • [33] Unsupervised Domain Adaptation for Human Activity Recognition in Radar
    Li, Xinyu
    Jing, Xiaojun
    He, Yuan
    2020 IEEE RADAR CONFERENCE (RADARCONF20), 2020,
  • [34] UNSUPERVISED DOMAIN ADAPTATION VIA DOMAIN ADVERSARIAL TRAINING FOR SPEAKER RECOGNITION
    Wang, Qing
    Rao, Wei
    Sun, Sining
    Xie, Lei
    Chng, Eng Siong
    Li, Haizhou
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4889 - 4893
  • [35] Source-Free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition
    Xu, Yuecong
    Yang, Jianfei
    Cao, Haozhi
    Wu, Keyu
    Wu, Min
    Chen, Zhenghua
    COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 147 - 164
  • [36] Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition
    Ahsan, Unaiza
    Madhok, Rishi
    Essa, Irfan
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 179 - 189
  • [37] Student Engagement from Video using Unsupervised Domain Adaptation
    Thomas, Chinchu
    Purvaj, Seethamraju
    Jayagopi, Dinesh Babu
    IMPROVE: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND VISION ENGINEERING, 2022, : 118 - 125
  • [38] Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive Survey
    Xu, Yuecong
    Cao, Haozhi
    Xie, Lihua
    Li, Xiao-Li
    Chen, Zhenghua
    Yang, Jianfei
    ACM COMPUTING SURVEYS, 2024, 56 (12)
  • [39] In the Wild Video Violence Detection: An Unsupervised Domain Adaptation Approach
    Luca Ciampi
    Carlos Santiago
    Fabrizio Falchi
    Claudio Gennaro
    Giuseppe Amato
    SN Computer Science, 5 (7)
  • [40] AN UNSUPERVISED DOMAIN ADAPTATION METHOD FOR COMPRESSED VIDEO QUALITY ENHANCEMENT
    Wang Zeyang
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,