Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

被引：0

作者：

Wei, Pengfei ^{[1
]}

Kong, Lingdong ^{[2
]}

Qu, Xinghua ^{[1
]}

Ren, Yi ^{[1
]}

Xu, Zhiqiang ^{[3
]}

机构：

[1] ByteDance, AI Lab, Beijing, Peoples R China

[2] Natl Univ Singapore, Singapore, Singapore

[3] MBZUAI, Abu Dhabi, U Arab Emirates

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Unsupervised video domain adaptation is a practical yet challenging task. In this work, for the first time, we tackle it from a disentanglement view. Our key idea is to handle the spatial and temporal domain divergence separately through disentanglement. Specifically, we consider the generation of cross-domain videos from two sets of latent factors, one encoding the static information and another encoding the dynamic information. A Transfer Sequential VAE (TranSVAE) framework is then developed to model such generation. To better serve for adaptation, we propose several objectives to constrain the latent factors. With these constraints, the spatial divergence can be readily removed by disentangling the static domain-specific information out, and the temporal divergence is further reduced from both frame- and video-levels through adversarial learning. Extensive experiments on the UCF-HMDB, Jester, and Epic-Kitchens datasets verify the effectiveness and superiority of TranSVAE compared with several state-of-the-art approaches.

引用

页数：20

共 50 条

[31] Improving Hazy Image Recognition by Unsupervised Domain Adaptation
Yuan, Zhiyu
Li, Yuhang
Yang, Jianfei
2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 311 - 316
[32] Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
Sohn, Kihyuk
Liu, Sifei
Zhong, Guangyu
Yu, Xiang
Yang, Ming-Hsuan
Chandraker, Manmohan
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5917 - 5925
[33] Unsupervised Domain Adaptation for Human Activity Recognition in Radar
Li, Xinyu
Jing, Xiaojun
He, Yuan
2020 IEEE RADAR CONFERENCE (RADARCONF20), 2020,
[34] UNSUPERVISED DOMAIN ADAPTATION VIA DOMAIN ADVERSARIAL TRAINING FOR SPEAKER RECOGNITION
Wang, Qing
Rao, Wei
Sun, Sining
Xie, Lei
Chng, Eng Siong
Li, Haizhou
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4889 - 4893
[35] Source-Free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition
Xu, Yuecong
Yang, Jianfei
Cao, Haozhi
Wu, Keyu
Wu, Min
Chen, Zhenghua
COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 147 - 164
[36] Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition
Ahsan, Unaiza
Madhok, Rishi
Essa, Irfan
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 179 - 189
[37] Student Engagement from Video using Unsupervised Domain Adaptation
Thomas, Chinchu
Purvaj, Seethamraju
Jayagopi, Dinesh Babu
IMPROVE: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND VISION ENGINEERING, 2022, : 118 - 125
[38] Video Unsupervised Domain Adaptation with Deep Learning: A Comprehensive Survey
Xu, Yuecong
Cao, Haozhi
Xie, Lihua
Li, Xiao-Li
Chen, Zhenghua
Yang, Jianfei
ACM COMPUTING SURVEYS, 2024, 56 (12)
[39] In the Wild Video Violence Detection: An Unsupervised Domain Adaptation Approach
Luca Ciampi
Carlos Santiago
Fabrizio Falchi
Claudio Gennaro
Giuseppe Amato
SN Computer Science, 5 (7)
[40] AN UNSUPERVISED DOMAIN ADAPTATION METHOD FOR COMPRESSED VIDEO QUALITY ENHANCEMENT
Wang Zeyang
2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,

← 1 2 3 4 5 →