Domain learning joint with semantic adaptation for human action recognition

被引：15

作者：

Zhang, Junxuan ^{[1
]}

Hu, Haifeng ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China

来源：

PATTERN RECOGNITION | 2019年 / 90卷

基金：

中国国家自然科学基金;

关键词：

Knowledge adaptation; Two-stream network; Video representation; Action recognition; Cascaded convolution fusion strategy; REPRESENTATION; FEATURES;

D O I：

10.1016/j.patcog.2019.01.027

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Action recognition is a challenging task in the field of computer vision. The deficiency in training samples is a bottleneck problem in the current action recognition research. With the explosive growth of Internet data, some researchers try to use prior knowledge learned from various video sources to assist in recognizing the action video of the target domain, which is called knowledge adaptation. Based on this idea, we propose a novel framework for action recognition, called Semantic Adaptation based on the Vector of Locally Max Pooled deep learned Features (SA-VLMPF). The proposed framework consists of three parts: Two-Stream Fusion Network (TSFN), Vector of Locally Max-Pooled deep learned Features (VLMPF) and Semantic Adaptation Model (SAM). TSFN adopts a cascaded convolution fusion strategy to combine the convolutional features extracted from two-stream network. VLMPF retains the long-term information in videos and removes the irrelevant information by capturing multiple local features and extracting the features with the highest response to action category. SAM first maps the data of the auxiliary domain and the target domain into the high-level semantic representation through the deep network. Then the obtained high-level semantic representations from auxiliary domain are adapted into target domain in order to optimize the target classifier. Compared with the existing methods, the proposed methods can utilize the advantages of deep learning methods in obtaining the high-level semantic information to improve the performance of knowledge adaptation. At the same time, SA-VLMPF can make full use of the auxiliary data to make up for the insufficiency of training samples. Multiple experiments are conducted on several couples of datasets to validate the effectiveness of the proposed framework. The results show that the proposed SA-VLMPF outperforms the state-of-the-art knowledge adaptation methods. (C) 2019 Elsevier Ltd. All rights reserved.

引用

页码：196 / 209

页数：14

共 50 条

[1] Joint Adversarial Learning for Domain Adaptation in Semantic Segmentation
Zhang, Yixin
Wang, Zilei
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6877 - 6884
[2] Joint adversarial learning for domain adaptation in semantic segmentation
Zhang, Yixin
Wang, Zilei
AAAI 2020 - 34th AAAI Conference on Artificial Intelligence, 2020, : 6877 - 6884
[3] Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition
Sun, Bin
Kong, Dehui
Wang, Shaofan
Wang, Lichun
Yin, Baocai
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (02)
[4] Phase Randomization: A data augmentation for domain adaptation in human action recognition
Mitsuzumi, Yu
Irie, Go
Kimura, Akisato
Nakazawa, Atsushi
PATTERN RECOGNITION, 2024, 146
[5] Action recognition by joint learning
Yuan, Yuan
Qi, Lei
Lu, Xiaoqiang
IMAGE AND VISION COMPUTING, 2016, 55 : 77 - 85
[6] JOINT LABEL-INTERACTION LEARNING FOR HUMAN ACTION RECOGNITION
Jin, Jiali
Wang, Zhenhua
Liu, Sheng
Zhang, Jianhua
Chen, Shengyong
Guan, Qiu
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4507 - 4511
[7] Learning Semantic Representations for Unsupervised Domain Adaptation
Xie, Shaoan
Zheng, Zibin
Chen, Liang
Chen, Chuan
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[8] Latent semantic learning with structured sparse representation for human action recognition
Lu, Zhiwu
Peng, Yuxin
PATTERN RECOGNITION, 2013, 46 (07) : 1799 - 1809
[9] Learning Disentangled Semantic Representation for Domain Adaptation
Cai, Ruichu
Li, Zijian
Wei, Pengfei
Qiao, Jie
Zhang, Kun
Hao, Zhifeng
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2060 - 2066
[10] Bidirectional Learning for Domain Adaptation of Semantic Segmentation
Li, Yunsheng
Yuan, Lu
Vasconcelos, Nuno
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6929 - 6938

← 1 2 3 4 5 →