Domain learning joint with semantic adaptation for human action recognition

被引:15
|
作者
Zhang, Junxuan [1 ]
Hu, Haifeng [1 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Knowledge adaptation; Two-stream network; Video representation; Action recognition; Cascaded convolution fusion strategy; REPRESENTATION; FEATURES;
D O I
10.1016/j.patcog.2019.01.027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action recognition is a challenging task in the field of computer vision. The deficiency in training samples is a bottleneck problem in the current action recognition research. With the explosive growth of Internet data, some researchers try to use prior knowledge learned from various video sources to assist in recognizing the action video of the target domain, which is called knowledge adaptation. Based on this idea, we propose a novel framework for action recognition, called Semantic Adaptation based on the Vector of Locally Max Pooled deep learned Features (SA-VLMPF). The proposed framework consists of three parts: Two-Stream Fusion Network (TSFN), Vector of Locally Max-Pooled deep learned Features (VLMPF) and Semantic Adaptation Model (SAM). TSFN adopts a cascaded convolution fusion strategy to combine the convolutional features extracted from two-stream network. VLMPF retains the long-term information in videos and removes the irrelevant information by capturing multiple local features and extracting the features with the highest response to action category. SAM first maps the data of the auxiliary domain and the target domain into the high-level semantic representation through the deep network. Then the obtained high-level semantic representations from auxiliary domain are adapted into target domain in order to optimize the target classifier. Compared with the existing methods, the proposed methods can utilize the advantages of deep learning methods in obtaining the high-level semantic information to improve the performance of knowledge adaptation. At the same time, SA-VLMPF can make full use of the auxiliary data to make up for the insufficiency of training samples. Multiple experiments are conducted on several couples of datasets to validate the effectiveness of the proposed framework. The results show that the proposed SA-VLMPF outperforms the state-of-the-art knowledge adaptation methods. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:196 / 209
页数:14
相关论文
共 50 条
  • [1] Joint Adversarial Learning for Domain Adaptation in Semantic Segmentation
    Zhang, Yixin
    Wang, Zilei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6877 - 6884
  • [2] Joint adversarial learning for domain adaptation in semantic segmentation
    Zhang, Yixin
    Wang, Zilei
    AAAI 2020 - 34th AAAI Conference on Artificial Intelligence, 2020, : 6877 - 6884
  • [3] Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition
    Sun, Bin
    Kong, Dehui
    Wang, Shaofan
    Wang, Lichun
    Yin, Baocai
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (02)
  • [4] Phase Randomization: A data augmentation for domain adaptation in human action recognition
    Mitsuzumi, Yu
    Irie, Go
    Kimura, Akisato
    Nakazawa, Atsushi
    PATTERN RECOGNITION, 2024, 146
  • [5] Action recognition by joint learning
    Yuan, Yuan
    Qi, Lei
    Lu, Xiaoqiang
    IMAGE AND VISION COMPUTING, 2016, 55 : 77 - 85
  • [6] JOINT LABEL-INTERACTION LEARNING FOR HUMAN ACTION RECOGNITION
    Jin, Jiali
    Wang, Zhenhua
    Liu, Sheng
    Zhang, Jianhua
    Chen, Shengyong
    Guan, Qiu
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4507 - 4511
  • [7] Learning Semantic Representations for Unsupervised Domain Adaptation
    Xie, Shaoan
    Zheng, Zibin
    Chen, Liang
    Chen, Chuan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [8] Latent semantic learning with structured sparse representation for human action recognition
    Lu, Zhiwu
    Peng, Yuxin
    PATTERN RECOGNITION, 2013, 46 (07) : 1799 - 1809
  • [9] Learning Disentangled Semantic Representation for Domain Adaptation
    Cai, Ruichu
    Li, Zijian
    Wei, Pengfei
    Qiao, Jie
    Zhang, Kun
    Hao, Zhifeng
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2060 - 2066
  • [10] Bidirectional Learning for Domain Adaptation of Semantic Segmentation
    Li, Yunsheng
    Yuan, Lu
    Vasconcelos, Nuno
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6929 - 6938