Generalized Zero-Shot Learning for Action Recognition Fusing Text and Image GANs

Cited by: 1
Authors
Huang, Kaiqiang [1]
McKeever, Susan [1]
Miralles-Pechuan, Luis [1]
Affiliations
[1] Technol Univ Dublin, Sch Comp Sci, Grangegorman, Dublin 7, Ireland
Keywords
Generalized zero-shot action recognition; generalised zero-shot learning; generative adversarial networks; human action recognition
DOI
10.1109/ACCESS.2024.3349510
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Generalized Zero-Shot Action Recognition (GZSAR) aims to recognize classes that the model has not been trained on while maintaining robust performance on the seen, trained classes. This reduces the need for large amounts of labeled training data and makes more efficient use of available datasets. The main contribution of this paper is a novel approach to GZSAR that combines two Generative Adversarial Networks (GANs). One GAN generates embeddings from visual representations, while the other generates embeddings from textual representations. The generated embeddings are fused by taking the maximum value across the arrays that represent the embeddings, and the fused data is then used to train a GZSAR classifier in a supervised manner. The framework also incorporates a feature refinement component and an out-of-distribution detector to mitigate the domain shift problem between seen and unseen classes. In our experiments, notable improvements were observed. On the UCF101 benchmark dataset, we achieved a 7.43% relative increase in performance, rising from 50.93% (using images and Word2Vec alone) to 54.71% with two GANs. On the HMDB51 dataset, we saw a 7.06% relative improvement, from 36.11% using Text and Word2Vec to 38.66% with the dual-GAN approach. These results underscore the efficacy of our dual-GAN framework in enhancing GZSAR performance. The rest of the paper presents the main contributions to the field of GZSAR and outlines future lines of research in this area.
Pages: 5188-5202
Page count: 15
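The fusion step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that the two GANs emit embeddings of identical shape and that "maximum value" means an element-wise maximum over the two arrays; both are readings of the abstract, not details confirmed by this record. The function name `fuse_max` and the toy values are hypothetical.

```python
import numpy as np


def fuse_max(visual_emb, text_emb):
    """Fuse two embedding arrays by element-wise maximum.

    Assumption: both inputs have the same shape, e.g.
    (num_samples, embedding_dim). This is an illustrative
    reading of the abstract, not the authors' implementation.
    """
    visual_emb = np.asarray(visual_emb, dtype=np.float32)
    text_emb = np.asarray(text_emb, dtype=np.float32)
    if visual_emb.shape != text_emb.shape:
        raise ValueError(
            f"shape mismatch: {visual_emb.shape} vs {text_emb.shape}"
        )
    # np.maximum compares the arrays position by position and
    # keeps the larger value at each position.
    return np.maximum(visual_emb, text_emb)


# Toy embeddings standing in for the outputs of the two generators.
visual = np.array([[0.2, 0.9, -0.1],
                   [0.5, 0.0, 0.3]])
text = np.array([[0.4, 0.1, 0.0],
                 [0.5, -0.2, 0.7]])

fused = fuse_max(visual, text)
print(fused)
```

In this reading, the fused array keeps the same shape as each input, so it could be passed to a supervised classifier in place of either single-source embedding.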