Generalized Zero-Shot Learning for Action Recognition Fusing Text and Image GANs

被引:1
|
作者
Huang, Kaiqiang [1 ]
McKeever, Susan [1 ]
Miralles-Pechuan, Luis [1 ]
机构
[1] Technol Univ Dublin, Sch Comp Sci, Grangegorman, Dublin 7, Ireland
关键词
Generalized zero-shot action recognition; generalised zero-shot learning; generative adversarial networks; human action recognition;
D O I
10.1109/ACCESS.2024.3349510
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generalized Zero-Shot Action Recognition (GZSAR) is geared towards recognizing classes that the model has not been trained on, while still maintaining robust performance on the familiar, trained classes. This approach mitigates the need for an extensive amount of labeled training data and enhances the efficient utilization of available datasets. The main contribution of this paper is a novel approach for GZSAR that combines the power of two Generative Adversarial Networks (GANs). One GAN is responsible for generating embeddings from visual representations, while the other GAN focuses on generating embeddings from textual representations. These generated embeddings are fused, with the selection of the maximum value from each array that represents the embeddings, and this fused data is then utilized to train a GZSAR classifier in a supervised manner. This framework also incorporates a feature refinement component and an out-of-distribution detector to mitigate the domain shift problem between seen and unseen classes. In our experiments, notable improvements were observed. On the UCF101 benchmark dataset, we achieved a 7.43% increase in performance, rising from 50.93% (utilizing images and Word2Vec alone) to 54.71% with the implementation of two GANs. Additionally, on the HMDB51 dataset, we saw a 7.06% improvement, advancing from 36.11% using Text and Word2Vec to 38.66% with the dual-GAN approach. These results underscore the efficacy of our dual-GAN framework in enhancing GZSAR performance. The rest of the paper shows the main contributions to the field of GZSAR and highlights the potential and future lines of research in this exciting area.
引用
收藏
页码:5188 / 5202
页数:15
相关论文
共 50 条
  • [21] Learn to Adapt for Generalized Zero-Shot Text Classification
    Zhang, Yiwen
    Yuan, Caixia
    Wang, Xiaojie
    Bai, Ziwei
    Liu, Yongbin
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 517 - 527
  • [22] Generalized Zero-Shot Text Classification for ICD Coding
    Song, Congzheng
    Zhang, Shanghang
    Sadoughi, Najmeh
    Xie, Pengtao
    Xing, Eric
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4018 - 4024
  • [23] Embarrassingly Easy Zero-Shot Image Recognition
    Song, Wenli
    Zhang, Lei
    Fu, Jingru
    BIOMETRIC RECOGNITION (CCBR 2019), 2019, 11818 : 126 - 133
  • [24] Wheel Hub Defects Image Recognition Base on Zero-Shot Learning
    Sun, Xiaohong
    Gu, Jinan
    Wang, Meimei
    Meng, Yanhua
    Shi, Huichao
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 16
  • [25] GENERALIZED ZERO-SHOT RECOGNITION THROUGH IMAGE-GUIDED SEMANTIC CLASSIFICATION
    Li, Fang
    Yeh, Mei-Chen
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2483 - 2487
  • [26] Zero-shot Learning via the fusion of generation and embedding for image recognition
    Zhao, Peng
    Zhang, Siying
    Liu, Jinhui
    Liu, Huiting
    INFORMATION SCIENCES, 2021, 578 (578) : 831 - 847
  • [27] Dual insurance for generalized zero-shot learning
    Liang, Jiahao
    Fang, Xiaozhao
    Kang, Peipei
    Han, Na
    Li, Chuang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (03) : 2111 - 2125
  • [28] Model Selection for Generalized Zero-Shot Learning
    Zhang, Hongguang
    Koniusz, Piotr
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 198 - 204
  • [29] Contrastive Embedding for Generalized Zero-Shot Learning
    Han, Zongyan
    Fu, Zhenyong
    Chen, Shuo
    Yang, Jian
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2371 - 2381
  • [30] Semantics Disentangling for Generalized Zero-Shot Learning
    Chen, Zhi
    Luo, Yadan
    Qiu, Ruihong
    Wang, Sen
    Huang, Zi
    Li, Jingjing
    Zhang, Zheng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8692 - 8700