JSSE: Joint Sequential Semantic Encoder for Zero-Shot Event Recognition

Authors
Madapana N. [1]
Wachs J.P. [1]
Affiliation
[1] Purdue University, School of Industrial Engineering, West Lafayette, IN 47906
Funding
U.S. National Science Foundation; U.S. Agency for Healthcare Research and Quality; U.S. National Institutes of Health
Keywords
Action and gesture recognition; activity; semantic descriptors; transfer learning; zero-shot learning (ZSL);
DOI
10.1109/TAI.2022.3208860
Abstract
Zero-shot learning (ZSL) is a paradigm in transfer learning that aims to recognize unknown categories from a mere description of them. ZSL has been thoroughly studied in the domain of static object recognition; however, ZSL for dynamic events (zero-shot event recognition, ZSER) such as activities and gestures has hardly been investigated. In this context, this article addresses ZSER by relying on semantic attributes of events to transfer the learned knowledge from seen classes to unseen ones. First, we used the Amazon Mechanical Turk platform to create the first attribute-based gesture dataset, referred to as zero shot gestural learning (ZSGL), comprising the categories present in the MSRC and Italian gesture datasets. Overall, our ZSGL dataset consisted of 26 categories and 65 discriminative attributes, with 16 attribute annotations and 400 examples per category. We used trainable recurrent networks and 3-D convolutional neural networks (CNNs) to learn the spatiotemporal features. Next, we propose a simple yet effective end-to-end approach for ZSER, referred to as joint sequential semantic encoder (JSSE), to explore temporal patterns, to efficiently represent events in the latent space, and to simultaneously optimize for both the semantic and classification tasks. We evaluated our model on ZSGL and two action datasets (UCF and HMDB), and compared the performance of JSSE against several existing baselines under four experimental conditions: 1) within-category, 2) across-category, 3) closed-set, and 4) open-set. Results show that JSSE considerably outperforms other approaches (p < 0.05) and performs favorably on the gesture and action datasets under all experimental conditions. © 2020 IEEE.
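The attribute-based transfer described in the abstract can be illustrated with a minimal sketch. This is not the JSSE model itself: it assumes a trained network has already produced an attribute score vector for a video, and it simply assigns the unseen class whose attribute signature is most similar. The class names, attribute values, and function names below are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical attribute signatures for two unseen gesture classes.
# The classes and values are invented for this sketch; they are not
# taken from the ZSGL dataset.
UNSEEN_CLASS_ATTRIBUTES = {
    "wave":  [1.0, 0.0, 1.0, 0.0],
    "punch": [0.0, 1.0, 0.0, 1.0],
}


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def predict_unseen_class(predicted_attributes, class_attributes):
    """Return the class whose attribute signature best matches the prediction."""
    return max(
        class_attributes,
        key=lambda name: cosine_similarity(predicted_attributes, class_attributes[name]),
    )


# Assumed output of an attribute predictor for one video (hypothetical values).
predicted = [0.9, 0.2, 0.7, 0.1]
label = predict_unseen_class(predicted, UNSEEN_CLASS_ATTRIBUTES)
print(label)
```

Because the unseen classes are described only by their attribute signatures, no training examples of "wave" or "punch" are needed at prediction time; that is the sense in which the knowledge learned on seen classes transfers.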
Pages: 1472-1483
Number of pages: 11