Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer

被引:7
|
作者
Zhuo, Junbao [1 ]
Zhu, Yan [2 ]
Cui, Shuhao [3 ]
Wang, Shuhui [1 ,4 ]
Ma, Bin [3 ]
Huang, Qingming [1 ,2 ]
Wei, Xiaoming [3 ]
Wei, Xiaolin [3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Meituan Inc, Beijing, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Zero-shot Video Classification; Transfer Learning;
D O I
10.1145/3503161.3548008
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Zero-shot video classification (ZSVC) that aims to recognize video classes that have never been seen during model training, has become a thriving research direction. ZSVC is achieved by building mappings between visual and semantic embeddings. Recently, ZSVC has been achieved by automatically mining the underlying objects in videos as attributes and incorporating external commonsense knowledge. However, the object mined from seen categories can not generalized to unseen ones. Besides, the category-object relationships are usually extracted from commonsense knowledge or word embedding, which is not consistent with video modality. To tackle these issues, we propose to mine associated objects and category-object relationships for each category from retrieved web images. The associated objects of all categories are employed as generic attributes and the mined category-object relationships could narrow the modality inconsistency for better knowledge transfer. Another issue of existing ZSVC methods is that the model sufficiently trained with labeled seen categories may not generalize well to distinct unseen categories. To encourage a more reliable transfer, we propose Task Similarity aware Representation Learning (TSRL). In TSRL, the similarity between seen categories and the unseen ones is estimated and used to regularize the model in an appropriate way. We construct a model for ZSVC based on the constructed attributes, the mined category-object relationships and the proposed TSRL. Experimental results on four public datasets, i.e., FCVID, UCF101, HMDB51 and Olympic Sports, show that our model performs favorably against state-of-the-art methods. Our codes are publicly available at https://github.com/junbaoZHUO/TSRL.
引用
收藏
页码:5761 / 5772
页数:12
相关论文
共 50 条
  • [41] Attribute relation learning for zero-shot classification
    Liu, Mingxia
    Zhang, Daoqiang
    Chen, Songcan
    NEUROCOMPUTING, 2014, 139 : 34 - 46
  • [42] Zero-shot Classification using Hyperdimensional Computing
    Ruffino, Samuele
    Karunaratne, Geethan
    Hersche, Michael
    Benini, Luca
    Abu Sebastian
    Rahimi, Abbas
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [43] Zero-Shot Knowledge Distillation in Deep Networks
    Nayak, Gaurav Kumar
    Mopuri, Konda Reddy
    Shaj, Vaisakh
    Babu, R. Venkatesh
    Chakraborty, Anirban
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [44] Gaze Embeddings for Zero-Shot Image Classification
    Karessli, Nour
    Akata, Zeynep
    Schiele, Bernt
    Bulling, Andreas
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6412 - 6421
  • [45] Zero-shot classification with unseen prototype learning
    Zhong Ji
    Biying Cui
    Yunlong Yu
    Yanwei Pang
    Zhongfei Zhang
    Neural Computing and Applications, 2023, 35 : 12307 - 12317
  • [46] Multimodal Ensembling for Zero-Shot Image Classification
    Hickmon, Javon
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23747 - 23749
  • [47] Zero-shot classification with unseen prototype learning
    Ji, Zhong
    Cui, Biying
    Yu, Yunlong
    Pang, Yanwei
    Zhang, Zhongfei
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (17): : 12307 - 12317
  • [48] Retrieval Augmented Zero-Shot Text Classification
    Abdullahi, Tassallah
    Singh, Ritambhara
    Eickhoff, Carsten
    PROCEEDINGS OF THE 2024 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2024, 2024, : 195 - 203
  • [49] Zero-Shot Image Classification Based on Attribute
    Zhang, Wei
    Chen, Wenbai
    Chen, Xiangfeng
    Han, Hu
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 25 - 30
  • [50] Side-Scan Sonar Image Classification With Zero-Shot and Style Transfer
    Bai, Zhongyu
    Xu, Hongli
    Ding, Qichuan
    Zhang, Xiangyue
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 15