Multi-label video classification via coupling attentional multiple instance learning with label relation graph *

被引:12
|
作者
Li, Xuewei [1 ]
Wu, Hongjun [1 ]
Li, Mengzhu [1 ]
Liu, Hongzhe [1 ]
机构
[1] Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China
关键词
Multi-label video classification; Multiple instance learning; Attentional feature learning; Label relation graph;
D O I
10.1016/j.patrec.2022.01.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label video classification is a challenging problem in pattern recognition field, as it is difficult to grasp the occurring localizations of a huge number of labels in videos. To solve this problem, we propose a general framework named MALL-CNN, i.e., Multi-Attention Label Relation Learning Convolutional Neural Network. MALL-CNN not only builds the correspondences between labels and videos by an attention mechanism, but also captures label co-occurrence by a graph learning approach. Specifically, we introduce multiple instance learning to composite a set of frame-level features into a video-level feature. Then, video-level feature is mapped into the content-aware category representations in an improved attentional manner. Further, these representations are enhanced by a series of label relation graphs, which transform global label relationships to the label relationships of current video. With the three processes, frame feature aggregation, video feature mapping, and label relationship construction can be achieved in MALL-CNN for multi-label video classification. Extensive experiments on real-world scene benchmark Youtube-8M verify that MALL-CNN with only frame feature surpasses the state of the arts with multi modal features as well as ensemble models.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 59
页数:7
相关论文
共 50 条
  • [31] Learning Label-Specific Multiple Local Metrics for Multi-Label Classification
    Mao, Jun-Xiang
    Hang, Jun-Yi
    Zhang, Min-Ling
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 4742 - 4750
  • [32] Multi-Instance Multi-Label Active Learning
    Huang, Sheng-Jun
    Gao, Nengneng
    Chen, Songcan
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1886 - 1892
  • [33] Fast Multi-Instance Multi-Label Learning
    Huang, Sheng-Jun
    Gao, Wei
    Zhou, Zhi-Hua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) : 2614 - 2627
  • [34] Active Multi-Instance Multi-Label Learning
    Retz, Robert
    Schwenker, Friedhelm
    ANALYSIS OF LARGE AND COMPLEX DATA, 2016, : 91 - 101
  • [35] Enhancing Multi-Label Classification via Dynamic Label-Order Learning
    Li, Jiangnan
    Zhang, Yice
    Chen, Shiwei
    Xu, Ruifeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18527 - 18535
  • [36] Joint learning of multi-label classification and label correlations
    He, Zhi-Fen
    Yang, Ming
    Liu, Hui-Dong
    Ruan Jian Xue Bao/Journal of Software, 2014, 25 (09): : 1967 - 1981
  • [37] A Multi-label Learning Approach Based on Mapping from Instance to Label
    Zhang, Huaxiang
    Hu, Bin
    Feng, Xinying
    PERVASIVE COMPUTING AND THE NETWORKED WORLD, 2014, 8351 : 743 - +
  • [38] Discover Multiple Novel Labels in Multi-Instance Multi-Label Learning
    Zhu, Yue
    Ting, Kai Ming
    Zhou, Zhi-Hua
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2977 - 2983
  • [39] Multi-label text classification via joint learning from label embedding and label correlation
    Liu, Huiting
    Chen, Geng
    Li, Peipei
    Zhao, Peng
    Wu, Xindong
    NEUROCOMPUTING, 2021, 460 : 385 - 398
  • [40] Scalable Label Distribution Learning for Multi-Label Classification
    Zhao, Xingyu
    An, Yuexuan
    Qi, Lei
    Geng, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,