Instance-Aware Deep Graph Learning for Multi-Label Classification

被引:5
|
作者
Wang, Yun [1 ,2 ]
Zhang, Tong [1 ,2 ]
Zhou, Chuanwei [1 ,2 ]
Cui, Zhen [1 ,2 ]
Yang, Jian [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens Inf, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Social, Nanjing 210094, Peoples R China
基金
中国国家自然科学基金;
关键词
Correlation; Adaptation models; Task analysis; Feature extraction; Image recognition; Convolutional neural networks; Sports; Graph convolutional neural network; image-dependent label correlation matrix; regions of interests; variational inference; IMAGE CLASSIFICATION;
D O I
10.1109/TMM.2021.3121559
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph convolutional neural network (GCN) has effectively boosted the multi-label image recognition task by modeling correlation among labels. In previous methods, label correlation is computed based on statistical information through label diffusion, and therefore the same for all samples. This, however, makes graph inference on labels insufficient to handle huge variations among numerous image instances. In this paper, we propose an instance-aware graph convolutional neural network (IA_GCN) framework for the multi-label classification. As a whole, two fused branches of sub-networks are involved in the framework: a global branch modeling the whole image and a local branch exploring dependencies among regions of interests (ROIs). For both the branches, an image-dependent label correlation matrix (ID_LCM), fusing both the statistical label correlation matrix (LCM) and an individual one of each image instance, is constructed to inject adaptive information of label-awareness into the learned features of the model through graph convolution. Specifically, the individual LCM of each image is obtained by mining the label dependencies based on the predicted label scores of those detected ROIs. In this process, considering the contribution differences of ROIs to multi-label classification, variational inference is introduced to learn adaptive scaling factors for those ROIs by considering their complex distribution. Finally, extensive experiments on MS-COCO and VOC datasets show that our proposed approach outperforms existing state-of-the-art methods.
引用
收藏
页码:90 / 99
页数:10
相关论文
共 50 条
  • [31] Active Multi-Instance Multi-Label Learning
    Retz, Robert
    Schwenker, Friedhelm
    ANALYSIS OF LARGE AND COMPLEX DATA, 2016, : 91 - 101
  • [32] A New multi-instance multi-label learning approach for image and text classification
    Yan, Kaobi
    Li, Zhixin
    Zhang, Canlong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (13) : 7875 - 7890
  • [33] Hierarchical multi-instance multi-label learning for Chinese patent text classification
    Liu, Yunduo
    Xu, Fang
    Zhao, Yushan
    Ma, Zichen
    Wang, Tengke
    Zhang, Shunxiang
    Tian, Yuhao
    CONNECTION SCIENCE, 2024, 36 (01)
  • [34] A New multi-instance multi-label learning approach for image and text classification
    Kaobi Yan
    Zhixin Li
    Canlong Zhang
    Multimedia Tools and Applications, 2016, 75 : 7875 - 7890
  • [35] Knowledge Graph Constraints for Multi-label Graph Classification
    Ringsquandl, Martin
    Lamparter, Steffen
    Thon, Ingo
    Lepratti, Raffaello
    Kroeger, Peer
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 121 - 127
  • [36] Image emotion multi-label classification based on multi-graph learning
    Wang, Meixia
    Zhao, Yuhai
    Wang, Yejiang
    Xu, Tongze
    Sun, Yiming
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
  • [37] Learning a Deep ConvNet for Multi-label Classification with Partial Labels
    Durand, Thibaut
    Mehrasa, Nazanin
    Mori, Greg
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 647 - 657
  • [38] DeepBE: Learning Deep Binary Encoding for Multi-Label Classification
    Li, Chenghua
    Kang, Qi
    Ge, Guojing
    Song, Qiang
    Lu, Hanqing
    Cheng, Jian
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 744 - 751
  • [39] A Survey of Multi-label Text Classification Based on Deep Learning
    Chen, Xiaolong
    Cheng, Jieren
    Liu, Jingxin
    Xu, Wenghang
    Hua, Shuai
    Tang, Zhu
    Sheng, Victor S.
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 443 - 456
  • [40] Multi-Label Classification of Text Documents Using Deep Learning
    Mohammed, Hamza Haruna
    Dogdu, Erdogan
    Gorur, Abdul Kadir
    Choupani, Roya
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4681 - 4689