Complex Object Classification: A Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport

Cited by: 36
Authors
Yang, Yang [1]
Wu, Yi-Feng [1]
Zhan, De-Chuan [1]
Liu, Zhi-Bin [2]
Jiang, Yuan [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[2] Tencent WXG, Shenzhen, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Multi-modal; Multi-instance; Multi-label; Optimal Transport;
DOI
10.1145/3219819.3220012
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In real-world applications, complex objects usually carry multiple labels and can be described by multiple modal representations; for example, a complex article contains both text and image information and has multiple annotations. Previous methods assume that homogeneous multi-modal data are consistent, whereas in real applications the raw data are disordered: an article consists of a variable number of inconsistent text and image instances. Multi-modal Multi-instance Multi-label (M3) learning provides a framework for handling such tasks and has exhibited excellent performance. How to exploit label correlation effectively is a further challenge. In this paper, we propose a novel Multi-modal Multi-instance Multi-label Deep Network (M3DN), which learns the label prediction and exploits label correlation simultaneously based on Optimal Transport, by considering the consistency principle between the bag-level predictions of different modalities and the learned latent ground label metric. Experiments on benchmark datasets and the real-world WKG Game-Hub dataset validate the effectiveness of the proposed method.
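As a rough illustration of the optimal-transport idea mentioned in the abstract, the sketch below compares a predicted label distribution with a target one under a ground cost between labels. It is not the authors' M3DN implementation: it uses the standard Sinkhorn iteration for entropy-regularised optimal transport, and the function name `sinkhorn_cost`, the parameters `reg` and `n_iters`, and the hand-written cost matrix are all assumptions made for this example (in the paper the ground label metric is learned, not fixed).

```python
import numpy as np


def sinkhorn_cost(pred, target, cost, reg=0.1, n_iters=200):
    """Entropy-regularised OT cost between two label histograms.

    Hypothetical helper for illustration only.
    pred, target: non-negative label scores (normalised to sum to 1 here).
    cost: (L, L) ground cost between labels, fixed by hand in this sketch.
    Returns the transport cost and the transport plan.
    """
    a = np.asarray(pred, dtype=float)
    b = np.asarray(target, dtype=float)
    a = a / a.sum()
    b = b / b.sum()
    cost = np.asarray(cost, dtype=float)

    # Gibbs kernel; a smaller reg gives a sharper plan but risks underflow.
    K = np.exp(-cost / reg)
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (K.T @ u)   # rescale to match the column marginal (target)
        u = a / (K @ v)     # rescale to match the row marginal (pred)

    plan = u[:, None] * K * v[None, :]
    return float((plan * cost).sum()), plan


# Three labels on a line: labels 0 and 2 are the least related.
ground_cost = np.array([[0.0, 1.0, 4.0],
                        [1.0, 0.0, 1.0],
                        [4.0, 1.0, 0.0]])
prediction = [0.8, 0.1, 0.1]
near_cost, _ = sinkhorn_cost(prediction, [0.1, 0.8, 0.1], ground_cost)
far_cost, _ = sinkhorn_cost(prediction, [0.1, 0.1, 0.8], ground_cost)
print(near_cost < far_cost)
```

Under this cost matrix, predicting a label close to the true one is penalised less than predicting a distant one, which is how a ground metric lets a loss reflect label correlation.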
Pages: 2594 - 2603
Number of pages: 10
Related papers
50 in total
  • [21] Constrained instance clustering in multi-instance multi-label learning
    Pei, Yuanli
    Fern, Xiaoli Z.
    PATTERN RECOGNITION LETTERS, 2014, 37 : 107 - 114
  • [22] Meta Multi-Instance Multi-Label learning by heterogeneous network fusion
    Qiu, Sichao
    Wang, Mengyi
    Yang, Yuanlin
    Yu, Guoxian
    Wang, Jun
    Yan, Zhongmin
    Domeniconi, Carlotta
    Guo, Maozu
    INFORMATION FUSION, 2023, 94 : 272 - 283
  • [23] Metric Learning-Based Multi-Instance Multi-Label Classification With Label Correlation
    Hu, Haifeng
    Cui, Zhikai
    Wu, Jiansheng
    Wang, Kun
    IEEE ACCESS, 2019, 7 : 109899 - 109909
  • [24] Correlative Multi-Label Multi-Instance Image Annotation
    Xue, Xiangyang
    Zhang, Wei
    Zhang, Jie
    Wu, Bin
    Fan, Jianping
    Lu, Yao
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011 : 651 - 658
  • [25] A FRAMEWORK OF HASHING FOR MULTI-INSTANCE MULTI-LABEL LEARNING
    Liu, Man
    Xu, Xinshun
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2015, 11 (03) : 921 - 934
  • [26] Nearest neighbor-based approaches for multi-instance multi-label classification
    Zafra, Amelia
    Gibaja, Eva
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [27] Hierarchical multi-instance multi-label learning for Chinese patent text classification
    Liu, Yunduo
    Xu, Fang
    Zhao, Yushan
    Ma, Zichen
    Wang, Tengke
    Zhang, Shunxiang
    Tian, Yuhao
    CONNECTION SCIENCE, 2024, 36 (01)
  • [28] A New multi-instance multi-label learning approach for image and text classification
    Yan, Kaobi
    Li, Zhixin
    Zhang, Canlong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (13) : 7875 - 7890
  • [29] A New multi-instance multi-label learning approach for image and text classification
    Kaobi Yan
    Zhixin Li
    Canlong Zhang
    Multimedia Tools and Applications, 2016, 75 : 7875 - 7890
  • [30] A multi-instance multi-label learning algorithm based on instance correlations
    Liu, Chanjuan
    Chen, Tongtong
    Ding, Xinmiao
    Zou, Hailin
    Tong, Yan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (19) : 12263 - 12284