Complex Object Classification: A Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport

被引:36
|
作者
Yang, Yang [1 ]
Wu, Yi-Feng [1 ]
Zhan, De-Chuan [1 ]
Liu, Zhi-Bin [2 ]
Jiang, Yuan [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[2] Tencent WXG, Shenzhen, Peoples R China
基金
国家重点研发计划;
关键词
Multi-modal; Multi-instance; Multi-label; Optimal Transport;
D O I
10.1145/3219819.3220012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real world applications, complex objects are usually with multiple labels, and can be represented as multiple modal representations, e.g., the complex articles contain text and image information as well as are with multiple annotations. Previous methods assume that the homogeneous multi-modal data are consistent, while in real applications, the raw data are disordered, i.e., the article is constituted with variable number of inconsistent text and image instances. To solve this problem, Multi-modal Multi-instance Multi-label (M3) learning provides a framework for handling such task and has exhibited excellent performance. Besides, how to effectively utilize label correlation is also a challenging issue. In this paper, we propose a novel Multi-modal Multi-instance Multi-label Deep Network (M3DN), which learns the label prediction and exploits label correlation simultaneously based on the Optimal Transport, by considering the consistency principle between different modal bag-level prediction and the learned latent ground label metric. Experiments on benchmark datasets and real world WKG Game-Hub dataset validate the effectiveness of the proposed method.
引用
收藏
页码:2594 / 2603
页数:10
相关论文
共 50 条
  • [41] ISAFusionNet: Involution and soft attention based deep multi-modal fusion network for multi-label skin lesion classification
    Mohammed, Hussein M. A.
    Omeroglu, Asli Nur
    Oral, Emin Argun
    Ozbek, I. Yucel
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 122
  • [42] Abductive multi-instance multi-label learning for periodontal disease classification with prior domain knowledge
    Wu, Zi-Yuan
    Guo, Wei
    Zhou, Wei
    Ye, Han-Jia
    Jiang, Yuan
    Li, Houxuan
    Zhou, Zhi-Hua
    MEDICAL IMAGE ANALYSIS, 2025, 101
  • [43] A Multi-instance Multi-label Dual Learning Approach for Video Captioning
    Ji, Wanting
    Wang, Ruili
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (02)
  • [44] A Multi-Instance Multi-Label Learning Approach for Protein Domain Annotation
    Meng, Yang
    Deng, Lei
    Chen, Zhigang
    Zhou, Cheng
    Liu, Diwei
    Fan, Chao
    Yan, Ting
    INTELLIGENT COMPUTING IN BIOINFORMATICS, 2014, 8590 : 104 - 111
  • [45] Learning a Distance Metric from Multi-instance Multi-label Data
    Jin, Rong
    Wang, Shijun
    Zhou, Zhi-Hua
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 896 - +
  • [46] Improving Multi-Instance Multi-Label Learning by Extreme Learning Machine
    Yin, Ying
    Zhao, Yuhai
    Li, Chengguang
    Zhang, Bin
    APPLIED SCIENCES-BASEL, 2016, 6 (06):
  • [47] An Explainable Multi-Instance Multi-Label Classification Model for Full Slice Brain CT Images
    Song, Changwei
    Fu, Guanghui
    Li, Jianqiang
    Pei, Yan
    IFAC PAPERSONLINE, 2020, 53 (05): : 780 - 785
  • [48] Multi-label Supervised Manifold Ranking for Multi-instance Image Retrieval
    Zeng, Xianhua
    Lv, Renjie
    Lian, Hao
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, RSKT 2014, 2014, 8818 : 423 - 431
  • [49] A Multi-instance Multi-label Learning Algorithm Based on Feature Selection
    Chen Tong-tong
    Liu Chan-juan
    Zou Hai-lin
    Shen Qian
    Liu Ying
    Ding Xin-miao
    2015 10TH INTERNATIONAL CONFERENCE ON BROADBAND AND WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS (BWCCA 2015), 2015, : 587 - 590
  • [50] Discover Multiple Novel Labels in Multi-Instance Multi-Label Learning
    Zhu, Yue
    Ting, Kai Ming
    Zhou, Zhi-Hua
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2977 - 2983