Complex Object Classification: A Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport

被引:36
|
作者
Yang, Yang [1 ]
Wu, Yi-Feng [1 ]
Zhan, De-Chuan [1 ]
Liu, Zhi-Bin [2 ]
Jiang, Yuan [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[2] Tencent WXG, Shenzhen, Peoples R China
基金
国家重点研发计划;
关键词
Multi-modal; Multi-instance; Multi-label; Optimal Transport;
D O I
10.1145/3219819.3220012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real world applications, complex objects are usually with multiple labels, and can be represented as multiple modal representations, e.g., the complex articles contain text and image information as well as are with multiple annotations. Previous methods assume that the homogeneous multi-modal data are consistent, while in real applications, the raw data are disordered, i.e., the article is constituted with variable number of inconsistent text and image instances. To solve this problem, Multi-modal Multi-instance Multi-label (M3) learning provides a framework for handling such task and has exhibited excellent performance. Besides, how to effectively utilize label correlation is also a challenging issue. In this paper, we propose a novel Multi-modal Multi-instance Multi-label Deep Network (M3DN), which learns the label prediction and exploits label correlation simultaneously based on the Optimal Transport, by considering the consistency principle between different modal bag-level prediction and the learned latent ground label metric. Experiments on benchmark datasets and real world WKG Game-Hub dataset validate the effectiveness of the proposed method.
引用
收藏
页码:2594 / 2603
页数:10
相关论文
共 50 条
  • [1] Semi-Supervised Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport
    Yang, Yang
    Fu, Zhao-Yang
    Zhan, De-Chuan
    Liu, Zhi-Bin
    Jiang, Yuan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (02) : 696 - 709
  • [2] A Deep Multi-Modal CNN for Multi-Instance Multi-Label Image Classification
    Song, Lingyun
    Liu, Jun
    Qian, Buyue
    Sun, Mingxuan
    Yang, Kuan
    Sun, Meng
    Abbas, Samar
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (12) : 6025 - 6038
  • [3] Multi-Modal Multi-Instance Multi-Label Learning with Graph Convolutional Network
    Hang, Cheng
    Wang, Wei
    Zhan, De-Chuan
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Learning to Annotate Clothes in Everyday Photos: Multi-Modal, Multi-Label, Multi-Instance Approach
    Nogueira, Keiller
    Veloso, Adriano Alonso
    dos Santos, Jefersson A.
    2014 27TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2014, : 327 - 334
  • [5] Deep Multi-Label Multi-Instance Classification on 12-Lead ECG
    Feng, Yingjing
    Vigmond, Edward
    2020 COMPUTING IN CARDIOLOGY, 2020,
  • [6] Multi-label multi-instance learning with missing object tags
    Yi Shen
    Jinye Peng
    Xiaoyi Feng
    Jianping Fan
    Multimedia Systems, 2013, 19 : 17 - 36
  • [7] Multi-label multi-instance learning with missing object tags
    Shen, Yi
    Peng, Jinye
    Feng, Xiaoyi
    Fan, Jianping
    MULTIMEDIA SYSTEMS, 2013, 19 (01) : 17 - 36
  • [8] Multi-instance multi-label learning
    Zhou, Zhi-Hua
    Zhang, Min-Ling
    Huang, Sheng-Jun
    Li, Yu-Feng
    ARTIFICIAL INTELLIGENCE, 2012, 176 (01) : 2291 - 2320
  • [9] Multi-instance multi-label image classification: A neural approach
    Chen, Zenghai
    Chi, Zheru
    Fu, Hong
    Feng, Dagan
    NEUROCOMPUTING, 2013, 99 : 298 - 306
  • [10] Joint multi-label multi-instance learning for image classification
    Zha, Zheng-Jun
    Hua, Xian-Sheng
    Mei, Tao
    Wang, Jingdong
    Qi, Guo-Jun
    Wang, Zengfu
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 333 - +