A unified multimodal classification framework based on deep metric learning

被引:0
|
作者
Peng, Liwen [1 ,2 ]
Jian, Songlei [2 ]
Li, Minne [1 ]
Kan, Zhigang [1 ]
Qiao, Linbo [2 ]
Li, Dongsheng [2 ]
机构
[1] Intelligent Game & Decis Lab, Beijing 100080, Peoples R China
[2] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimodal classification; Deep metric learning; Multimodal learning; Fake news detection; Sentiment analysis; FUSION;
D O I
10.1016/j.neunet.2024.106747
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal classification algorithms play an essential role in multimodal machine learning, aiming to categorize distinct data points by analyzing data characteristics from multiple modalities. Extensive research has been conducted on distilling multimodal attributes and devising specialized fusion strategies for targeted classification tasks. Nevertheless, current algorithms mainly concentrate on a specific classification task and process data about the corresponding modalities. To address these limitations, we propose a unified multimodal classification framework proficient in handling diverse multimodal classification tasks and processing data from disparate modalities. UMCF is task-independent, and its unimodal feature extraction module can be adaptively substituted to accommodate data from diverse modalities. Moreover, we construct the multimodal learning scheme based on deep metric learning to mine latent characteristics within multimodal data. Specifically, we design the metric-based triplet learning to extract the intra-modal relationships within each modality and the contrastive pairwise learning to capture the inter-modal relationships across various modalities. Extensive experiments on two multimodal classification tasks, fake news detection and sentiment analysis, demonstrate that UMCF can extract multimodal data features and achieve superior classification performance than task- specific benchmarks. UMCF outperforms the best fake news detection baselines by 2.3% on average regarding F1 scores.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Metric Learning based Framework for Streaming Classification with Concept Evolution
    Wang, Zhuoyi
    Tao, Hemeng
    Kong, Zelun
    Chandra, Swarup
    Khan, Latifur
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [22] A Multi-label Multimodal Deep Learning Framework for Imbalanced Data Classification
    Pouyanfar, Samira
    Wang, Tianyi
    Chen, Shu-Ching
    2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 199 - 204
  • [23] A Context-Supported Deep Learning Framework for Multimodal Brain Imaging Classification
    Jiang, Jianmin
    Fares, Ahmed
    Zhong, Sheng-Hua
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2019, 49 (06) : 611 - 622
  • [24] A Unified Metric Learning-Based Framework for Co-Saliency Detection
    Han, Junwei
    Cheng, Gong
    Li, Zhenpeng
    Zhang, Dingwen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2473 - 2483
  • [25] Deep Learning Framework for Classification of Emoji Based Sentiments
    Shaikh, Nighat Parveen
    Mahar, Mumtaz Hussain
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3145 - 3158
  • [26] A unified supervised codebook learning framework for classification
    Lang, Congyan
    Feng, Songhe
    Cheng, Bing
    Ni, Bingbing
    Yan, Shuicheng
    NEUROCOMPUTING, 2012, 77 (01) : 281 - 288
  • [27] A Unified Multiple Proxy Deep Metric Learning Framework Embedded With Distribution Optimization for Fine-Grained Ship Classification in Remote Sensing Images
    Xu, Jianwen
    Lang, Haitao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 5604 - 5620
  • [28] Image matching based on a structured deep coupled metric learning framework
    Fu, Guixia
    Zou, Guofeng
    Gao, Mingliang
    Wang, Zhenzhou
    Liu, Zheng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (06) : 1649 - 1657
  • [29] Image matching based on a structured deep coupled metric learning framework
    Guixia Fu
    Guofeng Zou
    Mingliang Gao
    Zhenzhou Wang
    Zheng Liu
    Signal, Image and Video Processing, 2022, 16 : 1649 - 1657
  • [30] Deep Metric Learning Based Citrus Disease Classification With Sparse Data
    Janarthan, Sivasubramaniam
    Thuseethan, Selvarajah
    Rajasegarar, Sutharshan
    Lyu, Qiang
    Zheng, Yongqiang
    Yearwood, John
    IEEE ACCESS, 2020, 8 : 162588 - 162600