A deep multimodal generative and fusion framework for class-imbalanced multimodal data

被引:0
|
作者
Qing Li
Guanyuan Yu
Jun Wang
Yuehao Liu
机构
[1] Southwestern University of Finance and Economics,Fintech Innovation Center and School of Economic Information Engineering
来源
关键词
Multimodal classification; Class-imbalanced data; Deep multimodal generative adversarial network; Deep multimodal hybrid fusion network;
D O I
暂无
中图分类号
学科分类号
摘要
The purpose of multimodal classification is to integrate features from diverse information sources to make decisions. The interactions between different modalities are crucial to this task. However, common strategies in previous studies have been to either concatenate features from various sources into a single compound vector or input them separately into several different classifiers that are then assembled into a single robust classifier to generate the final prediction. Both of these approaches weaken or even ignore the interactions among different feature modalities. In addition, in the case of class-imbalanced data, multimodal classification becomes troublesome. In this study, we propose a deep multimodal generative and fusion framework for multimodal classification with class-imbalanced data. This framework consists of two modules: a deep multimodal generative adversarial network (DMGAN) and a deep multimodal hybrid fusion network (DMHFN). The DMGAN is used to handle the class imbalance problem. The DMHFN identifies fine-grained interactions and integrates different information sources for multimodal classification. Experiments on a faculty homepage dataset show the superiority of our framework compared to several start-of-the-art methods.
引用
收藏
页码:25023 / 25050
页数:27
相关论文
共 50 条
  • [21] Class prediction for high-dimensional class-imbalanced data
    Rok Blagus
    Lara Lusa
    BMC Bioinformatics, 11
  • [22] A survey of multimodal deep generative models
    Suzuki, Masahiro
    Matsuo, Yutaka
    ADVANCED ROBOTICS, 2022, 36 (5-6) : 261 - 278
  • [23] Adversarial Kernel Sampling on Class-imbalanced Data Streams
    Yang, Peng
    Li, Ping
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2352 - 2362
  • [24] ViT and attention fusion for class-imbalanced PCB defect detection
    Chen, Junying
    Li, Zhaoyang
    Xi, Yueyun
    Liu, Chong
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2024, 45 (04): : 294 - 306
  • [25] Class-imbalanced classifiers for high-dimensional data
    Lin, Wei-Jiun
    Chen, James J.
    BRIEFINGS IN BIOINFORMATICS, 2013, 14 (01) : 13 - 26
  • [26] Clustering-based undersampling in class-imbalanced data
    Lin, Wei-Chao
    Tsai, Chih-Fong
    Hu, Ya-Han
    Jhang, Jing-Shang
    INFORMATION SCIENCES, 2017, 409 : 17 - 26
  • [27] Deep Occlusion Framework for Multimodal Earth Observation Data
    Ekim, Burak
    Schmitt, Michael
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [28] SMOTE for high-dimensional class-imbalanced data
    Rok Blagus
    Lara Lusa
    BMC Bioinformatics, 14
  • [29] SMOTE for high-dimensional class-imbalanced data
    Blagus, Rok
    Lusa, Lara
    BMC BIOINFORMATICS, 2013, 14
  • [30] Multimodal dual perception fusion framework for multimodal affective analysis
    Lu, Qiang
    Sun, Xia
    Long, Yunfei
    Zhao, Xiaodi
    Zou, Wang
    Feng, Jun
    Wang, Xuxin
    INFORMATION FUSION, 2025, 115