Dictionary-Induced Manifold Learning for Incomplete Multi-modal Fusion

被引:0
|
作者
Xu, Bingliang [1 ]
Ye, Haizhou [1 ]
Zhang, Zheng [2 ]
Zhang, Daoqiang [1 ]
Zhu, Qi [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
[2] Harbin Inst Technol, Shenzhen Key Lab Visual Object Detect & Recognit, Shenzhen 518055, Peoples R China
来源
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Data recovery; Multi-modal data fusion; Dictionary learning; Manifold learning;
D O I
10.1007/978-3-031-25198-6_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data missing is a common problem in multi-modal fusion, and existing incomplete multi-modal methods usually only consider the case of two modalities and ignore the semantic information of samples during data recovery. In this paper, we propose dictionary-induced manifold incomplete multi-modal latent space representation, which reconstructs missing views with dictionary to assist consensus representation and captures the local manifold structure with reverse graph regularization. Specifically, we adopt dictionary learning to recover missing data with linear combinations of available samples for latent space alignment, and Laplacian matrix is utilized to embed the structural information of the high-dimensional space into the low-dimensional manifold latent space for optimizing the common representation. The proposed method can not only deal with multi-modal data fusion task, but also recovering missing data by effectively mining the structural information among different modalities. Experimental results demonstrate that our method performs better than other incomplete multi-modal fusion methods.
引用
收藏
页码:529 / 537
页数:9
相关论文
共 50 条
  • [1] Multi-Modal Convolutional Dictionary Learning
    Gao, Fangyuan
    Deng, Xin
    Xu, Mai
    Xu, Jingyi
    Dragotti, Pier Luigi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1325 - 1339
  • [2] Supervised Multi-modal Dictionary Learning for Clothing Representation
    Zhao, Qilu
    Wang, Jiayan
    Li, Zongmin
    PROCEEDINGS OF THE FIFTEENTH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS - MVA2017, 2017, : 51 - 54
  • [3] Incomplete multi-modal brain image fusion for epilepsy classification
    Zhu, Qi
    Li, Huijie
    Ye, Haizhou
    Zhang, Zhiqiang
    Wang, Ran
    Fan, Zizhu
    Zhang, Daoqiang
    INFORMATION SCIENCES, 2022, 582 : 316 - 333
  • [4] On Multi-modal Fusion Learning in constraint propagation
    Li, Yaoyi
    Lu, Hongtao
    INFORMATION SCIENCES, 2018, 462 : 204 - 217
  • [5] Flexible Dual Multi-Modal Hashing for Incomplete Multi-Modal Retrieval
    Wei, Yuhong
    An, Junfeng
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2024,
  • [6] Multi-modal Fusion
    Liu, Huaping
    Hussain, Amir
    Wang, Shuliang
    INFORMATION SCIENCES, 2018, 432 : 462 - 462
  • [7] Dictionary-induced least squares framework for multi-view dimensionality reduction with multi-manifold embeddings
    Abeo, Timothy Apasiba
    Shen, Xiang-Jun
    Gou, Jian-Ping
    Mao, Qi-Rong
    Bao, Bing-Kun
    Li, Shuying
    IET COMPUTER VISION, 2019, 13 (02) : 97 - 108
  • [8] Semi-Supervised Multi-Modal Learning with Incomplete Modalities
    Yang, Yang
    Zhan, De-Chuan
    Sheng, Xiang-Rong
    Jiang, Yuan
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2998 - 3004
  • [9] RDMIF: Reverse dictionary model based on multi-modal information fusion
    Tian, Sicheng
    Huang, Shaobin
    Li, Rongsheng
    Wei, Chi
    NEUROCOMPUTING, 2025, 619
  • [10] Multi-modal deep convolutional dictionary learning for image denoising
    Sun, Zhonggui
    Zhang, Mingzhu
    Sun, Huichao
    Li, Jie
    Liu, Tingting
    Gao, Xinbo
    NEUROCOMPUTING, 2023, 562