Multimodal fusion recognition for digital twin

被引:4
|
作者
Zhou, Tianzhe [1 ]
Zhang, Xuguang [1 ]
Kang, Bing [1 ]
Chen, Mingkai [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Key Lab Broadband Wireless Commun & Sensor Network, Minist Educ, Nanjing 210003, Peoples R China
关键词
Digital twin; Multimodal fusion; Object recognition; Deep learning; Transfer learning; CLASSIFICATION; NETWORKS; FEATURES;
D O I
10.1016/j.dcan.2022.10.009
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
The digital twin is the concept of transcending reality, which is the reverse feedback from the real physical space to the virtual digital space. People hold great prospects for this emerging technology. In order to realize the upgrading of the digital twin industrial chain, it is urgent to introduce more modalities, such as vision, haptics, hearing and smell, into the virtual digital space, which assists physical entities and virtual objects in creating a closer connection. Therefore, perceptual understanding and object recognition have become an urgent hot topic in the digital twin. Existing surface material classification schemes often achieve recognition through machine learning or deep learning in a single modality, ignoring the complementarity between multiple modalities. In order to overcome this dilemma, we propose a multimodal fusion network in our article that combines two modalities, visual and haptic, for surface material recognition. On the one hand, the network makes full use of the potential correlations between multiple modalities to deeply mine the modal semantics and complete the data mapping. On the other hand, the network is extensible and can be used as a universal architecture to include more modalities. Experiments show that the constructed multimodal fusion network can achieve 99.42% classification accuracy while reducing complexity.
引用
收藏
页码:337 / 346
页数:10
相关论文
共 50 条
  • [11] Multimodal Emotion Recognition Based on Feature Fusion
    Xu, Yurui
    Wu, Xiao
    Su, Hang
    Liu, Xiaorui
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 7 - 11
  • [12] MULTIMODAL TRANSFORMER FUSION FOR CONTINUOUS EMOTION RECOGNITION
    Huang, Jian
    Tao, Jianhua
    Liu, Bin
    Lian, Zheng
    Niu, Mingyue
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3507 - 3511
  • [13] Fusion with Hierarchical Graphs for Multimodal Emotion Recognition
    Tang, Shuyun
    Luo, Zhaojie
    Nan, Guoshun
    Baba, Jun
    Yoshikawa, Yuichiro
    Ishiguro, Hiroshi
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1288 - 1296
  • [14] Fusion Architectures for Multimodal Cognitive Load Recognition
    Kindsvater, Daniel
    Meudt, Sascha
    Schwenker, Friedhelm
    MULTIMODAL PATTERN RECOGNITION OF SOCIAL SIGNALS IN HUMAN-COMPUTER-INTERACTION, MPRSS 2016, 2017, 10183 : 36 - 47
  • [15] CONTINUOUS VISUAL SPEECH RECOGNITION FOR MULTIMODAL FUSION
    Benhaim, Eric
    Sahbi, Hichem
    Vitte, Guillaume
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [16] Weakly Paired Multimodal Fusion for Object Recognition
    Liu, Huaping
    Wu, Yupei
    Sun, Fuchun
    Fang, Bin
    Guo, Di
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2018, 15 (02) : 784 - 795
  • [17] A Multimodal Fusion Approach for Human Activity Recognition
    Koutrintzes, Dimitrios
    Spyrou, Evaggelos
    Mathe, Eirini
    Mylonas, Phivos
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (01)
  • [18] Multimodal fusion for alzheimer’s disease recognition
    Yangwei Ying
    Tao Yang
    Hong Zhou
    Applied Intelligence, 2023, 53 : 16029 - 16040
  • [19] Quality Fusion Based Multimodal Eye Recognition
    Zhou, Zhi
    Du, Eliza Yingzi
    Belcher, Craig
    Thomas, N. Luke
    Delp, Edward J.
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1297 - 1302
  • [20] Multimodal Transformer Fusion for Emotion Recognition: A Survey
    Belaref, Amdjed
    Seguier, Renaud
    2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 107 - 113