Multi-modal data clustering using deep learning: A systematic review

被引:0
|
作者
Raya, Sura [1 ]
Orabi, Mariam [1 ]
Afyouni, Imad [1 ]
Al Aghbari, Zaher [1 ]
机构
[1] Univ Sharjah, Coll Comp & Informat, Sharjah, U Arab Emirates
关键词
Multi-modal data; Clustering algorithms; Deep learning; Review article; FRAMEWORK; INFORMATION; TRENDS;
D O I
10.1016/j.neucom.2024.128348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modal clustering represents a formidable challenge in the domain of unsupervised learning. The objective of multi-modal clustering is to categorize data collected from diverse modalities, such as audio, visual, and textual sources, into distinct clusters. These clustering techniques operate by extracting shared features across modalities in an unsupervised manner, where the identified common features exhibit high correlations within real-world objects. Recognizing the importance of perceiving the correlated nature of these features is vital for enhancing clustering accuracy in multi-modal settings. This survey explores Deep Learning (DL) techniques applied to multi-modal clustering, encompassing methodologies such as Convolutional Neural Networks (CNN), Autoencoders (AE), Recurrent Neural Networks (RNN), and Graph Convolutional Networks (GCN). Notably, this survey represents the first attempt to investigate DL techniques specifically for multi-modal clustering. The survey presents a novel taxonomy for DL-based multi-modal clustering, conducts a comparative analysis of various multi-modal clustering approaches, and deliberates on the datasets employed in the evaluation process. Additionally, the survey identifies research gaps within the realm of multi-modal clustering, offering insights into potential future avenues for research.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Multi-Modal Deep Learning for Vehicle Sensor Data Abstraction and Attack Detection
    Rofail, Mark
    Alsafty, Aysha
    Matousek, Matthias
    Kargl, Frank
    2019 IEEE INTERNATIONAL CONFERENCE OF VEHICULAR ELECTRONICS AND SAFETY (ICVES 19), 2019,
  • [32] Deep learning approaches for multi-modal sensor data analysis and abnormality detection
    Jadhav, Santosh Pandurang
    Srinivas, Angalkuditi
    Dipak Raghunath, Patil
    Ramkumar Prabhu, M.
    Suryawanshi, Jaya
    Haldorai, Anandakumar
    Measurement: Sensors, 33
  • [33] A Review of the Application of Multi-modal Deep Learning in Medicine: Bibliometrics and Future Directions
    Xiangdong Pei
    Ke Zuo
    Yuan Li
    Zhengbin Pang
    International Journal of Computational Intelligence Systems, 16
  • [34] A Review of the Application of Multi-modal Deep Learning in Medicine: Bibliometrics and Future Directions
    Pei, Xiangdong
    Zuo, Ke
    Li, Yuan
    Pang, Zhengbin
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [35] Learning to Hash on Partial Multi-Modal Data
    Wang, Qifan
    Si, Luo
    Shen, Bin
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3904 - 3910
  • [36] Multi-modal tumor segmentation methods based on deep learning: a narrative review
    Xue, Hengzhi
    Yao, Yudong
    Teng, Yueyang
    QUANTITATIVE IMAGING IN MEDICINE AND SURGERY, 2024, 14 (01) : 1122 - 1140
  • [37] Twitter Demographic Classification Using Deep Multi-modal Multi-task Learning
    Vijayaraghavan, Prashanth
    Vosoughi, Soroush
    Roy, Deb
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 478 - 483
  • [38] Multi-Modal and Multi-Scale Oral Diadochokinesis Analysis using Deep Learning
    Department of Electrical Engineering and Computer Science, University of Missouri, Columbia
    MO, United States
    不详
    MO, United States
    Proc. Appl. Imagery Pattern. Recogn. Workshop, 2021,
  • [39] Multi-Modal Data Analysis for Pneumonia Status Prediction Using Deep Learning (MDA-PSP)
    Sheu, Ruey-Kai
    Chen, Lun-Chi
    Wu, Chieh-Liang
    Pardeshi, Mayuresh Sunil
    Pai, Kai-Chih
    Huang, Chien-Chung
    Chen, Chia-Yu
    Chen, Wei-Cheng
    DIAGNOSTICS, 2022, 12 (07)
  • [40] A Massive Multi-Modal Perception Data Classification Method Using Deep Learning Based on Internet of Things
    Linli Jiang
    Chunmei Wu
    International Journal of Wireless Information Networks, 2020, 27 : 226 - 233