CMIM: CROSS-MODAL INFORMATION MAXIMIZATION FOR MEDICAL IMAGING

Cited by: 2
Authors
Sylvain, Tristan [1 ,2 ]
Dutil, Francis [3 ]
Berthier, Tess [3 ]
Di Jorio, Lisa [3 ]
Luck, Margaux [1 ,2 ]
Hjelm, Devon [4 ]
Bengio, Yoshua [1 ,2 ]
Affiliations
[1] Mila, Montreal, PQ, Canada
[2] Univ Montreal, Montreal, PQ, Canada
[3] Imagia Cybernet, Montreal, PQ, Canada
[4] Microsoft Res, Montreal, PQ, Canada
Keywords
Deep learning; Medical Imaging; Multimodal data; Classification; Segmentation;
DOI
10.1109/ICASSP39728.2021.9414132
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
In hospitals, data are siloed in specific information systems that make the same information available under different modalities, such as the different medical imaging exams a patient undergoes (CT, MRI, PET, ultrasound, etc.) and their associated radiology reports. This offers a unique opportunity to obtain and use, at train time, multiple views of the same information that might not always be available at test time. In this paper, we propose a framework that makes the most of the available data by learning representations of a multi-modal input that are resilient to modality dropping at test time, using recent advances in mutual information maximization. By maximizing cross-modal information at train time, we outperform several state-of-the-art baselines in two different settings: medical image classification and segmentation. In particular, our method is shown to have a strong impact on the inference-time performance of weaker modalities.
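The abstract does not say which mutual information estimator is used, so the following is only an illustrative sketch of the general idea, assuming an InfoNCE-style contrastive objective. The function name `info_nce_loss`, the `temperature` value, and the synthetic "image" and "report" embeddings are all hypothetical and not taken from the paper. For a batch of N pairs, log N minus this loss gives a lower bound on the mutual information between the two embeddings, so minimizing the loss pushes that bound up.

```python
import numpy as np

def info_nce_loss(z_a, z_b, temperature=0.1):
    """InfoNCE loss between paired embeddings of two modalities.

    z_a, z_b: arrays of shape (batch, dim); row i of each is assumed to
    come from the same patient (a positive pair). All other rows in the
    batch act as negatives.
    """
    # L2-normalise so that dot products are cosine similarities.
    z_a = z_a / np.linalg.norm(z_a, axis=1, keepdims=True)
    z_b = z_b / np.linalg.norm(z_b, axis=1, keepdims=True)
    logits = z_a @ z_b.T / temperature  # shape (batch, batch)
    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positive pairs sit on the diagonal.
    return -np.mean(np.diag(log_probs))

# Synthetic stand-ins for encoder outputs (assumption: no real data here).
rng = np.random.default_rng(0)
shared = rng.normal(size=(8, 16))                      # information common to both views
image_emb = shared + 0.05 * rng.normal(size=(8, 16))   # hypothetical image embedding
report_emb = shared + 0.05 * rng.normal(size=(8, 16))  # hypothetical report embedding
unrelated = rng.normal(size=(8, 16))                   # embedding sharing nothing

aligned_loss = info_nce_loss(image_emb, report_emb)
random_loss = info_nce_loss(image_emb, unrelated)
print(aligned_loss, random_loss)
```

In this toy setup the loss is lower for the embeddings that share information than for the unrelated ones. In a train-time setting like the one the abstract describes, such a term would be added to the task loss so that each modality's encoder keeps information shared with the others, even when only one modality is present at test time.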
Pages: 1190 - 1194
Page count: 5
Related papers
50 in total
  • [21] Cross-modal information flows in highly automated vehicles
    Savchenko, V. V.
    Poddubko, S. N.
    INTERNATIONAL AUTOMOBILE SCIENTIFIC FORUM (IASF-2018), INTELLIGENT TRANSPORT SYSTEM TECHNOLOGIES AND COMPONENTS, 2019, 534
  • [22] Cross-modal source information and spoken word recognition
    Lachs, L
    Pisoni, DB
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2004, 30 (02) : 378 - 396
  • [23] Combining Generic and Specific Information for Cross-modal Retrieval
    Thi Quynh Nhi Tran
    Le Borgne, Herve
    Crucianu, Michel
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 551 - 554
  • [24] Is Cross-Modal Information Retrieval Possible Without Training?
    Choi, Hyunjin
    Lee, Hyunjae
    Joe, Seongho
    Gwon, Youngjune
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT II, 2023, 13981 : 377 - 385
  • [25] Graph Embedding Learning for Cross-Modal Information Retrieval
    Zhang, Youcai
    Gu, Xiaodong
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 594 - 601
  • [26] Acoustic NLOS Imaging with Cross-Modal Knowledge Distillation
    Shin, Ui-Hyeon
    Jang, Seungwoo
    Kim, Kwangsu
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1405 - 1413
  • [27] Power control and resource allocation for return on investment maximization in cross-modal communications
    Wen, Mengtian
    Zhang, Zhe
    Chen, Jianxin
    Wei, Xin
    Chen, Mingkai
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2023, 34 (01)
  • [28] Multi-kernel Hashing with Semantic Correlation Maximization for Cross-Modal Retrieval
    Yang, Guangfei
    Miao, Huanghui
    Tang, Jun
    Liang, Dong
    Wang, Nian
    IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 23 - 34
  • [29] HCMSL: Hybrid Cross-modal Similarity Learning for Cross-modal Retrieval
    Zhang, Chengyuan
    Song, Jiayu
    Zhu, Xiaofeng
    Zhu, Lei
    Zhang, Shichao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [30] Contrastive Cross-Modal Pre-Training: A General Strategy for Small Sample Medical Imaging
    Liang, Gongbo
    Greenwell, Connor
    Zhang, Yu
    Xing, Xin
    Wang, Xiaoqin
    Kavuluru, Ramakanth
    Jacobs, Nathan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (04) : 1640 - 1649