Disentangled representation learning in cardiac image analysis

被引:115
|
作者
Chartsias, Agisilaos [1 ]
Joyce, Thomas [1 ]
Papanastasiou, Giorgos [2 ,3 ]
Semple, Scott [2 ,3 ]
Williams, Michelle [2 ,3 ]
Newby, David E. [2 ,3 ]
Dharmakumar, Rohan [4 ]
Tsaftaris, Sotirios A. [1 ,5 ]
机构
[1] Univ Edinburgh, Inst Digital Commun, Sch Engn, West Mains Rd, Edinburgh EH9 3FB, Midlothian, Scotland
[2] Edinburgh Imaging Facil QMRI, Edinburgh EH16 4TJ, Midlothian, Scotland
[3] Ctr Cardiovasc Sci, Edinburgh EH16 4TJ, Midlothian, Scotland
[4] Cedars Sinai Med Ctr, Los Angeles, CA 90048 USA
[5] Alan Turing Inst, London, England
基金
美国国家卫生研究院; 英国工程与自然科学研究理事会;
关键词
Disentangled representation learning; Cardiac magnetic resonance imaging; Semi-supervised segmentation; Multitask learning; WHOLE HEART SEGMENTATION;
D O I
10.1016/j.media.2019.101535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typically, a medical image offers spatial information on the anatomy (and pathology) modulated by imaging specific characteristics. Many imaging modalities including Magnetic Resonance Imaging (MRI) and Computed Tomography (CT) can be interpreted in this way. We can venture further and consider that a medical image naturally factors into some spatial factors depicting anatomy and factors that denote the imaging characteristics. Here, we explicitly learn this decomposed (disentangled) representation of imaging data, focusing in particular on cardiac images. We propose Spatial Decomposition Network (SDNet), which factorises 2D medical images into spatial anatomical factors and non-spatial modality factors. We demonstrate that this high-level representation is ideally suited for several medical image analysis tasks, such as semi-supervised segmentation, multi-task segmentation and regression, and image-to-image synthesis. Specifically, we show that our model can match the performance of fully supervised segmentation models, using only a fraction of the labelled images. Critically, we show that our factorised representation also benefits from supervision obtained either when we use auxiliary tasks to train the model in a multi-task setting (e.g. regressing to known cardiac indices), or when aggregating multimodal data from different sources (e.g. pooling together MRI and CT data). To explore the properties of the learned factorisation, we perform latent-space arithmetic and show that we can synthesise CT from MR and vice versa, by swapping the modality factors. We also demonstrate that the factor holding image specific information can be used to predict the input modality with high accuracy. Code will be made available at https://github.comiagis85/anatomy_modality_decomposition. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Multimodal Cardiac Segmentation Using Disentangled Representation Learning
    Chartsias, Agisilaos
    Papanastasiou, Giorgos
    Wang, Chengjia
    Stirrat, Colin
    Semple, Scott
    Newby, David
    Dharmakumar, Rohan
    Tsaftaris, Sotirios A.
    STATISTICAL ATLASES AND COMPUTATIONAL MODELS OF THE HEART: MULTI-SEQUENCE CMR SEGMENTATION, CRT-EPIGGY AND LV FULL QUANTIFICATION CHALLENGES, 2020, 12009 : 128 - 137
  • [2] Disentangled Representation Learning for Controllable Person Image Generation
    Xu, Wenju
    Long, Chengjiang
    Nie, Yongwei
    Wang, Guanghui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6065 - 6077
  • [3] Disentangled Representation Learning
    Wang, Xin
    Chen, Hong
    Tang, Si'ao
    Wu, Zihao
    Zhu, Wenwu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 9677 - 9696
  • [4] Unsupervised Retina Image Synthesis via Disentangled Representation Learning
    Li, Kang
    Yu, Lequan
    Wang, Shujun
    Heng, Pheng-Ann
    SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2019, 2019, 11827 : 32 - 41
  • [5] Unsupervised face image deblurring via disentangled representation learning
    Hu, Yufan
    Xia, Junyong
    Liu, Hongmin
    Wang, Xing
    PATTERN RECOGNITION LETTERS, 2024, 183 : 9 - 16
  • [6] Unsupervised Cross-modality Cardiac Image Segmentation via Disentangled Representation Learning and Consistency Regularization
    Wang, Runze
    Zheng, Guoyan
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 517 - 526
  • [7] Triple disentangled representation learning for multimodal affective analysis
    Zhou, Ying
    Liang, Xuefeng
    Chen, Han
    Zhao, Yin
    Chen, Xin
    Yu, Lida
    INFORMATION FUSION, 2024, 114
  • [8] A Review of Disentangled Representation Learning
    Wen Z.-D.
    Wang J.-R.
    Wang X.-X.
    Pan Q.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (02): : 351 - 374
  • [9] Disentangled Representation Learning for Multimedia
    Wang, Xin
    Chen, Hong
    Zhu, Wenwu
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9702 - 9704
  • [10] An Interpretable Image Denoising Framework via Dual Disentangled Representation Learning
    Liang, Yunji
    Fan, Jiayuan
    Zheng, Xiaolong
    Wang, Yutong
    Huangfu, Luwen
    Ghavate, Vedant
    Yu, Zhiwen
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2016 - 2030