Automated interpretation of congenital heart disease from multi-view echocardiograms

被引:47
|
作者
Wang, Jing [1 ]
Liu, Xiaofeng [3 ,6 ]
Wang, Fangyun [2 ]
Zheng, Lin [2 ]
Gao, Fengqiao [1 ]
Zhang, Hanwen [1 ]
Zhang, Xin [2 ]
Xie, Wanqing [4 ,6 ]
Wang, Binbin [5 ]
机构
[1] Capital Med Univ, Sch Basic Med Sci, Dept Med Genet & Dev Biol, Beijing 10069, Peoples R China
[2] Capital Med Univ, Beijing Childrens Hosp, Heart Ctr, Natl Ctr Childrens Hlth, Beijing 10045, Peoples R China
[3] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15232 USA
[4] Harbin Engn Univ, Coll Math Sci, Harbin, Peoples R China
[5] Natl Res Inst Family Planning, Ctr Genet, Beijing, Peoples R China
[6] Harvard Univ, Harvard Med Sch, Boston, MA 02215 USA
基金
中国国家自然科学基金;
关键词
Congenital heart disease; Multi-view learning; Multi-channel networks; Neural aggregation; PREVALENCE;
D O I
10.1016/j.media.2020.101942
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Congenital heart disease (CHD) is the most common birth defect and the leading cause of neonate death in China. Clinical diagnosis can be based on the selected 2D key-frames from five views. Limited by the availability of multi-view data, most methods have to rely on the insufficient single view analysis. This study proposes to automatically analyze the multi-view echocardiograms with a practical end-to end framework. We collect the five-view echocardiograms video records of 1308 subjects (including normal controls, ventricular septal defect (VSD) patients and atrial septal defect (ASD) patients) with both disease labels and standard-view key-frame labels. Depthwise separable convolution-based multi-channel networks are adopted to largely reduce the network parameters. We also approach the imbalanced class problem by augmenting the positive training samples. Our 2D key-frame model can diagnose CHD or negative samples with an accuracy of 95.4%, and in negative, VSD or ASD classification with an accuracy of 92.3%. To further alleviate the work of key-frame selection in real-world implementation, we propose an adaptive soft attention scheme to directly explore the raw video data. Four kinds of neural aggregation methods are systematically investigated to fuse the information of an arbitrary number of frames in a video. Moreover, with a view detection module, the system can work without the view records. Our video-based model can diagnose with an accuracy of 93.9% (binary classification), and 92.1% (3-class classification) in a collected 2D video testing set, which does not need key-frame selection and view annotation in testing. The detailed ablation study and the interpretability analysis are provided. The presented model has high diagnostic rates for VSD and ASD that can be potentially applied to the clinical practice in the future. The short-term automated machine learning process can partially replace and promote the long-term professional training of primary doctors, improving the primary diagnosis rate of CHD in China, and laying the foundation for early diagnosis and timely treatment of children with CHD. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] DENSE MULTI-VIEW STEREO FROM SATELLITE IMAGERY
    d'Angelo, Pablo
    Kuschk, Georg
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 6944 - 6947
  • [42] Interactive object segmentation from multi-view images
    Thi Nhat Anh Nguyen
    Cai, Jianfei
    Zheng, Jianmin
    Li, Jianguo
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2013, 24 (04) : 477 - 485
  • [43] UTILITY OF SURVEILLANCE ECHOCARDIOGRAMS FOLLOWING TRANSCATHETER PULMONARY VALVE REPLACEMENT IN ADULTS WITH CONGENITAL HEART DISEASE
    Bishop, Travis
    Kostelyna, Stefan
    Weyland, Cassie
    Dolgner, Stephen
    Lam, Wilson W.
    Qureshi, Athar M.
    Parekh, Dhaval R.
    Salciccioli, Katherine B.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2023, 81 (08) : 1580 - 1580
  • [44] Multi-Layer Multi-View Classification for Alzheimer's Disease Diagnosis
    Zhang, Changqing
    Adeli, Ehsan
    Zhou, Tao
    Chen, Xiaobo
    Shen, Dinggang
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4406 - 4413
  • [45] Solid Reconstruction from Multi-view Engineering Drawings
    Fu, Zi-Gang
    Zou, Bei-Ji
    Wu, Ling
    Chen, Yi-Ming
    SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING: THEORY AND PRACTICE, VOL 1, 2012, 114 : 173 - +
  • [46] Performance capture from sparse multi-view video
    de Aguiar, Edilson
    Stoll, Carsten
    Theobalt, Christian
    Ahmed, Naveed
    Seidel, Hans-Peter
    Thrun, Sebastian
    ACM TRANSACTIONS ON GRAPHICS, 2008, 27 (03):
  • [47] Multi-view calibration from planar motion trajectories
    Jaynes, C
    IMAGE AND VISION COMPUTING, 2004, 22 (07) : 535 - 550
  • [48] Interactive Mechanism Modeling from Multi-view Images
    Xu, Mingliang
    Li, Mingyuan
    Xu, Weiwei
    Deng, Zhigang
    Yang, Yin
    Zhou, Kun
    ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06):
  • [49] Robust Face Recognition from Multi-View Videos
    Du, Ming
    Sankaranarayanan, Aswin C.
    Chellappa, Rama
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (03) : 1105 - 1117
  • [50] Articulated mesh animation from multi-view silhouettes
    Vlasic, Daniel
    Baran, Ilya
    Matusik, Wojciech
    Popovic, Jovan
    ACM TRANSACTIONS ON GRAPHICS, 2008, 27 (03):