Automated interpretation of congenital heart disease from multi-view echocardiograms

Cited by: 47

Authors:

Wang, Jing [1]

Liu, Xiaofeng [3,6]

Wang, Fangyun [2]

Zheng, Lin [2]

Gao, Fengqiao [1]

Zhang, Hanwen [1]

Zhang, Xin [2]

Xie, Wanqing [4,6]

Wang, Binbin [5]

Affiliations:

[1] Capital Med Univ, Sch Basic Med Sci, Dept Med Genet & Dev Biol, Beijing 10069, Peoples R China

[2] Capital Med Univ, Beijing Childrens Hosp, Heart Ctr, Natl Ctr Childrens Hlth, Beijing 10045, Peoples R China

[3] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15232 USA

[4] Harbin Engn Univ, Coll Math Sci, Harbin, Peoples R China

[5] Natl Res Inst Family Planning, Ctr Genet, Beijing, Peoples R China

[6] Harvard Univ, Harvard Med Sch, Boston, MA 02215 USA

Source:

MEDICAL IMAGE ANALYSIS | 2021 / Vol. 69

Funding:

National Natural Science Foundation of China;

Keywords:

Congenital heart disease; Multi-view learning; Multi-channel networks; Neural aggregation; PREVALENCE;

DOI:

10.1016/j.media.2020.101942

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Congenital heart disease (CHD) is the most common birth defect and the leading cause of neonatal death in China. Clinical diagnosis can be based on selected 2D key-frames from five views. Limited by the availability of multi-view data, most methods have to rely on insufficient single-view analysis. This study proposes to automatically analyze multi-view echocardiograms with a practical end-to-end framework. We collect five-view echocardiogram video records of 1308 subjects (including normal controls, ventricular septal defect (VSD) patients, and atrial septal defect (ASD) patients) with both disease labels and standard-view key-frame labels. Depthwise separable convolution-based multi-channel networks are adopted to substantially reduce the number of network parameters. We also address the class imbalance problem by augmenting the positive training samples. Our 2D key-frame model distinguishes CHD from negative samples with an accuracy of 95.4%, and classifies samples as negative, VSD, or ASD with an accuracy of 92.3%. To further reduce the work of key-frame selection in real-world implementation, we propose an adaptive soft attention scheme to directly explore the raw video data. Four kinds of neural aggregation methods are systematically investigated to fuse the information of an arbitrary number of frames in a video. Moreover, with a view detection module, the system can work without the view records. Our video-based model achieves an accuracy of 93.9% (binary classification) and 92.1% (3-class classification) on a collected 2D video testing set, without requiring key-frame selection or view annotation at test time. A detailed ablation study and an interpretability analysis are provided. The presented model has high diagnostic rates for VSD and ASD and could potentially be applied in clinical practice in the future. The short-term automated machine learning process can partially replace and promote the long-term professional training of primary doctors, improving the primary diagnosis rate of CHD in China and laying the foundation for early diagnosis and timely treatment of children with CHD. (c) 2020 Elsevier B.V. All rights reserved.
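Two ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the dot-product scoring, and the single query vector are all assumptions made for illustration. The first two functions compare weight counts (biases ignored) for a standard convolution and a depthwise separable one; the third fuses an arbitrary number of per-frame feature vectors into one vector using softmax attention weights.

```python
import math
from typing import List, Sequence


def standard_conv_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    return k * k * c_in + c_in * c_out


def soft_attention_aggregate(
    frames: Sequence[Sequence[float]], query: Sequence[float]
) -> List[float]:
    """Fuse any number of frame feature vectors into a single vector.

    `query` is a hypothetical learned vector; each frame is scored by its
    dot product with it, and the scores are turned into weights by softmax.
    """
    if not frames:
        raise ValueError("need at least one frame")
    scores = [sum(f * q for f, q in zip(frame, query)) for frame in frames]
    # Subtract the maximum score for a numerically stable softmax.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(frames[0])
    # Weighted sum over frames, one output value per feature dimension.
    return [sum(w * frame[d] for w, frame in zip(weights, frames)) for d in range(dim)]
```

For a 3 x 3 layer with 64 input and 128 output channels, the two counting functions give 73728 and 8768 weights respectively. The aggregated vector has the same length as one frame's features regardless of how many frames are passed in, which is what allows a fixed-size classifier to follow a variable-length video.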


Pages: 12
