End-to-End diagnosis of breast biopsy images with transformers

Cited by: 23

Authors:

Mehta, Sachin [1]

Lu, Ximing [1]

Wu, Wenjun [1]

Weaver, Donald [2]

Hajishirzi, Hannaneh [1]

Elmore, Joann G. [3]

Shapiro, Linda G. [1,4]

Affiliations:

[1] Univ Washington, Seattle, WA USA

[2] Univ Vermont Coll Med, Dept Pathol, Burlington, VT USA

[3] Univ Calif Los Angeles, David Geffen Sch Med, Los Angeles, CA USA

[4] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA

Source:

MEDICAL IMAGE ANALYSIS | 2022 / Volume 79

Keywords:

Transformers; Histopathological images; Breast cancer; Image classification; Convolutional neural networks; Whole slide images; CANCER; CLASSIFICATION; FRAMEWORK; NETWORKS; REGIONS;

DOI:

10.1016/j.media.2022.102466

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Diagnostic disagreements among pathologists occur throughout the spectrum of benign to malignant lesions. A computer-aided diagnostic system capable of reducing uncertainties would have important clinical impact. To develop a computer-aided diagnosis method for classifying breast biopsy images into a range of diagnostic categories (benign, atypia, ductal carcinoma in situ, and invasive breast cancer), we introduce a transformer-based holistic attention network called HATNet. Unlike state-of-the-art histopathological image classification systems that use a two-pronged approach, i.e., they first learn local representations using a multi-instance learning framework and then combine these local representations to produce image-level decisions, HATNet streamlines the histopathological image classification pipeline and shows how to learn representations from gigapixel-size images end-to-end. HATNet extends the bag-of-words approach and uses self-attention to encode global information, allowing it to learn representations from clinically relevant tissue structures without any explicit supervision. It outperforms the previous best network Y-Net, which uses supervision in the form of tissue-level segmentation masks, by 8%. Importantly, our analysis reveals that HATNet learns representations from clinically relevant structures, and it matches the classification accuracy of 87 U.S. pathologists for this challenging test set. (c) 2022 Elsevier B.V. All rights reserved.
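
The abstract's core idea, treating an image as a bag of local embeddings and letting self-attention mix in global context, can be illustrated with a minimal sketch. This is not the HATNet implementation: the function names, the toy embeddings, the identity query/key/value projections (learned weights omitted), and the mean-pooling step are all assumptions made for illustration only.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(patches):
    """Scaled dot-product self-attention with identity projections (assumption).

    patches: list of n embeddings, each a list of d floats.
    Returns n context-aware embeddings of the same dimension d.
    """
    d = len(patches[0])
    out = []
    for q in patches:
        # Similarity of this patch to every patch in the bag, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in patches]
        weights = softmax(scores)
        # Weighted sum of all patch embeddings (the "values").
        out.append([sum(w * v[j] for w, v in zip(weights, patches)) for j in range(d)])
    return out

def image_embedding(patches):
    """Mean-pool the attended patch embeddings into one image-level vector."""
    attended = self_attention(patches)
    n = len(attended)
    return [sum(p[j] for p in attended) / n for j in range(len(attended[0]))]

# Hypothetical bag of 3 patch embeddings with d = 2.
bag = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
vec = image_embedding(bag)
```

Because every patch attends to every other patch, each output embedding already carries information from the whole bag before pooling, which is the sense in which self-attention "encodes global information" here. A classifier head over `vec` would be the next step in such a pipeline, but that part is left out of this sketch.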


Pages: 13
