Towards a general-purpose foundation model for computational pathology

被引:119
|
作者
Chen, Richard J. [1 ,2 ,3 ,4 ,5 ]
Ding, Tong [1 ,6 ]
Lu, Ming Y. [1 ,2 ,3 ,4 ,7 ]
Williamson, Drew F. K. [1 ,2 ,3 ]
Jaume, Guillaume [1 ,2 ,3 ,4 ]
Song, Andrew H. [1 ,2 ,3 ,4 ]
Chen, Bowen [1 ,2 ]
Zhang, Andrew [1 ,2 ,3 ,4 ,8 ]
Shao, Daniel [1 ,2 ,3 ,4 ,8 ]
Shaban, Muhammad [1 ,2 ,3 ,4 ]
Williams, Mane [1 ,2 ,3 ,4 ,5 ]
Oldenburg, Lukas [1 ]
Weishaupt, Luca L. [1 ,2 ,3 ,4 ,8 ]
Wang, Judy J. [1 ]
Vaidya, Anurag [1 ,2 ,3 ,4 ,8 ]
Le, Long Phi [2 ,8 ]
Gerber, Georg [1 ]
Sahai, Sharifa [1 ,2 ,3 ,4 ,9 ]
Williams, Walt [1 ,6 ]
Mahmood, Faisal [1 ,2 ,3 ,4 ,10 ]
机构
[1] Harvard Med Sch, Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[2] Harvard Med Sch, Massachusetts Gen Hosp, Dept Pathol, Boston, MA 02115 USA
[3] Broad Inst Harvard & MIT, Canc Program, Cambridge, MA 02142 USA
[4] Dana Farber Canc Inst, Canc Data Sci Program, Boston, MA 02215 USA
[5] Harvard Med Sch, Dept Biomed Informat, Boston, MA USA
[6] Harvard Univ, Harvard John A Paulson Sch Engn & Appl Sci, Cambridge, MA USA
[7] Massachusetts Inst Technol MIT, Elect Engn & Comp Sci, Cambridge, MA USA
[8] Harvard MIT, Hlth Sci & Technol, Cambridge, MA USA
[9] Harvard Univ, Dept Syst Biol, Cambridge, MA USA
[10] Harvard Univ, Harvard Data Sci Initiat, Cambridge, MA 02138 USA
基金
美国国家卫生研究院;
关键词
SOMATIC GENOMIC LANDSCAPE; ARTIFICIAL-INTELLIGENCE; CANCER; ADENOCARCINOMAS; BIOPSIES; FEATURES; SYSTEM;
D O I
10.1038/s41591-024-02857-3
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Quantitative evaluation of tissue images is crucial for computational pathology (CPath) tasks, requiring the objective characterization of histopathological entities from whole-slide images (WSIs). The high resolution of WSIs and the variability of morphological features present significant challenges, complicating the large-scale annotation of data for high-performance applications. To address this challenge, current efforts have proposed the use of pretrained image encoders through transfer learning from natural image datasets or self-supervised learning on publicly available histopathology datasets, but have not been extensively developed and evaluated across diverse tissue types at scale. We introduce UNI, a general-purpose self-supervised model for pathology, pretrained using more than 100 million images from over 100,000 diagnostic H&E-stained WSIs (>77 TB of data) across 20 major tissue types. The model was evaluated on 34 representative CPath tasks of varying diagnostic difficulty. In addition to outperforming previous state-of-the-art models, we demonstrate new modeling capabilities in CPath such as resolution-agnostic tissue classification, slide classification using few-shot class prototypes, and disease subtyping generalization in classifying up to 108 cancer types in the OncoTree classification system. UNI advances unsupervised representation learning at scale in CPath in terms of both pretraining data and downstream evaluation, enabling data-efficient artificial intelligence models that can generalize and transfer to a wide range of diagnostically challenging tasks and clinical workflows in anatomic pathology.
引用
收藏
页码:850 / 862
页数:13
相关论文
共 50 条
  • [21] A GENERAL-PURPOSE ELECTROMETER
    FRY, RM
    JOURNAL OF SCIENTIFIC INSTRUMENTS, 1954, 31 (08): : 269 - 271
  • [22] A GENERAL-PURPOSE ANIMATOR
    BRUNNER, DT
    HENRIKSEN, JO
    1989 WINTER SIMULATION CONFERENCE PROCEEDINGS, 1989, : 155 - 163
  • [23] A First Step Towards a General-Purpose Distributed Cyberdefense System
    Rodriguez, Aaron
    Castillo, Luis
    ADVANCES IN PRACTICAL APPLICATIONS OF AGENTS, MULTI-AGENT SYSTEMS, AND COMPLEXITY: THE PAAMS COLLECTION, 2018, 10978 : 237 - 247
  • [24] Plastic cell architecture: Towards reconfigurable computing for general-purpose
    Nagami, K
    Oguri, K
    Shiozawa, T
    Ito, H
    Konishi, R
    IEEE SYMPOSIUM ON FPGAS FOR CUSTOM COMPUTING MACHINES, PROCEEDINGS, 1998, : 68 - 77
  • [25] TOWARDS A GENERAL-PURPOSE OPEN BOUNDARY CONDITION FOR WAVE SIMULATIONS
    Duz, Bulent
    Huijsmans, Rene H. M.
    Wellens, Peter R.
    Borsboom, Mart J. A.
    Veldman, Arthur E. P.
    OMAE2011: PROCEEDINGS OF THE ASME 30TH INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, VOL 7: CFD AND VIV: OFFSHORE GEOTECHNICS, 2011, : 557 - +
  • [26] Towards a general-purpose sequence design system in DNA computing
    Tanaka, F
    Nakatsugawa, M
    Yamamoto, M
    Shiba, T
    Ohuchi, A
    CEC'02: PROCEEDINGS OF THE 2002 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2002, : 73 - 78
  • [28] GENERAL-PURPOSE SILVER SOLUTION FOR USE IN ROUTINE HISTO-PATHOLOGY
    WHEELER, EE
    MEDICAL LABORATORY SCIENCES, 1979, 36 (02): : 147 - 152
  • [29] Boggart: Towards General-Purpose Acceleration of Retrospective Video Analytics
    Agarwal, Neil
    Netravali, Ravi
    PROCEEDINGS OF THE 20TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, NSDI 2023, 2023, : 933 - 951
  • [30] Towards Efficient Processing of General-Purpose Joins in Sensor Networks
    Stern, Mirco
    Buchmann, Erik
    Boehm, Klemens
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 126 - 137