Effective sample size: A measure of individual uncertainty in predictions

被引:1
|
作者
Thomassen, Doranne [1 ]
le Cessie, Saskia [1 ,2 ]
van Houwelingen, Hans C. [1 ]
Steyerberg, Ewout W. [1 ]
机构
[1] Leiden Univ, Med Ctr, Dept Biomed Data Sci, Postzone S-05-S,POB 9600, Leiden, Netherlands
[2] Leiden Univ, Med Ctr, Dept Clin Epidemiol, Leiden, Netherlands
关键词
prediction modeling; risk communication; uncertainty quantification; CANCER; MODELS; RISK; VALIDATION; SELECTION;
D O I
10.1002/sim.10018
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clinical prediction models are estimated using a sample of limited size from the target population, leading to uncertainty in predictions, even when the model is correctly specified. Generally, not all patient profiles are observed uniformly in model development. As a result, sampling uncertainty varies between individual patients' predictions. We aimed to develop an intuitive measure of individual prediction uncertainty. The variance of a patient's prediction can be equated to the variance of the sample mean outcome in n*$$ {n}_{\ast } $$ hypothetical patients with the same predictor values. This hypothetical sample size n*$$ {n}_{\ast } $$ can be interpreted as the number of similar patients neff$$ {n}_{\mathrm{eff}} $$ that the prediction is effectively based on, given that the model is correct. For generalized linear models, we derived analytical expressions for the effective sample size. In addition, we illustrated the concept in patients with acute myocardial infarction. In model development, neff$$ {n}_{\mathrm{eff}} $$ can be used to balance accuracy versus uncertainty of predictions. In a validation sample, the distribution of neff$$ {n}_{\mathrm{eff}} $$ indicates which patients were more and less represented in the development data, and whether predictions might be too uncertain for some to be practically meaningful. In a clinical setting, the effective sample size may facilitate communication of uncertainty about predictions. We propose the effective sample size as a clinically interpretable measure of uncertainty in individual predictions. Its implications should be explored further for the development, validation and clinical implementation of prediction models.
引用
收藏
页码:1384 / 1396
页数:13
相关论文
共 50 条
  • [1] Effective sample size: a promising tool to communicate uncertainty in individual predictions?
    Thomassen, Doranne
    Le Cessie, Saskia
    Van Houwelingen, Hans
    Steyerberg, Ewout W.
    MEDICAL DECISION MAKING, 2024, 44 (02) : NP18 - NP19
  • [2] Flexible Effective Sample Size Based on the Message Importance Measure
    Li, Zhefan
    Fan, Pingyi
    Dong, Yunquan
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2020, 1 : 216 - 229
  • [3] THE EFFECTIVE SAMPLE SIZE
    Berger, James
    Bayarri, M. J.
    Pericchi, L. R.
    ECONOMETRIC REVIEWS, 2014, 33 (1-4) : 197 - 217
  • [4] STREAM ORDER AS A MEASURE OF SAMPLE SOURCE UNCERTAINTY
    SHARP, WE
    WATER RESOURCES RESEARCH, 1970, 6 (03) : 919 - &
  • [5] On the effective geographic sample size
    Acosta, Jonathan
    Vallejos, Ronny
    Griffth, Daniel
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (10) : 1958 - 1975
  • [6] Rethinking the Effective Sample Size
    Elvira, Victor
    Martino, Luca
    Robert, Christian P.
    INTERNATIONAL STATISTICAL REVIEW, 2022, 90 (03) : 525 - 550
  • [7] Phylogenetic effective sample size
    Bartoszek, Krzysztof
    JOURNAL OF THEORETICAL BIOLOGY, 2016, 407 : 371 - 386
  • [8] An Effective Measure of Uncertainty of Basic Belief Assignments
    Dezert, Jean
    2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022), 2022,
  • [9] Estimating the uncertainty of individual artificial neural net ensemble predictions
    Clark, Robert D.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2011, 242
  • [10] Finite sample size and neural model uncertainty
    Università di Perugia, Ist. Elettronica, Perugia, Italy
    Neurocomputing, 1-3 (121-131):