Effective sample size: A measure of individual uncertainty in predictions

被引:1
|
作者
Thomassen, Doranne [1 ]
le Cessie, Saskia [1 ,2 ]
van Houwelingen, Hans C. [1 ]
Steyerberg, Ewout W. [1 ]
机构
[1] Leiden Univ, Med Ctr, Dept Biomed Data Sci, Postzone S-05-S,POB 9600, Leiden, Netherlands
[2] Leiden Univ, Med Ctr, Dept Clin Epidemiol, Leiden, Netherlands
关键词
prediction modeling; risk communication; uncertainty quantification; CANCER; MODELS; RISK; VALIDATION; SELECTION;
D O I
10.1002/sim.10018
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clinical prediction models are estimated using a sample of limited size from the target population, leading to uncertainty in predictions, even when the model is correctly specified. Generally, not all patient profiles are observed uniformly in model development. As a result, sampling uncertainty varies between individual patients' predictions. We aimed to develop an intuitive measure of individual prediction uncertainty. The variance of a patient's prediction can be equated to the variance of the sample mean outcome in n*$$ {n}_{\ast } $$ hypothetical patients with the same predictor values. This hypothetical sample size n*$$ {n}_{\ast } $$ can be interpreted as the number of similar patients neff$$ {n}_{\mathrm{eff}} $$ that the prediction is effectively based on, given that the model is correct. For generalized linear models, we derived analytical expressions for the effective sample size. In addition, we illustrated the concept in patients with acute myocardial infarction. In model development, neff$$ {n}_{\mathrm{eff}} $$ can be used to balance accuracy versus uncertainty of predictions. In a validation sample, the distribution of neff$$ {n}_{\mathrm{eff}} $$ indicates which patients were more and less represented in the development data, and whether predictions might be too uncertain for some to be practically meaningful. In a clinical setting, the effective sample size may facilitate communication of uncertainty about predictions. We propose the effective sample size as a clinically interpretable measure of uncertainty in individual predictions. Its implications should be explored further for the development, validation and clinical implementation of prediction models.
引用
收藏
页码:1384 / 1396
页数:13
相关论文
共 50 条
  • [21] Effective sample size for glacier mass balance
    Cogley, JG
    GEOGRAFISKA ANNALER SERIES A-PHYSICAL GEOGRAPHY, 1999, 81A (04) : 497 - 507
  • [22] THE INTERPRETATION AND ESTIMATION OF EFFECTIVE SAMPLE-SIZE
    THIEBAUX, HJ
    ZWIERS, FW
    JOURNAL OF CLIMATE AND APPLIED METEOROLOGY, 1984, 23 (05): : 800 - 811
  • [23] Gene genealogies when the sample size exceeds the effective size of the population
    Wakeley, J
    Takahashi, T
    MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (02) : 208 - 213
  • [24] Analysis of data with interval uncertainty: application of a combined sample consistency measure
    Bazhenov, A. N.
    Zhilin, S. I.
    Telnova, A. Yu.
    MEASUREMENT TECHNIQUES, 2024, 66 (11) : 827 - 838
  • [25] Analysis of data with interval uncertainty: application of a combined sample consistency measure
    A. N. Bazhenov
    S. I. Zhilin
    A. Yu. Telnova
    Measurement Techniques, 2024, 66 : 827 - 838
  • [26] Uncertainty Importance Measure of Individual Components in Multi-State Systems
    Wang, Yong
    Li, Lin
    IEEE TRANSACTIONS ON RELIABILITY, 2015, 64 (02) : 772 - 783
  • [27] A RELATIONSHIP BETWEEN SAMPLE-SIZE AND ACCURACY OF SPECIES RICHNESS PREDICTIONS
    DECAPRARIIS, P
    LINDEMANN, R
    HAIMES, R
    JOURNAL OF THE INTERNATIONAL ASSOCIATION FOR MATHEMATICAL GEOLOGY, 1981, 13 (04): : 351 - 355
  • [28] On the sample size of random convex programs with structured dependence on the uncertainty
    Zhang, Xiaojing
    Grammatico, Sergio
    Schildbach, Georg
    Goulart, Paul
    Lygeros, John
    AUTOMATICA, 2015, 60 : 182 - 188
  • [29] GUI for Bayesian sample size planning in type A uncertainty evaluation
    Martin, Joerg
    Elster, Clemens
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2021, 32 (07)
  • [30] Measuring uncertainty in ecosystem service correlations as a function of sample size
    Shanafelt, David W.
    Serra-Diaz, Josep M.
    Bocqueho, Geraldine
    ECOSYSTEM SERVICES, 2023, 63