Effective sample size: A measure of individual uncertainty in predictions

Authors
Thomassen, Doranne [1]
le Cessie, Saskia [1,2]
van Houwelingen, Hans C. [1]
Steyerberg, Ewout W. [1]
Affiliations
[1] Leiden Univ, Med Ctr, Dept Biomed Data Sci, Postzone S-05-S,POB 9600, Leiden, Netherlands
[2] Leiden Univ, Med Ctr, Dept Clin Epidemiol, Leiden, Netherlands
Keywords
prediction modeling; risk communication; uncertainty quantification; CANCER; MODELS; RISK; VALIDATION; SELECTION;
DOI
10.1002/sim.10018
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Clinical prediction models are estimated using a sample of limited size from the target population, leading to uncertainty in predictions, even when the model is correctly specified. Generally, not all patient profiles are observed uniformly in model development. As a result, sampling uncertainty varies between individual patients' predictions. We aimed to develop an intuitive measure of individual prediction uncertainty. The variance of a patient's prediction can be equated to the variance of the sample mean outcome in $n_{\ast}$ hypothetical patients with the same predictor values. This hypothetical sample size $n_{\ast}$ can be interpreted as the number of similar patients $n_{\mathrm{eff}}$ that the prediction is effectively based on, given that the model is correct. For generalized linear models, we derived analytical expressions for the effective sample size. In addition, we illustrated the concept in patients with acute myocardial infarction. In model development, $n_{\mathrm{eff}}$ can be used to balance accuracy versus uncertainty of predictions. In a validation sample, the distribution of $n_{\mathrm{eff}}$ indicates which patients were more and less represented in the development data, and whether predictions might be too uncertain for some to be practically meaningful. In a clinical setting, the effective sample size may facilitate communication of uncertainty about predictions. We propose the effective sample size as a clinically interpretable measure of uncertainty in individual predictions. Its implications should be explored further for the development, validation and clinical implementation of prediction models.
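The definition in the abstract can be illustrated for the simplest case, ordinary least squares. The sketch below is not taken from the paper, whose analytical expressions for generalized linear models are not reproduced in this record. It assumes a correctly specified linear model with constant residual variance, and the function name `effective_sample_size_linear` and the toy data are hypothetical. Under that assumption the variance of a prediction at predictor values x is sigma^2 * x'(X'X)^{-1}x, while the mean outcome of n patients has variance sigma^2 / n, so equating the two gives n_eff = 1 / (x'(X'X)^{-1}x).

```python
import numpy as np


def effective_sample_size_linear(X, x_new):
    """Illustrative n_eff for an OLS prediction at x_new (hypothetical helper).

    Var(prediction) = sigma^2 * x' (X'X)^{-1} x, and the mean of n outcomes
    has variance sigma^2 / n. Equating the two, sigma^2 cancels and
    n_eff = 1 / (x' (X'X)^{-1} x).
    """
    XtX_inv = np.linalg.inv(X.T @ X)
    leverage = x_new @ XtX_inv @ x_new
    return 1.0 / leverage


# Toy development data (assumed): intercept plus one continuous predictor.
rng = np.random.default_rng(0)
n = 200
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])

# A patient at the centre of the predictor distribution.
n_eff_center = effective_sample_size_linear(X, np.array([1.0, x.mean()]))

# A patient three standard deviations away from the centre.
n_eff_tail = effective_sample_size_linear(
    X, np.array([1.0, x.mean() + 3 * x.std()])
)

print(f"n_eff at the centre: {n_eff_center:.1f}")
print(f"n_eff in the tail:   {n_eff_tail:.1f}")
```

In this toy setting the prediction for a patient at the centre is effectively based on all 200 development patients, whereas the prediction three standard deviations out is effectively based on 20, which matches the abstract's point that sampling uncertainty varies between individual patients' predictions.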
Pages: 1384-1396
Number of pages: 13