Effective sample size: A measure of individual uncertainty in predictions

被引:1
|
作者
Thomassen, Doranne [1 ]
le Cessie, Saskia [1 ,2 ]
van Houwelingen, Hans C. [1 ]
Steyerberg, Ewout W. [1 ]
机构
[1] Leiden Univ, Med Ctr, Dept Biomed Data Sci, Postzone S-05-S,POB 9600, Leiden, Netherlands
[2] Leiden Univ, Med Ctr, Dept Clin Epidemiol, Leiden, Netherlands
关键词
prediction modeling; risk communication; uncertainty quantification; CANCER; MODELS; RISK; VALIDATION; SELECTION;
D O I
10.1002/sim.10018
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clinical prediction models are estimated using a sample of limited size from the target population, leading to uncertainty in predictions, even when the model is correctly specified. Generally, not all patient profiles are observed uniformly in model development. As a result, sampling uncertainty varies between individual patients' predictions. We aimed to develop an intuitive measure of individual prediction uncertainty. The variance of a patient's prediction can be equated to the variance of the sample mean outcome in n*$$ {n}_{\ast } $$ hypothetical patients with the same predictor values. This hypothetical sample size n*$$ {n}_{\ast } $$ can be interpreted as the number of similar patients neff$$ {n}_{\mathrm{eff}} $$ that the prediction is effectively based on, given that the model is correct. For generalized linear models, we derived analytical expressions for the effective sample size. In addition, we illustrated the concept in patients with acute myocardial infarction. In model development, neff$$ {n}_{\mathrm{eff}} $$ can be used to balance accuracy versus uncertainty of predictions. In a validation sample, the distribution of neff$$ {n}_{\mathrm{eff}} $$ indicates which patients were more and less represented in the development data, and whether predictions might be too uncertain for some to be practically meaningful. In a clinical setting, the effective sample size may facilitate communication of uncertainty about predictions. We propose the effective sample size as a clinically interpretable measure of uncertainty in individual predictions. Its implications should be explored further for the development, validation and clinical implementation of prediction models.
引用
收藏
页码:1384 / 1396
页数:13
相关论文
共 50 条
  • [41] Effects of Uncertainty in Model Predictions of Individual Tree Volume on Large Area Volume Estimates
    McRoberts, Ronald E.
    Westfall, James A.
    FOREST SCIENCE, 2014, 60 (01) : 34 - 42
  • [42] Skill assessment strategies for screening regression predictions based on a small sample size
    Unger, DA
    13TH CONFERENCE ON PROBABILITY AND STATISTICS IN THE ATMOSPHERIC SCIENCES, 1996, : 260 - 267
  • [43] Uncertainty in QSAR Predictions
    Sahlin, Ullrika
    ATLA-ALTERNATIVES TO LABORATORY ANIMALS, 2013, 41 (01): : 111 - 125
  • [44] The Effective Sample Size and an Alternative Small-Sample Degrees-of-Freedom Method
    Faes, Christel
    Molenberghs, Geert
    Aerts, Marc
    Verbeke, Geert
    Kenward, Michael G.
    AMERICAN STATISTICIAN, 2009, 63 (04): : 389 - 399
  • [45] The impact of metrology study sample size on uncertainty in IAEA safeguards calculations
    Burr, Tom
    Krieger, Thomas
    Norman, Claude
    Zhao, Ke
    EPJ NUCLEAR SCIENCES & TECHNOLOGIES, 2016, 2
  • [46] Sample size calculations for clinical studies allowing for uncertainty about the variance
    Julious, Steven A.
    Owen, Roger J.
    PHARMACEUTICAL STATISTICS, 2006, 5 (01) : 29 - 37
  • [47] Universal Sample Size Invariant Measures for Uncertainty Quantification in Density Estimation
    Farmer, Jenny
    Merino, Zach
    Gray, Alexander
    Jacobs, Donald
    ENTROPY, 2019, 21 (11)
  • [48] Strength uncertainty analysis of composite turbine blade with small sample size
    Chen, Gaoxiang
    Fan, Jiang
    Dong, Shaojing
    Liu, Daxiang
    STRUCTURES, 2021, 33 : 1158 - 1179
  • [49] Sample-size calculation for a log-transformed outcome measure
    Wolfe, R
    Carlin, JB
    CONTROLLED CLINICAL TRIALS, 1999, 20 (06): : 547 - 554
  • [50] A proposal of the new quantities for the association as a measure and their Behavior as a function of sample size
    Itoyama, K
    Kamizono, K
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VII, PROCEEDINGS: APPLICATIONS OF INFORMATICS AND CYBERNETICS IN SCIENCE AND ENGINEERING, 2004, : 414 - 419