Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

Cited by: 1
Authors
Charvadeh, Yasin Khadem [1]
Yi, Grace Y. [1, 2, 3]
Affiliations
[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada
[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada
[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;
DOI
10.1002/sim.9973
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Research on dynamic treatment regimes has attracted extensive interest. Many methods have been proposed in the literature, but they are vulnerable to misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can affect the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully reduce bias in parameter estimation.
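The bias described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' method or code: it assumes a single-stage randomized setting (where Q-learning reduces to one regression), a linear Q-function, known misclassification rates `p01` and `p10`, and a known covariate prevalence `prev`. All names and parameter values are hypothetical. It compares a fit on the true covariate, a naive fit on the misclassified covariate, and a generic regression-calibration-style fit that replaces the surrogate with E[X | X*].

```python
import numpy as np


def fit_q(x, a, y):
    """OLS fit of Q(x, a) = b0 + b1*x + a*(psi0 + psi1*x).

    Returns the coefficient vector [b0, b1, psi0, psi1].
    """
    design = np.column_stack([np.ones_like(x), x, a, a * x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef


def simulate(n=200_000, p01=0.2, p10=0.2, prev=0.5, seed=0):
    """Simulate one randomized stage with a misclassified binary covariate.

    p01 = P(X* = 1 | X = 0), p10 = P(X* = 0 | X = 1); both are assumed known.
    """
    rng = np.random.default_rng(seed)
    x = rng.binomial(1, prev, n).astype(float)   # true binary covariate
    a = rng.binomial(1, 0.5, n).astype(float)    # randomized treatment
    # Assumed true model: psi0 = -0.5, psi1 = 1.0.
    y = 1.0 + 0.5 * x + a * (-0.5 + 1.0 * x) + rng.normal(0.0, 1.0, n)

    # Flip each observation with the rate that matches its true value.
    flip = np.where(x == 1, rng.random(n) < p10, rng.random(n) < p01)
    x_star = np.where(flip, 1.0 - x, x)          # observed surrogate

    # Calibration step: E[X | X*] from Bayes' rule under the assumed rates.
    p_star1 = prev * (1 - p10) + (1 - prev) * p01
    m1 = prev * (1 - p10) / p_star1              # P(X = 1 | X* = 1)
    m0 = prev * p10 / (1 - p_star1)              # P(X = 1 | X* = 0)
    x_cal = np.where(x_star == 1, m1, m0)

    return {
        "oracle": fit_q(x, a, y),
        "naive": fit_q(x_star, a, y),
        "calibrated": fit_q(x_cal, a, y),
    }


if __name__ == "__main__":
    for name, coef in simulate().items():
        print(name, np.round(coef, 3))
```

Under these assumed settings (prevalence 0.5, symmetric rates of 0.2), the naive interaction estimate is attenuated toward roughly 0.6 of its true value in large samples, since E[X | X* = 1] - E[X | X* = 0] = 1 - p01 - p10, while the calibrated fit targets the true parameters because the Q-function is linear in X.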
Pages: 578-605
Number of pages: 28