Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

被引:1
|
作者
Charvadeh, Yasin Khadem [1 ]
Yi, Grace Y. [1 ,2 ,3 ]
机构
[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada
[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada
[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;
D O I
10.1002/sim.9973
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Research on dynamic treatment regimes has enticed extensive interest. Many methods have been proposed in the literature, which, however, are vulnerable to the presence of misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can impact the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully ameliorate bias in parameter estimation.
引用
收藏
页码:578 / 605
页数:28
相关论文
共 50 条
  • [31] A Deep Q-Learning Approach for Dynamic Management of Heterogeneous Processors
    Gupta, Ujjwal
    Mandal, Sumit K.
    Mao, Manqing
    Chakrabarti, Chaitali
    Ogras, Umit Y.
    IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (01) : 14 - 17
  • [32] Dynamic single machine scheduling using Q-learning agent
    Kong, LF
    Wu, J
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 3237 - 3241
  • [33] A New Algorithm to Track Dynamic Goal Position in Q-learning
    Mitra, Soumishila
    Banerjee, Dhrubojyoti
    Konar, Amit
    Janarthanan, R.
    2012 12TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2012, : 69 - 74
  • [34] Dynamic feature selection algorithm based on Q-learning mechanism
    Ruohao Xu
    Mengmeng Li
    Zhongliang Yang
    Lifang Yang
    Kangjia Qiao
    Zhigang Shang
    Applied Intelligence, 2021, 51 : 7233 - 7244
  • [35] Dynamic Level of Difficulties Using Q-Learning and Fuzzy Logic
    Annisa Damastuti, Fardani
    Firmansyah, Kenan
    Miftachul Arif, Yunifa
    Dutono, Titon
    Barakbah, Aliridho
    Hariadi, Mochamad
    IEEE ACCESS, 2024, 12 : 137775 - 137789
  • [36] A Dynamic Planning Algorithm based on Q-Learning Routing in SDON
    Shang, Jingkun
    Li, Hui
    Man, Xiangkun
    Wu, Fang
    Zhao, Jia Wei
    Ma, Xiaomei
    2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020,
  • [37] Reduction of the dynamic state-space in Fuzzy Q-Learning
    Kovács, S
    Baranyi, N
    2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 1075 - 1080
  • [38] ENTROPY LEARNING FOR DYNAMIC TREATMENT REGIMES
    Jiang, Binyan
    Song, Rui
    Li, Jialiang
    Zeng, Donglin
    Lu, Wenbin
    He, Xin
    Xu, Shirong
    Wang, Junhui
    Qian, Min
    Cheng, Bin
    Qiu, Hongxiang
    Luedtke, Alex
    van der Laan, Mark
    Wager, Stefan
    Zhang, Yichi
    Laber, Eric B.
    Kallus, Nathan
    STATISTICA SINICA, 2019, 29 (04) : 1633 - 1710
  • [39] Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
    Bertsekas, Dimitri P.
    Yu, Huizhen
    MATHEMATICS OF OPERATIONS RESEARCH, 2012, 37 (01) : 66 - 94
  • [40] A Novel Deep Q-learning Method for Dynamic Spectrum Access
    Tomovic, S.
    Radusinovic, I
    2020 28TH TELECOMMUNICATIONS FORUM (TELFOR), 2020, : 9 - 12