Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

被引:1
|
作者
Charvadeh, Yasin Khadem [1 ]
Yi, Grace Y. [1 ,2 ,3 ]
机构
[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada
[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada
[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;
D O I
10.1002/sim.9973
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Research on dynamic treatment regimes has enticed extensive interest. Many methods have been proposed in the literature, which, however, are vulnerable to the presence of misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can impact the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully ameliorate bias in parameter estimation.
引用
收藏
页码:578 / 605
页数:28
相关论文
共 50 条
  • [41] Dynamic feature selection algorithm based on Q-learning mechanism
    Xu, Ruohao
    Li, Mengmeng
    Yang, Zhongliang
    Yang, Lifang
    Qiao, Kangjia
    Shang, Zhigang
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7233 - 7244
  • [42] Deep Q-Learning with Multiband Sensing for Dynamic Spectrum Access
    Nguyen, Ha Q.
    Nguyen, Binh T.
    Dong, Trung Q.
    Ngo, Dat T.
    Nguyen, Tuan A.
    2018 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (DYSPAN), 2018,
  • [43] Dynamic Pricing Decision for Perishable Goods: A Q-learning Approach
    Cheng, Yan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11965 - 11969
  • [44] Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
    Bertsekas, Dimitri P.
    Yu, Huizhen
    49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 1409 - 1416
  • [45] Decentralized Q-Learning for Weakly Acyclic Stochastic Dynamic Games
    Arslan, Gurdal
    Yuksel, Serdar
    2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 6743 - 6748
  • [46] Optimizing the Impact of Musical Education on Mental Health of Students using Q-Learning
    Bing, Yang
    MOBILE NETWORKS & APPLICATIONS, 2024,
  • [47] Optimizing the prediction of adsorption in metal-organic frameworks leveraging Q-learning
    Osaro, Etinosa
    Colon, Yamil J.
    AICHE JOURNAL, 2024, 70 (12)
  • [48] Optimizing Handover Parameters by Q-Learning for Heterogeneous Radio-Optical Networks
    Shao, Sihua
    Liu, Guanxiong
    Khreishah, Abdallah
    Ayyash, Moussa
    Elgala, Hany
    Little, Thomas D. C.
    Rahaim, Michael
    IEEE PHOTONICS JOURNAL, 2020, 12 (01):
  • [49] Mounting of auction agent under dynamic environment by Q-learning and SARSA learning
    Katou, T
    Nagasaka, K
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING: I, 2003, : 472 - 475
  • [50] Evaluation of Instance-Based Learning and Q-Learning Algorithms in Dynamic Environments
    Gupta, Anmol
    Roy, Partha Pratim
    Dutt, Varun
    IEEE ACCESS, 2021, 9 : 138775 - 138790