Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

被引:1
|
作者
Charvadeh, Yasin Khadem [1 ]
Yi, Grace Y. [1 ,2 ,3 ]
机构
[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada
[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada
[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;
D O I
10.1002/sim.9973
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Research on dynamic treatment regimes has enticed extensive interest. Many methods have been proposed in the literature, which, however, are vulnerable to the presence of misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can impact the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully ameliorate bias in parameter estimation.
引用
收藏
页码:578 / 605
页数:28
相关论文
共 50 条
  • [21] Fuzzy adaptive Q-learning method with dynamic learning parameters
    Maeda, Y
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 2778 - 2780
  • [22] A Deep Q-Learning Dynamic Spectrum Sharing Experiment
    Shea, John M.
    Wong, Tan F.
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [23] Adaptive and Dynamic Service Composition Using Q-Learning
    Wang, Hongbing
    Zhou, Xuan
    Zhou, Xiang
    Liu, Weihong
    Li, Wenya
    22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 1, 2010,
  • [24] Dynamic Intermittent Q-Learning for Systems with Reduced Bandwidth
    Yang, Yongliang
    Vamvoudakis, Kyriakos G.
    Ferraz, Henrique
    Modares, Hamidreza
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 924 - 931
  • [25] Dynamic fuzzy Q-Learning and control of mobile robots
    Deng, C
    Er, MJ
    Xu, J
    2004 8TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1-3, 2004, : 2336 - 2341
  • [26] A dynamic channel assignment policy through Q-learning
    Nie, JH
    Haykin, S
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (06): : 1443 - 1455
  • [27] Dynamic scheduling with fuzzy clustering based Q-learning
    Wang, Guo-Lei
    Lin, Lin
    Zhong, Shi-Sheng
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2009, 15 (04): : 751 - 757
  • [28] Q- and A-Learning Methods for Estimating Optimal Dynamic Treatment Regimes
    Schulte, Phillip J.
    Tsiatis, Anastasios A.
    Laber, Eric B.
    Davidian, Marie
    STATISTICAL SCIENCE, 2014, 29 (04) : 640 - 661
  • [29] Optimizing traffic flow with Q-learning and genetic algorithm for congestion control
    Deepika, Gitanjali
    Pandove, Gitanjali
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (5-6) : 4179 - 4197
  • [30] Optimizing Agent Training with Deep Q-Learning on a Self Driving Reinforcement Learning Environment
    Rodrigues, Pedro
    Vieira, Susana
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 745 - 752