Accommodating misclassification effects on optimizing dynamic treatment regimes with Q-learning

被引：1

作者：

Charvadeh, Yasin Khadem ^{[1
]}

Yi, Grace Y. ^{[1
,2
,3
]}

机构：

[1] Univ Western Ontario, Dept Stat & Actuarial Sci, London, ON, Canada

[2] Univ Western Ontario, Dept Comp Sci, London, ON, Canada

[3] Univ Western Ontario, Dept Stat & Actuarial Sci, Dept Comp Sci, 1151 Richmond St, London, ON N6A 5B7, Canada

来源：

STATISTICS IN MEDICINE | 2024年 / 43卷 / 03期

基金：

加拿大自然科学与工程研究理事会;

关键词：

dynamic treatment regimes; estimating function; misclassification; Q-learning; regression calibration; regression models; SEQUENCED TREATMENT ALTERNATIVES; PROPORTIONAL HAZARDS MODEL; INFERENCE; REGRESSION; RATIONALE; DESIGN;

D O I：

10.1002/sim.9973

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Research on dynamic treatment regimes has enticed extensive interest. Many methods have been proposed in the literature, which, however, are vulnerable to the presence of misclassification in covariates. In particular, although Q-learning has received considerable attention, its applicability to data with misclassified covariates is unclear. In this article, we investigate how ignoring misclassification in binary covariates can impact the determination of optimal decision rules in randomized treatment settings, and demonstrate its deleterious effects on Q-learning through empirical studies. We present two correction methods to address misclassification effects on Q-learning. Numerical studies reveal that misclassification in covariates induces non-negligible estimation bias and that the correction methods successfully ameliorate bias in parameter estimation.

引用

页码：578 / 605

页数：28

共 50 条

[41] Dynamic feature selection algorithm based on Q-learning mechanism
Xu, Ruohao
Li, Mengmeng
Yang, Zhongliang
Yang, Lifang
Qiao, Kangjia
Shang, Zhigang
APPLIED INTELLIGENCE, 2021, 51 (10) : 7233 - 7244
[42] Deep Q-Learning with Multiband Sensing for Dynamic Spectrum Access
Nguyen, Ha Q.
Nguyen, Binh T.
Dong, Trung Q.
Ngo, Dat T.
Nguyen, Tuan A.
2018 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (DYSPAN), 2018,
[43] Dynamic Pricing Decision for Perishable Goods: A Q-learning Approach
Cheng, Yan
2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11965 - 11969
[44] Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
Bertsekas, Dimitri P.
Yu, Huizhen
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 1409 - 1416
[45] Decentralized Q-Learning for Weakly Acyclic Stochastic Dynamic Games
Arslan, Gurdal
Yuksel, Serdar
2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 6743 - 6748
[46] Optimizing the Impact of Musical Education on Mental Health of Students using Q-Learning
Bing, Yang
MOBILE NETWORKS & APPLICATIONS, 2024,
[47] Optimizing the prediction of adsorption in metal-organic frameworks leveraging Q-learning
Osaro, Etinosa
Colon, Yamil J.
AICHE JOURNAL, 2024, 70 (12)
[48] Optimizing Handover Parameters by Q-Learning for Heterogeneous Radio-Optical Networks
Shao, Sihua
Liu, Guanxiong
Khreishah, Abdallah
Ayyash, Moussa
Elgala, Hany
Little, Thomas D. C.
Rahaim, Michael
IEEE PHOTONICS JOURNAL, 2020, 12 (01):
[49] Mounting of auction agent under dynamic environment by Q-learning and SARSA learning
Katou, T
Nagasaka, K
7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING: I, 2003, : 472 - 475
[50] Evaluation of Instance-Based Learning and Q-Learning Algorithms in Dynamic Environments
Gupta, Anmol
Roy, Partha Pratim
Dutt, Varun
IEEE ACCESS, 2021, 9 : 138775 - 138790

← 1 2 3 4 5 →