Conservative Policy Construction Using Variational Autoencoders for Logged Data With Missing Values

被引:4
|
作者
Abroshan, Mahed [1 ]
Yip, Kai Hou [2 ]
Tekin, Cem [3 ]
van der Schaar, Mihaela [4 ,5 ]
机构
[1] Alan Turing Inst, London NW1 2DB, England
[2] UCL, London WC1E 6BT, England
[3] Bilkent Univ, Dept Elect & Elect Engn, TR-06800 Ankara, Turkey
[4] Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge CB2 1TN, England
[5] Univ Calif Los Angeles, Los Angeles, CA 90095 USA
基金
英国工程与自然科学研究理事会; 英国科研创新办公室;
关键词
Missing values; observational data; policy construction; variational autoencoder; PROPENSITY SCORE; INFERENCE; IMPUTATION;
D O I
10.1109/TNNLS.2021.3136385
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In high-stakes applications of data-driven decision-making such as healthcare, it is of paramount importance to learn a policy that maximizes the reward while avoiding potentially dangerous actions when there is uncertainty. There are two main challenges usually associated with this problem. First, learning through online exploration is not possible due to the critical nature of such applications. Therefore, we need to resort to observational datasets with no counterfactuals. Second, such datasets are usually imperfect, additionally cursed with missing values in the attributes of features. In this article, we consider the problem of constructing personalized policies using logged data when there are missing values in the attributes of features in both training and test data. The goal is to recommend an action (treatment) when (X) over tilde, a degraded version of X with missing values, is observed. We consider three strategies for dealing with missingness. In particular, we introduce the conservative strategy where the policy is designed to safely handle the uncertainty due to missingness. In order to implement this strategy, we need to estimate posterior distribution p(X vertical bar(X) over tilde) and use a variational autoencoder to achieve this. In particular, our method is based on partial variational autoencoders (PVAEs) that are designed to capture the underlying structure of features with missing values.
引用
收藏
页码:6368 / 6378
页数:11
相关论文
共 50 条
  • [1] Interpolation of missing swaption volatility data using variational autoencoders
    Richert I.
    Buch R.
    Behaviormetrika, 2024, 51 (1) : 291 - 317
  • [2] Posterior Consistency for Missing Data in Variational Autoencoders
    Sudak, Timur
    Tschiatschek, Sebastian
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT II, 2023, 14170 : 508 - 524
  • [3] Analysis of Variational Autoencoders for Imputing Missing Values from Sensor Data of Marine Systems
    Velasco-Gallego, Christian
    Lazakis, Iraklis
    JOURNAL OF SHIP RESEARCH, 2022, 66 (03): : 193 - 203
  • [4] Imputation of Missing Values in Training Data using Variational Autoencoder
    Hong, Xuerui
    Hao, Shuang
    2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 49 - 54
  • [5] Supervised Variational Autoencoders for Soft Sensor Modeling With Missing Data
    Xie, Ruimin
    Jan, Nabil Magbool
    Hao, Kuangrong
    Chen, Lei
    Huang, Biao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (04) : 2820 - 2828
  • [6] Variational Autoencoders for Missing Data Imputation with Application to a Simulated Milling Circuit
    McCoy, John T.
    Kroon, Steve
    Auret, Lidia
    IFAC PAPERSONLINE, 2018, 51 (21): : 141 - 146
  • [7] Seismic labeled data expansion using variational autoencoders
    Li, Kunhong
    Chen, Song
    Hu, Guangmin
    ARTIFICIAL INTELLIGENCE IN GEOSCIENCES, 2020, 1 : 24 - 30
  • [8] Physiological Waveform Imputation of Missing Data using Convolutional Autoencoders
    Miller, Daniel
    Ward, Andrew
    Bambos, Nicholas
    Scheinker, David
    Shin, Andrew
    2018 IEEE 20TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), 2018,
  • [9] Imputation of Missing Traffic Flow Data Using Denoising Autoencoders
    Jiang, Boyuan
    Siddiqi, Muhammad Danial
    Asadi, Reza
    Regan, Amelia
    12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2021, 184 : 84 - 91
  • [10] Data augmentation using Variational Autoencoders for improvement of respiratory disease classification
    Saldanha, Jane
    Chakraborty, Shaunak
    Patil, Shruti
    Kotecha, Ketan
    Kumar, Satish
    Nayyar, Anand
    PLOS ONE, 2022, 17 (08):