MOCODA: Model-based Counterfactual Data Augmentation

被引:0
|
作者
Pitis, Silviu [1 ,2 ]
Creager, Elliot [1 ,2 ]
Mandlekar, Ajay [3 ]
Garg, Animesh [1 ,2 ,3 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Vector Inst, Toronto, ON, Canada
[3] NVIDIA, Santa Clara, CA USA
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The number of states in a dynamic process is exponential in the number of objects, making reinforcement learning (RL) difficult in complex, multi-object domains. For agents to scale to the real world, they will need to react to and reason about unseen combinations of objects. We argue that the ability to recognize and use local factorization in transition dynamics is a key element in unlocking the power of multi-object reasoning. To this end, we show that (1) known local structure in the environment transitions is sufficient for an exponential reduction in the sample complexity of training a dynamics model, and (2) a locally factored dynamics model provably generalizes out-of-distribution to unseen states and actions. Knowing the local structure also allows us to predict which unseen states and actions this dynamics model will generalize to. We propose to leverage these observations in a novel Model-based Counterfactual Data Augmentation (MOCODA) framework. MOCODA applies a learned locally factored dynamics model to an augmented distribution of states and actions to generate counterfactual transitions for RL. MOCODA works with a broader set of local structures than prior work and allows for direct control over the augmented training distribution. We show that MOCODA enables RL agents to learn policies that generalize to unseen states and actions. We use MOCODA to train an offline RL agent to solve an out-of-distribution robotics manipulation task on which standard offline RL algorithms fail.(1)
引用
收藏
页数:14
相关论文
共 50 条
  • [1] FairFlow: An Automated Approach to Model-Based Counterfactual Data Augmentation for NLP
    Tokpo, Ewoenam Kwaku
    Calders, Toon
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 160 - 176
  • [2] Null Model-Based Data Augmentation for Graph Classification
    Wang, Zeyu
    Wang, Jinhuan
    Shan, Yalu
    Yu, Shanqing
    Xu, Xiaoke
    Xuan, Qi
    Chen, Guanrong
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (02): : 1821 - 1833
  • [3] Model-Based Counterfactual Synthesizer for Interpretation
    Yang, Fan
    Alva, Sahan Suresh
    Chen, Jiahao
    Hu, Xia
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1964 - 1974
  • [4] Diffusion Model-Based Data Augmentation for Lung Ultrasound Classification with Limited Data
    Zhang, Xiaohui
    Gangopadhyay, Ahana
    Chang, Hsi-Ming
    Soni, Ravi
    MACHINE LEARNING FOR HEALTH, ML4H, VOL 225, 2023, 225 : 664 - 676
  • [5] ROMA: Reverse Model-Based Data Augmentation for Offline Reinforcement Learning
    Wei, Xiaochen
    Huang, Wenzhen
    Zhai, Ziming
    BIG DATA AND SECURITY, ICBDS 2023, PT I, 2024, 2099 : 178 - 193
  • [6] Improving Text Classification with Large Language Model-Based Data Augmentation
    Zhao, Huanhuan
    Chen, Haihua
    Ruggles, Thomas A.
    Feng, Yunhe
    Singh, Debjani
    Yoon, Hong-Jun
    ELECTRONICS, 2024, 13 (13)
  • [7] Fault Detection of Bearing by Resnet Classifier with Model-Based Data Augmentation
    Qian, Lu
    Pan, Qing
    Lv, Yaqiong
    Zhao, Xingwei
    MACHINES, 2022, 10 (07)
  • [8] Model-based data augmentation for user-independent fatigue estimation
    Jiang, Yanran
    Malliaras, Peter
    Chen, Bernard
    Kulic, Dana
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 137
  • [9] Improved Model-based Learning with Data Augmentation for Quantitative Susceptibility Mapping
    Liu, Juan
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 143, 2021, 143 : 431 - 450
  • [10] Intrinsically motivated reinforcement learning based recommendation with counterfactual data augmentation
    Xiaocong Chen
    Siyu Wang
    Lianyong Qi
    Yong Li
    Lina Yao
    World Wide Web, 2023, 26 : 3253 - 3274