MOCODA: Model-based Counterfactual Data Augmentation

Authors
Pitis, Silviu [1, 2]
Creager, Elliot [1, 2]
Mandlekar, Ajay [3]
Garg, Animesh [1, 2, 3]
Affiliations
[1] University of Toronto, Toronto, ON, Canada
[2] Vector Institute, Toronto, ON, Canada
[3] NVIDIA, Santa Clara, CA, USA
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The number of states in a dynamic process is exponential in the number of objects, making reinforcement learning (RL) difficult in complex, multi-object domains. For agents to scale to the real world, they will need to react to and reason about unseen combinations of objects. We argue that the ability to recognize and use local factorization in transition dynamics is a key element in unlocking the power of multi-object reasoning. To this end, we show that (1) known local structure in the environment transitions is sufficient for an exponential reduction in the sample complexity of training a dynamics model, and (2) a locally factored dynamics model provably generalizes out-of-distribution to unseen states and actions. Knowing the local structure also allows us to predict which unseen states and actions this dynamics model will generalize to. We propose to leverage these observations in a novel Model-based Counterfactual Data Augmentation (MOCODA) framework. MOCODA applies a learned locally factored dynamics model to an augmented distribution of states and actions to generate counterfactual transitions for RL. MOCODA works with a broader set of local structures than prior work and allows for direct control over the augmented training distribution. We show that MOCODA enables RL agents to learn policies that generalize to unseen states and actions. We use MOCODA to train an offline RL agent to solve an out-of-distribution robotics manipulation task on which standard offline RL algorithms fail.
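The idea in the abstract, applying a factored dynamics model to recombined states and actions, can be illustrated with a toy sketch. This is not the authors' implementation. It assumes the simplest possible structure (two objects whose next state depends only on their own state and action), swaps in per-factor linear least squares for a learned neural model, and builds the augmented distribution by shuffling one object's rows independently of the other's. All names (`design`, `fit_factor`, `aug_states`, and so on) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200

# Toy dataset: two objects on a line, observed only in "both low" or
# "both high" configurations. Mixed configurations never appear.
low = rng.uniform(0.0, 1.0, size=(n // 2, 2))
high = rng.uniform(4.0, 5.0, size=(n // 2, 2))
states = np.vstack([low, high])
actions = rng.uniform(-1.0, 1.0, size=(n, 2))
# Assumed ground truth: object i's next state depends only on (s_i, a_i).
next_states = states + actions


def design(parents):
    """Append a bias column to a factor's parent variables."""
    return np.column_stack([parents, np.ones(len(parents))])


def fit_factor(parents, target):
    """Least-squares fit of one factor from its own parents only."""
    w, *_ = np.linalg.lstsq(design(parents), target, rcond=None)
    return w


# One small model per object, each seeing only that object's state and action.
weights = [
    fit_factor(np.column_stack([states[:, i], actions[:, i]]), next_states[:, i])
    for i in range(2)
]

# Augmented distribution: keep object 0's rows, independently permute
# object 1's rows. Each factor's inputs stay inside its own training
# marginal, but the joint (s_0, s_1) combinations are new.
perm = rng.permutation(n)
aug_states = np.column_stack([states[:, 0], states[perm, 1]])
aug_actions = np.column_stack([actions[:, 0], actions[perm, 1]])

# Counterfactual transitions: apply each factor model to the recombined inputs.
aug_next = np.column_stack([
    design(np.column_stack([aug_states[:, i], aug_actions[:, i]])) @ weights[i]
    for i in range(2)
])

# Count configurations with object 0 low and object 1 high (absent from the data).
unseen = (aug_states[:, 0] < 2.0) & (aug_states[:, 1] > 3.0)
print("unseen (low, high) samples:", int(unseen.sum()))
```

The resulting `(aug_states, aug_actions, aug_next)` triples could then be added to an offline RL dataset. Because each per-object model is queried only on inputs it saw during training, its predictions on the recombined joint states remain reliable in this toy setting, which is the intuition behind the out-of-distribution generalization claim.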
Pages: 14