Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning

被引:1
|
作者
Lin, Bingqian [1 ]
Long, Yanxin [1 ]
Zhu, Yi [2 ]
Zhu, Fengda [3 ]
Liang, Xiaodan [1 ,4 ]
Ye, Qixiang [2 ]
Lin, Liang [1 ]
机构
[1] Sun Yat Sen Univ, Shenzhen Campus, Shenzhen 510275, Peoples R China
[2] Univ Chinese Acad Sci UCAS, Beijing 101408, Peoples R China
[3] Monash Univ, Melbourne, Vic 3800, Australia
[4] Dark Matter Inc, Guangzhou 511400, Guangdong, Peoples R China
关键词
Contrastive learning; navigation robustness; progressive training; vision-and-language navigation;
D O I
10.1109/TPAMI.2023.3273594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision-and-language navigation (VLN) asks an agent to follow a given language instruction to navigate through a real 3D environment. Despite significant advances, conventional VLN agents are trained typically under disturbance-free environments and may easily fail in real-world navigation scenarios, since they are unaware of how to deal with various possible disturbances, such as sudden obstacles or human interruptions, which widely exist and may usually cause an unexpected route deviation. In this paper, we present a model-agnostic training paradigm, called Progressive Perturbation-aware Contrastive Learning (PROPER) to enhance the generalization ability of existing VLN agents to the real world, by requiring them to learn towards deviation-robust navigation. Specifically, a simple yet effective path perturbation scheme is introduced to implement the route deviation, with which the agent is required to still navigate successfully following the original instruction. Since directly enforcing the agent to learn perturbed trajectories may lead to insufficient and inefficient training, a progressively perturbed trajectory augmentation strategy is designed, where the agent can self-adaptively learn to navigate under perturbation with the improvement of its navigation performance for each specific trajectory. For encouraging the agent to well capture the difference brought by perturbation and adapt to both perturbation-free and perturbation-based environments, a perturbation-aware contrastive learning mechanism is further developed by contrasting perturbation-free trajectory encodings and perturbation-based counterparts. Extensive experiments on the standard Room-to-Room (R2R) benchmark show that PROPER can benefit multiple state-of-the-art VLN baselines in perturbation-free scenarios. We further collect the perturbed path data to construct an introspection subset based on the R2R, called Path-Perturbed R2R (PP-R2R). The results on PP-R2R show unsatisfying robustness of popular VLN agents and the capability of PROPER in improving the navigation robustness under deviation.
引用
收藏
页码:12535 / 12549
页数:15
相关论文
共 50 条
  • [21] Towards Robust False Information Detection on Social Networks with Contrastive Learning
    Ma, Guanghui
    Hu, Chunming
    Ge, Ling
    Chen, Junfan
    Zhang, Hong
    Zhang, Richong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1441 - 1450
  • [22] Towards robust and generalizable representations of extracellular data using contrastive learning
    Vishnubhotla, Ankit
    Loh, Charlotte
    Paninski, Liam
    Srivastava, Akash
    Hurwitz, Cole
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [23] Towards Adversarial Robustness with Multidimensional Perturbations via Contrastive Learning
    Chen, Chuanxi
    Ye, Dengpan
    Wang, Hao
    Tang, Long
    Xu, Yue
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 184 - 191
  • [24] Towards multimodal sarcasm detection via label-aware graph contrastive learning with back-translation augmentation
    Wei, Yiwei
    Duan, Maomao
    Zhou, Hengyang
    Jia, Zhiyang
    Gao, Zengwei
    Wang, Longbiao
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [25] A Robust Shape-Aware Rib Fracture Detection and Segmentation Framework With Contrastive Learning
    Cao, Zheng
    Xu, Liming
    Chen, Danny Z.
    Gao, Honghao
    Wu, Jian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1584 - 1591
  • [26] DMCL: Robot Autonomous Navigation Via Depth Image Masked Contrastive Learning
    Jiang, Jiahao
    Li, Ping
    Lv, Xudong
    Yang, Yuxiang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5172 - 5178
  • [27] Counterfactual Contrastive Learning: Robust Representations via Causal Image Synthesis
    Roschewitz, Melanie
    Ribeiro, Fabio de Sousa
    Xia, Tian
    Khara, Galvin
    Glocker, Ben
    DATA ENGINEERING IN MEDICAL IMAGING, DEMI 2024, 2025, 15265 : 22 - 32
  • [28] Learning Visual Representation for Autonomous Drone Navigation via a Contrastive World Model
    Zhao J.
    Wang Y.
    Cai Z.
    Liu N.
    Wu K.
    Wang Y.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (03): : 1263 - 1276
  • [29] Enhancing robust VQA via contrastive and self-supervised learning
    Cao, Runlin
    Li, Zhixin
    Tang, Zhenjun
    Zhang, Canlong
    Ma, Huifang
    PATTERN RECOGNITION, 2025, 159
  • [30] Semantic-aware contrastive learning via multi-prompt alignment
    Zhao, Zhuoran
    Qin, Hao
    Kong, Ming
    Chen, Luyuan
    Xie, Di
    Zhu, Jiang
    Zhu, Qiang
    MACHINE LEARNING, 2025, 114 (03)