Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning

被引:1
|
作者
Lin, Bingqian [1 ]
Long, Yanxin [1 ]
Zhu, Yi [2 ]
Zhu, Fengda [3 ]
Liang, Xiaodan [1 ,4 ]
Ye, Qixiang [2 ]
Lin, Liang [1 ]
机构
[1] Sun Yat Sen Univ, Shenzhen Campus, Shenzhen 510275, Peoples R China
[2] Univ Chinese Acad Sci UCAS, Beijing 101408, Peoples R China
[3] Monash Univ, Melbourne, Vic 3800, Australia
[4] Dark Matter Inc, Guangzhou 511400, Guangdong, Peoples R China
关键词
Contrastive learning; navigation robustness; progressive training; vision-and-language navigation;
D O I
10.1109/TPAMI.2023.3273594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision-and-language navigation (VLN) asks an agent to follow a given language instruction to navigate through a real 3D environment. Despite significant advances, conventional VLN agents are trained typically under disturbance-free environments and may easily fail in real-world navigation scenarios, since they are unaware of how to deal with various possible disturbances, such as sudden obstacles or human interruptions, which widely exist and may usually cause an unexpected route deviation. In this paper, we present a model-agnostic training paradigm, called Progressive Perturbation-aware Contrastive Learning (PROPER) to enhance the generalization ability of existing VLN agents to the real world, by requiring them to learn towards deviation-robust navigation. Specifically, a simple yet effective path perturbation scheme is introduced to implement the route deviation, with which the agent is required to still navigate successfully following the original instruction. Since directly enforcing the agent to learn perturbed trajectories may lead to insufficient and inefficient training, a progressively perturbed trajectory augmentation strategy is designed, where the agent can self-adaptively learn to navigate under perturbation with the improvement of its navigation performance for each specific trajectory. For encouraging the agent to well capture the difference brought by perturbation and adapt to both perturbation-free and perturbation-based environments, a perturbation-aware contrastive learning mechanism is further developed by contrasting perturbation-free trajectory encodings and perturbation-based counterparts. Extensive experiments on the standard Room-to-Room (R2R) benchmark show that PROPER can benefit multiple state-of-the-art VLN baselines in perturbation-free scenarios. We further collect the perturbed path data to construct an introspection subset based on the R2R, called Path-Perturbed R2R (PP-R2R). The results on PP-R2R show unsatisfying robustness of popular VLN agents and the capability of PROPER in improving the navigation robustness under deviation.
引用
收藏
页码:12535 / 12549
页数:15
相关论文
共 50 条
  • [41] Web Semantic-Based Robust Graph Contrastive Learning for Recommendation via Invariant Learning
    Dai, Wengui
    Wang, Yujun
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2024, 20 (01)
  • [42] Hessian Aware Low-Rank Perturbation for Order-Robust Continual Learning
    Li, Jiaqi
    Lai, Yuanhao
    Wang, Rui
    Shui, Changjian
    Sahoo, Sabyasachi
    Ling, Charles X.
    Yang, Shichun
    Wang, Boyu
    Gagne, Christian
    Zhou, Fan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6385 - 6396
  • [43] Towards Adversarial-Robust Class-Incremental Learning via Progressively Volume-Up Perturbation Generation
    You, Yeliang
    Chen, Bin
    Yin, Jia-li
    Liu, Ximeng
    Lin, Wei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II, 2025, 15032 : 61 - 75
  • [44] Network-aware Multi-agent Reinforcement Learning for the Vehicle Navigation Problem
    Arasteh, Fazel
    SheikhGarGar, Soroush
    Papagelis, Manos
    30TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2022, 2022, : 504 - 507
  • [45] Efficient and robust Pedestrian Detection using Deep Learning for Human-Aware Navigation
    Mateus, Andre
    Ribeiro, David
    Miraldo, Pedro
    Nascimento, Jacinto C.
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2019, 113 : 23 - 37
  • [46] Towards Understanding the Mechanism of Contrastive Learning via Similarity Structure: A Theoretical Analysis
    Waida, Hiroki
    Wada, Yuichiro
    Andeol, Leo
    Nakagawa, Takumi
    Zhang, Yuhui
    Kanamori, Takafumi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 709 - 727
  • [47] Alleviating Exposure Bias via Multi-level Contrastive Learning and Deviation Simulation in Abstractive Summarization
    Xie, Jiawen
    Su, Qi
    Zhang, Shaoting
    Zhang, Xiaofan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9732 - 9747
  • [48] Margin-aware Noise-robust Contrastive Learning for Partially View-aligned Problem
    Qin, Yalan
    Pu, Nan
    Wu, Hanzhou
    Sebe, Nicu
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2025, 19 (01)
  • [49] Interaction-Aware Crowd Navigation via Augmented Relational Graph Learning
    Xu, Qing
    He, Wangli
    Kubota, Naoyuki
    2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2021), 2021, : 1106 - 1112
  • [50] Enhancing New Intent Discovery via Robust Neighbor-based Contrastive Learning
    Wu, Zhenhe
    Yu, Xiaoguang
    Chen, Meng
    Wu, Liangqing
    Ji, Jiahao
    Li, Zhoujun
    INTERSPEECH 2023, 2023, : 740 - 744