Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning

被引:1
|
作者
Lin, Bingqian [1 ]
Long, Yanxin [1 ]
Zhu, Yi [2 ]
Zhu, Fengda [3 ]
Liang, Xiaodan [1 ,4 ]
Ye, Qixiang [2 ]
Lin, Liang [1 ]
机构
[1] Sun Yat Sen Univ, Shenzhen Campus, Shenzhen 510275, Peoples R China
[2] Univ Chinese Acad Sci UCAS, Beijing 101408, Peoples R China
[3] Monash Univ, Melbourne, Vic 3800, Australia
[4] Dark Matter Inc, Guangzhou 511400, Guangdong, Peoples R China
关键词
Contrastive learning; navigation robustness; progressive training; vision-and-language navigation;
D O I
10.1109/TPAMI.2023.3273594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision-and-language navigation (VLN) asks an agent to follow a given language instruction to navigate through a real 3D environment. Despite significant advances, conventional VLN agents are trained typically under disturbance-free environments and may easily fail in real-world navigation scenarios, since they are unaware of how to deal with various possible disturbances, such as sudden obstacles or human interruptions, which widely exist and may usually cause an unexpected route deviation. In this paper, we present a model-agnostic training paradigm, called Progressive Perturbation-aware Contrastive Learning (PROPER) to enhance the generalization ability of existing VLN agents to the real world, by requiring them to learn towards deviation-robust navigation. Specifically, a simple yet effective path perturbation scheme is introduced to implement the route deviation, with which the agent is required to still navigate successfully following the original instruction. Since directly enforcing the agent to learn perturbed trajectories may lead to insufficient and inefficient training, a progressively perturbed trajectory augmentation strategy is designed, where the agent can self-adaptively learn to navigate under perturbation with the improvement of its navigation performance for each specific trajectory. For encouraging the agent to well capture the difference brought by perturbation and adapt to both perturbation-free and perturbation-based environments, a perturbation-aware contrastive learning mechanism is further developed by contrasting perturbation-free trajectory encodings and perturbation-based counterparts. Extensive experiments on the standard Room-to-Room (R2R) benchmark show that PROPER can benefit multiple state-of-the-art VLN baselines in perturbation-free scenarios. We further collect the perturbed path data to construct an introspection subset based on the R2R, called Path-Perturbed R2R (PP-R2R). The results on PP-R2R show unsatisfying robustness of popular VLN agents and the capability of PROPER in improving the navigation robustness under deviation.
引用
收藏
页码:12535 / 12549
页数:15
相关论文
共 50 条
  • [31] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
    Yuan, Haoqi
    Lu, Zongqing
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [32] Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection
    Xu, Xilie
    Zhang, Jingfeng
    Liu, Feng
    Sugiyama, Masashi
    Kankanhalli, Mohan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [33] Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning
    Sun, Weixuan
    Zhang, Jiayi
    Wang, Jianyuan
    Liu, Zheyuan
    Zhong, Yiran
    Feng, Tianpeng
    Guo, Yandong
    Zhang, Yanhao
    Barnes, Nick
    arXiv, 2023,
  • [34] Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning
    Sun, Weixuan
    Zhang, Jiayi
    Wang, Jianyuan
    Liu, Zheyuan
    Zhong, Yiran
    Feng, Tianpeng
    Guo, Yandong
    Zhang, Yanhao
    Barnes, Nick
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6420 - 6429
  • [35] Towards generalizable and robust image tampering localization with multi-task learning and contrastive learning
    Li, Haodong
    Zhuang, Peiyu
    Su, Yang
    Huang, Jiwu
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 270
  • [36] Noise Perturbation Based Graph Contrastive Learning via Flexible Filters for Node Classification
    Xiong, Zhilong
    Cai, Jia
    Yan, Ranhui
    Huang, Xiaolin
    Proceedings of the International Joint Conference on Neural Networks, 2024,
  • [37] Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation
    Ding, Wenhao
    Li, Shuaijun
    Qian, Huihuan
    Chen, Yongquan
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 237 - 242
  • [38] FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models
    Lee, Younghan
    Cho, Yungi
    Han, Woorim
    Bae, Ho
    Paek, Yunheung
    COMPUTER SECURITY - ESORICS 2023, PT IV, 2024, 14347 : 65 - 84
  • [39] Robust Basket Recommendation via Noise-tolerated Graph Contrastive Learning
    He, Xinrui
    Wei, Tianxin
    He, Jingrui
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 709 - 719
  • [40] Robust Zero-Shot Intent Detection via Contrastive Transfer Learning
    Maqbool, M. H.
    Khan, F. A.
    Siddique, A. B.
    Foroosh, Hassan
    2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 49 - 56