Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning

被引:1
|
作者
Lin, Bingqian [1 ]
Long, Yanxin [1 ]
Zhu, Yi [2 ]
Zhu, Fengda [3 ]
Liang, Xiaodan [1 ,4 ]
Ye, Qixiang [2 ]
Lin, Liang [1 ]
机构
[1] Sun Yat Sen Univ, Shenzhen Campus, Shenzhen 510275, Peoples R China
[2] Univ Chinese Acad Sci UCAS, Beijing 101408, Peoples R China
[3] Monash Univ, Melbourne, Vic 3800, Australia
[4] Dark Matter Inc, Guangzhou 511400, Guangdong, Peoples R China
关键词
Contrastive learning; navigation robustness; progressive training; vision-and-language navigation;
D O I
10.1109/TPAMI.2023.3273594
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision-and-language navigation (VLN) asks an agent to follow a given language instruction to navigate through a real 3D environment. Despite significant advances, conventional VLN agents are trained typically under disturbance-free environments and may easily fail in real-world navigation scenarios, since they are unaware of how to deal with various possible disturbances, such as sudden obstacles or human interruptions, which widely exist and may usually cause an unexpected route deviation. In this paper, we present a model-agnostic training paradigm, called Progressive Perturbation-aware Contrastive Learning (PROPER) to enhance the generalization ability of existing VLN agents to the real world, by requiring them to learn towards deviation-robust navigation. Specifically, a simple yet effective path perturbation scheme is introduced to implement the route deviation, with which the agent is required to still navigate successfully following the original instruction. Since directly enforcing the agent to learn perturbed trajectories may lead to insufficient and inefficient training, a progressively perturbed trajectory augmentation strategy is designed, where the agent can self-adaptively learn to navigate under perturbation with the improvement of its navigation performance for each specific trajectory. For encouraging the agent to well capture the difference brought by perturbation and adapt to both perturbation-free and perturbation-based environments, a perturbation-aware contrastive learning mechanism is further developed by contrasting perturbation-free trajectory encodings and perturbation-based counterparts. Extensive experiments on the standard Room-to-Room (R2R) benchmark show that PROPER can benefit multiple state-of-the-art VLN baselines in perturbation-free scenarios. We further collect the perturbed path data to construct an introspection subset based on the R2R, called Path-Perturbed R2R (PP-R2R). The results on PP-R2R show unsatisfying robustness of popular VLN agents and the capability of PROPER in improving the navigation robustness under deviation.
引用
收藏
页码:12535 / 12549
页数:15
相关论文
共 50 条
  • [1] AdvFilter: Predictive Perturbation-aware Filtering against Adversarial Attack via Multi-domain Learning
    Huang, Yihao
    Guo, Qing
    Juefei-Xu, Felix
    Ma, Lei
    Miao, Weikai
    Liu, Yang
    Pu, Geguang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 395 - 403
  • [2] Robust image clustering via context-aware contrastive graph learning
    Fang, Uno
    Li, Jianxin
    Lu, Xuequan
    Mian, Ajmal
    Gu, Zhaoquan
    PATTERN RECOGNITION, 2023, 138
  • [3] Rethinking learning difficulty and uncertainty of samples with a target perturbation-aware bias-variance decomposition
    Yao, Rujing
    Wu, Ou
    Wang, Fang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,
  • [4] MeshCL: Towards robust 3D mesh analysis via contrastive learning
    Liang, Yaqian
    He, Fazhi
    Fan, Bo
    Tang, Wei
    ADVANCED ENGINEERING INFORMATICS, 2024, 60
  • [5] COAL: Robust Contrastive Learning-Based Visual Navigation Framework
    Wang, Zengmao
    Hu, Jianhua
    Tang, Qifei
    Gao, Wei
    JOURNAL OF FIELD ROBOTICS, 2025,
  • [6] DISTRIBUTION-AWARE CONTRASTIVE LEARNING FOR ROBUST MEDICAL IMAGE SEGMENTATION
    Qin, Zheyun
    Xi, Xiaoming
    Yin, Yilong
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1991 - 1995
  • [7] Understanding Contrastive Learning via Distributionally Robust Optimization
    Wu, Junkang
    Chen, Jiawei
    Wu, Jiancan
    Shi, Wentao
    Wang, Xiang
    He, Xiangnan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Towards Effective and Robust Graph Contrastive Learning With Graph Autoencoding
    Li, Wen-Zhi
    Wang, Chang-Dong
    Lai, Jian-Huang
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 868 - 881
  • [9] LandslideCL: towards robust landslide analysis guided by contrastive learning
    Li, Penglei
    Wang, Yi
    Xu, Guosen
    Wang, Lizhe
    LANDSLIDES, 2023, 20 (02) : 461 - 474
  • [10] Towards Robust Rumor Detection with Graph Contrastive and Curriculum Learning
    Zhuang, Wen-Ming
    Chen, Chih-Yao
    Li, Cheng-Te
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (07)