Layout-Aware Dreamer for Embodied Referring Expression Grounding

被引:0
|
作者
Li, Mingxiao [1 ]
Wang, Zehao [2 ]
Tuytelaars, Tinne [2 ]
Moens, Marie-Francine [1 ]
机构
[1] Katholieke Univ Leuven, Comp Sci Dept, Leuven, Belgium
[2] Katholieke Univ Leuven, Elect Engn Dept ESAT PSI, Leuven, Belgium
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we study the problem of Embodied Referring Expression Grounding, where an agent needs to navigate in a previously unseen environment and to localize a remote object described by a concise high-level natural language instruction. When facing such a situation, a human tends to imagine what the destination may look like and to explore the environment based on prior knowledge of the environmental layout, such as the fact that a bathroom is more likely to be found near a bedroom than a kitchen. We have de-signed an autonomous agent called Layout-aware Dreamer (LAD), including two novel modules, that is, the Layout Learner and the Goal Dreamer to mimic this cognitive decision process. The Layout Learner learns to infer the room category distribution of neighboring unexplored areas along the path for coarse layout estimation, which effectively introduces layout common sense of room-to-room transitions to our agent. To learn an effective exploration of the environment, the Goal Dreamer imagines the destination before-hand. Our agent achieves new state-of-the-art performance on the public leaderboard of the REVERIE dataset in challenging unseen test environments with improvement in navigation success (SR) by 4.02% and remote grounding success (RGS) by 3.43% compared to the previous state-of-the-art. The code is released at https://github.com/zehao-wang/LAD
引用
收藏
页码:1386 / 1395
页数:10
相关论文
共 50 条
  • [31] Layout-Aware Limiarization for Readability Enhancement of Degraded Historical Documents
    Bertholdo, Flavio
    Valle, Eduardo
    Araujo, Arnaldo de A.
    DOCENG'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2009, : 131 - 134
  • [32] Efficient layout-aware statistical analysis for photonic integrated circuits
    Jhoja, Jaspreet
    Lu, Zeqin
    Pond, James
    Chrostowski, Lukas
    OPTICS EXPRESS, 2020, 28 (06): : 7799 - 7816
  • [33] AIDA: Robust Layout-Aware Synthesis of Analog ICs including Sizing and Layout Generation
    Martins, Ricardo
    Lourenco, Nuno
    Canelas, Antonio
    Povoa, Ricardo
    Horta, Nuno
    2015 INTERNATIONAL CONFERENCE ON SYNTHESIS, MODELING, ANALYSIS AND SIMULATION METHODS AND APPLICATIONS TO CIRCUIT DESIGN (SMACD), 2015,
  • [34] Layout-Aware I/O Scheduling for Terabits Data Movement
    Kim, Youngjae
    Atchley, Scott
    Vallee, Geoffroy R.
    Shipman, Galen M.
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [35] LAME: Layout-Aware Metadata Extraction Approach for Research Articles
    Choi, Jongyun
    Kong, Hyesoo
    Yoon, Hwamook
    Oh, Heungseon
    Jung, Yuchul
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 4019 - 4037
  • [36] Detour: Layout-aware Reroute Attack Vulnerability Assessment and Analysis
    Gao, Minyan
    Forte, Domenic
    2023 IEEE INTERNATIONAL SYMPOSIUM ON HARDWARE ORIENTED SECURITY AND TRUST, HOST, 2023, : 122 - 132
  • [37] Layout-Aware Bidirectional Transfer Network for Fashion Landmark Detection
    Xie, Huosheng
    Chen, Jiaqi
    THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [38] AIDA: Layout-aware analog circuit-level sizing with in-loop layout generation
    Lourenco, Nuno
    Martins, Ricardo
    Canelas, Antonio
    Povoa, Ricardo
    Horta, Nuno
    INTEGRATION-THE VLSI JOURNAL, 2016, 55 : 316 - 329
  • [39] Systematic Design of a Voltage Controlled Oscillator using a Layout-Aware Approach
    Passos, F.
    Roca, E.
    Castro-Lopez, R.
    Fernandez, F. V.
    Martins, R.
    Lourenco, N.
    Povoa, R.
    Canelas, A.
    Horta, N.
    2017 14TH INTERNATIONAL CONFERENCE ON SYNTHESIS, MODELING, ANALYSIS AND SIMULATION METHODS AND APPLICATIONS TO CIRCUIT DESIGN (SMACD), 2017,
  • [40] Layout-aware scientific computing: A case study using the MILC code
    He, Jun
    Kowalkowski, Jim
    Paterno, Marc
    Holmgren, Don
    Simone, James
    Sun, Xian-He
    JOURNAL OF COMPUTATIONAL SCIENCE, 2013, 4 (06) : 496 - 506