Embodied Question Answering

Cited by: 14
Authors
Das, Abhishek [1, 2]
Datta, Samyak [1]
Gkioxari, Georgia [2]
Lee, Stefan [1]
Parikh, Devi [1, 2]
Batra, Dhruv [1, 2]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Facebook AI Res, Menlo Pk, CA USA
DOI
10.1109/CVPRW.2018.00279
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a new AI task - Embodied Question Answering (EmbodiedQA) - where an agent is spawned at a random location in a 3D environment and asked a question ('What color is the car?'). In order to answer, the agent must first intelligently navigate to explore the environment, gather necessary visual information through first-person (egocentric) vision, and then answer the question ('orange'). EmbodiedQA requires a range of AI skills - language understanding, visual recognition, active perception, goal-driven navigation, commonsense reasoning, long-term memory, and grounding language into actions. In this work, we develop a dataset of questions and answers in House3D environments [1], evaluation metrics, and a hierarchical model trained with imitation and reinforcement learning.
Pages: 2135-2144
Page count: 10
Related papers
50 in total
  • [31] ANSWERING SAUNDERS QUESTION
    LUERS, JK
    FICHTL, G
    ASTRONAUTICS & AERONAUTICS, 1972, 10 (04): : 4 - &
  • [32] QUESTION ANSWERING SYSTEMS
    Tomljanovic, Jasminka
    Krsnik, Marina
    Pavlic, Mile
    ZBORNIK VELEUCILISTA U RIJECI-JOURNAL OF THE POLYTECHNICS OF RIJEKA, 2014, 2 (01): : 177 - 195
  • [33] ANSWERING THE CATT QUESTION
    Boute, Raymond
    ELECTRONICS WORLD, 2012, 118 (1914): : 38 - 38
  • [34] Answering the Ethical Question
    Nye, Sebastian
    RATIO, 2013, 26 (03) : 279 - 298
  • [35] Answering the Correct Question
    Craig, Michelle
    Petersen, Andrew
    Campbell, Jennifer
    PROCEEDINGS OF THE ACM CONFERENCE ON GLOBAL COMPUTING EDUCATION (COMPED '19), 2019, : 72 - 77
  • [36] Question answering for Biology
    Neves, Mariana
    Leser, Ulf
    METHODS, 2015, 74 : 36 - 46
  • [37] Answering the Question, why?
    Louthan Jr., McIntyre R.
    Journal of Failure Analysis and Prevention, 2001, 1 (02)
  • [38] Contextualized question answering
    Bradeško L.
    Dali L.
    Fortuna B.
    Grobelnik M.
    Mladenić D.
    Novalija I.
    Pajntar B.
    Journal of Computing and Information Technology, 2010, 18 (04) : 325 - 332
  • [39] Locate Before Answering: Answer Guided Question Localization for Video Question Answering
    Qian, Tianwen
    Cui, Ran
    Chen, Jingjing
    Peng, Pai
    Guo, Xiaowei
    Jiang, Yu-Gang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4554 - 4563
  • [40] Enhancing yes/no question answering with weak supervision via extractive question answering
    Dimitris Dimitriadis
    Grigorios Tsoumakas
    Applied Intelligence, 2023, 53 : 27560 - 27570