Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning

被引:0
|
作者
Yang, Zhibo [1 ]
Huang, Lihan [1 ]
Chen, Yupei [1 ]
Wei, Zijun [2 ]
Ahn, Seoyoung [1 ]
Zelinsky, Gregory [1 ]
Samaras, Dimitris [1 ]
Hoai, Minh [1 ]
机构
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
[2] Adobe Inc, San Jose, CA USA
基金
美国国家科学基金会;
关键词
EYE-MOVEMENTS; SEARCH; MODEL; GUIDANCE; SCENES;
D O I
10.1109/CVPR42600.2020.00027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human gaze behavior prediction is important for behavioral vision and for computer vision applications. Most models mainly focus on predicting free-viewing behavior using saliency maps, but do not generalize to goal-directed behavior, such as when a person searches for a visual target object. We propose the first inverse reinforcement learning (IRL) model to learn the internal reward function and policy used by humans during visual search. We modeled the viewer's internal belief states as dynamic contextual belief maps of object locations. These maps were learned and then used to predict behavioral scanpaths for multiple target categories. To train and evaluate our IRL model we created COCO-Search18, which is now the largest dataset of high-quality search fixations in existence. COCO-Search18 has 10 participants searching for each of 18 target-object categories in 6202 images, making about 300,000 goal-directed fixations. When trained and evaluated on COCO-Search18, the IRL model outperformed baseline models in predicting search fixation scanpaths, both in terms of similarity to human search behavior and search efficiency. Finally, reward maps recovered by the IRL model reveal distinctive target-dependent patterns of object prioritization, which we interpret as a learned object context.
引用
收藏
页码:190 / 199
页数:10
相关论文
共 50 条
  • [1] Goal-directed graph construction using reinforcement learning
    Darvariu, Victor-Alexandru
    Hailes, Stephen
    Musolesi, Mirco
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2021, 477 (2254):
  • [2] Goal-directed behaviours by reinforcement learning
    Johannet, A
    Sarda, I
    NEUROCOMPUTING, 1999, 28 : 107 - 125
  • [3] Reinforcement learning with goal-directed eligibility traces
    Andrecut, M
    Ali, MK
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2004, 15 (09): : 1235 - 1247
  • [4] Accelerating Goal-Directed Reinforcement Learning by Model Characterization
    Debnath, Shoubhik
    Sukhatme, Gaurav
    Liu, Lantao
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 8666 - 8673
  • [5] Human generalization of internal representations through prototype learning with goal-directed attention
    Pettine, Warren Woodrich
    Raman, Dhruva Venkita
    Redish, A. David
    Murray, John D.
    NATURE HUMAN BEHAVIOUR, 2023, 7 (03) : 442 - +
  • [6] Human generalization of internal representations through prototype learning with goal-directed attention
    Warren Woodrich Pettine
    Dhruva Venkita Raman
    A. David Redish
    John D. Murray
    Nature Human Behaviour, 2023, 7 : 442 - 463
  • [7] THE IMPACT OF STIMULUS VALUE ON GOAL-DIRECTED AVERSIVE REINFORCEMENT LEARNING
    Lindstrom, Bjorn
    Golkar, Armita
    Olsson, Andreas
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2013, : 155 - 155
  • [8] Goal-Directed Feature Learning
    Weber, Cornelius
    Triesch, Jochen
    IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 3355 - 3362
  • [9] Goal-directed EEG activity evoked by discriminative stimuli in reinforcement learning
    Luque, David
    Moris, Joaquin
    Rushby, Jacqueline A.
    Le Pelley, Mike E.
    PSYCHOPHYSIOLOGY, 2015, 52 (02) : 238 - 248
  • [10] Corticostriatal Correlates of Human Goal-Directed Learning and Motivation
    Eryilmaz, Hamdi
    Rodriguez-Thompson, Anais
    Huntington, Franklin C.
    Giegold, Madeline
    Tanner, Alexandra S.
    Roffman, Joshua L.
    BIOLOGICAL PSYCHIATRY, 2016, 79 (09) : 214S - 214S