Unsupervised Gaze Prediction in Egocentric Videos by Energy-based Surprise Modeling

被引:1
|
作者
Aakur, Sathyanarayanan N. [1 ]
Bagavathi, Arunkumar [1 ]
机构
[1] Oklahoma State Univ, Dept Comp Sci, Stillwater, OK 74078 USA
基金
美国国家科学基金会;
关键词
Unsupervised Gaze Prediction; Egocentric Vision; Temporal Event Segmentation; Pattern Theory;
D O I
10.5220/0010288009350942
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Egocentric perception has grown rapidly with the advent of immersive computing devices. Human gaze prediction is an important problem in analyzing egocentric videos and has primarily been tackled through either saliency-based modeling or highly supervised learning. We quantitatively analyze the generalization capabilities of supervised, deep learning models on the egocentric gaze prediction task on unseen, out-of-domain data. We find that their performance is highly dependent on the training data and is restricted to the domains specified in the training annotations. In this work, we tackle the problem of jointly predicting human gaze points and temporal segmentation of egocentric videos without using any training data. We introduce an unsupervised computational model that draws inspiration from cognitive psychology models of event perception. We use Grenander's pattern theory formalism to represent spatial-temporal features and model surprise as a mechanism to predict gaze fixation points. Extensive evaluation on two publicly available datasets - GTEA and GTEA+ datasets-shows that the proposed model can significantly outperform all unsupervised baselines and some supervised gaze prediction baselines. Finally, we show that the model can also temporally segment egocentric videos with a performance comparable to more complex, fully supervised deep learning baselines.
引用
收藏
页码:935 / 942
页数:8
相关论文
共 50 条
  • [1] Unsupervised Segmentation of Action Segments in Egocentric Videos using Gaze
    Hipiny
    Ujir, H.
    Minoi, J. L.
    Juan, S. F. Samson
    Khairuddin, M. A.
    Sunar, M. S.
    2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (ICSIPA), 2017, : 351 - 356
  • [2] FOVEATED NEURAL NETWORK: GAZE PREDICTION ON EGOCENTRIC VIDEOS
    Zhang, Mengmi
    Ma, Keng Teck
    Lim, Joo Hwee
    Zhao, Qi
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3720 - 3724
  • [3] OGaze: Gaze Prediction in Egocentric Videos for Attentional Object Selection
    Al-Naser, Mohammad
    Siddiqui, Shoaib Ahmed
    Ohashi, Hiroki
    Ahmed, Sheraz
    Katsuyki, Nakamura
    Takuto, Sato
    Dengel, Andreas
    2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 270 - 277
  • [4] Egocentric Action Anticipation Based on Unsupervised Gaze Estimation
    ZHONG Cengsi
    FANG Zhijun
    GAO Yongbin
    HUANG Bo
    Wuhan University Journal of Natural Sciences, 2021, 26 (03) : 207 - 214
  • [5] An energy-based modeling and prediction approach for surface roughness in turning
    Xie, Nan
    Zhou, Junfeng
    Zheng, Beirong
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2018, 96 (5-8): : 2293 - 2306
  • [6] An energy-based modeling and prediction approach for surface roughness in turning
    Nan Xie
    Junfeng Zhou
    Beirong Zheng
    The International Journal of Advanced Manufacturing Technology, 2018, 96 : 2293 - 2306
  • [7] Energy-based Modeling in BioNetGen
    Sekar, John A. P.
    Hogg, Justin S.
    Faeder, James R.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1460 - 1467
  • [8] Novelty-based Spatiotemporal Saliency Detection for Prediction of Gaze in Egocentric Video
    Polatsek, Patrik
    Benesova, Wanda
    Paletta, Lucas
    Perko, Roland
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (03) : 394 - 398
  • [9] Energy-based formation pressure prediction
    Oloruntobi, Olalere
    Butt, Stephen
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2019, 173 : 955 - 964
  • [10] Surprise Minimizing Multi-Agent Learning with Energy-based Models
    Suri, Karush
    Shi, Xiao Qi
    Plataniotis, Konstantinos
    Lawryshyn, Yuri
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,