Unsupervised Gaze Prediction in Egocentric Videos by Energy-based Surprise Modeling

被引:1
|
作者
Aakur, Sathyanarayanan N. [1 ]
Bagavathi, Arunkumar [1 ]
机构
[1] Oklahoma State Univ, Dept Comp Sci, Stillwater, OK 74078 USA
基金
美国国家科学基金会;
关键词
Unsupervised Gaze Prediction; Egocentric Vision; Temporal Event Segmentation; Pattern Theory;
D O I
10.5220/0010288009350942
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Egocentric perception has grown rapidly with the advent of immersive computing devices. Human gaze prediction is an important problem in analyzing egocentric videos and has primarily been tackled through either saliency-based modeling or highly supervised learning. We quantitatively analyze the generalization capabilities of supervised, deep learning models on the egocentric gaze prediction task on unseen, out-of-domain data. We find that their performance is highly dependent on the training data and is restricted to the domains specified in the training annotations. In this work, we tackle the problem of jointly predicting human gaze points and temporal segmentation of egocentric videos without using any training data. We introduce an unsupervised computational model that draws inspiration from cognitive psychology models of event perception. We use Grenander's pattern theory formalism to represent spatial-temporal features and model surprise as a mechanism to predict gaze fixation points. Extensive evaluation on two publicly available datasets - GTEA and GTEA+ datasets-shows that the proposed model can significantly outperform all unsupervised baselines and some supervised gaze prediction baselines. Finally, we show that the model can also temporally segment egocentric videos with a performance comparable to more complex, fully supervised deep learning baselines.
引用
收藏
页码:935 / 942
页数:8
相关论文
共 50 条
  • [41] ENERGY-BASED METHODOLOGY FOR THE FATIGUE LIFE PREDICTION OF SOLDER MATERIALS
    VAYNMAN, S
    MCKEOWN, SA
    IEEE TRANSACTIONS ON COMPONENTS HYBRIDS AND MANUFACTURING TECHNOLOGY, 1993, 16 (03): : 317 - 322
  • [42] Analysis of energy-based algorithms for RNA secondary structure prediction
    Monir Hajiaghayi
    Anne Condon
    Holger H Hoos
    BMC Bioinformatics, 13
  • [43] A promising new energy-based fatigue life prediction framework
    Scott-Emuakpor, Onome
    Shen, M. -H. Herman
    Cross, Charles
    Calcaterra, Jeffrey
    George, Tommy
    Proceedings of the ASME Turbo Expo 2005, Vol 4, 2005, : 397 - 404
  • [44] A NEW ENERGY-BASED MULTIAXIAL FATIGUE LIFE PREDICTION PROCEDURE
    Tarar, Wasim
    Scott-Emuakpor, Onome
    Shen, M. -H. Herman
    George, Tommy
    Cross, Charles
    PROCEEDINGS OF THE ASME TURBO EXPO 2008, VOL 5, PT A, 2008, : 215 - 223
  • [45] Energy-based approach for fatigue life prediction of pure copper
    Wang, X. G.
    Crupi, V.
    Jiang, C.
    Feng, E. S.
    Guglielmino, E.
    Wang, C. S.
    INTERNATIONAL JOURNAL OF FATIGUE, 2017, 104 : 243 - 250
  • [46] Homology as a tool in energy-based protein structure prediction.
    Keasar, C
    Skolnick, J
    Elber, R
    PROGRESS IN BIOPHYSICS & MOLECULAR BIOLOGY, 1996, 65 : PA202 - PA202
  • [47] Analysis of energy-based algorithms for RNA secondary structure prediction
    Hajiaghayi, Monir
    Condon, Anne
    Hoos, Holger H.
    BMC BIOINFORMATICS, 2012, 13
  • [48] Energy-based performance prediction for metals in powder bed fusion
    Li, Zhi-Jian
    Dai, Hong-Liang
    Yao, Yuan
    Liu, Jing-Ling
    INTERNATIONAL JOURNAL OF MECHANICAL SCIENCES, 2024, 265
  • [49] A Hybrid Approach for Performance and Energy-Based Cost Prediction in Clouds
    Aldossary, Mohammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (03): : 3531 - 3562
  • [50] Unsupervised Energy-based Adversarial Domain Adaptation for Cross-domain Text Classification
    Zou, Han
    Yang, Jianfei
    Wu, Xiaojian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1208 - 1218