Residual-Network-Based Supervised Gaze Prediction for First-Person Videos

被引:1
|
作者
Li, Yujie [1 ]
Ding, Shuxue [2 ]
Li, Xiang [3 ]
Tan, Benying [1 ,3 ]
Kanemura, Atsunori [1 ,4 ,5 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki 3058560, Japan
[2] Guilin Univ Elect Technol, Sch Artificial Intelligence, Guilin 541004, Peoples R China
[3] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, Fukushima 9650005, Japan
[4] LeapMind Inc, Tokyo 1500044, Japan
[5] Adv Telecommun Res Inst Int, Kyoto 6190288, Japan
基金
日本学术振兴会;
关键词
Gaze prediction; first-person vision (FPV); saliency detection; convolution neural network (CNN); residual network; SALIENCY;
D O I
10.1109/ACCESS.2019.2913791
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gaze prediction is a significant problem in efficiently processing and understanding a large number of incoming visual signals from first-person views (i.e., egocentric vision). Because many visual processes are expensive and human beings do not process the whole visual field, thus knowing the gaze position is an efficient way to understand the salient content of a video and what users pay attention to. However, current methods for gaze prediction are bottom-up methods and cannot incorporate information about user actions. We proposed a supervised gaze prediction framework based on a residual network, which takes the gaze of user action into consideration. Our model uses the features extracted from the VGG-16 deep neural network to predict the gaze position in FPV videos. The deep residual networks are introduced to combine with this model for learning the residual maps. Our proposed method attempts to obtain gaze prediction results with high accuracy. According to the experimental results, the performance of our proposed gaze prediction method is competitive with that of the state-of-the-art approaches.
引用
收藏
页码:56208 / 56216
页数:9
相关论文
共 50 条
  • [31] First-Person Animal Activity Recognition from Egocentric Videos
    Iwashita, Yumi
    Takamine, Asamichi
    Kurazume, Ryo
    Ryoo, M. S.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4310 - 4315
  • [32] Ranking Based Boosted Multiple Kernel Learning For Activity Recognition on First-Person Videos
    Ozkan, Fatih
    Surer, Elif
    Temizel, Alptekin
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [33] Collective First-Person Vision for Automatic Gaze Analysis in Multiparty Conversations
    Kumano, Shiro
    Otsuka, Kazuhiro
    Ishii, Ryo
    Yamato, Junji
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (01) : 107 - 122
  • [34] AR Tips: Augmented First-Person View Task Instruction Videos
    Lee, Gun A.
    Ahn, Seungjun
    Hoff, William
    Billinghurst, Mark
    ADJUNCT PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT 2019), 2019, : 34 - 36
  • [35] Browsing Group First-Person Videos with 3D Visualization
    Sugita, Yuki
    Higuchi, Keita
    Yonetani, Ryo
    Kamikubo, Rie
    Sato, Yoichi
    PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON INTERACTIVE SURFACES AND SPACES (ISS'18), 2018, : 55 - 60
  • [36] MUTUAL REFERENCE FRAME-QUALITY ASSESSMENT FOR FIRST-PERSON VIDEOS
    Bai, Chen
    Reibman, Amy R.
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 290 - 294
  • [37] Summarizing First-Person Videos from Third Persons' Points of Views
    Ho, Hsuan-, I
    Chiu, Wei-Chen
    Wang, Yu-Chiang Frank
    COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 72 - 89
  • [38] EgoScanning: Quickly Scanning First-Person Videos with Egocentric Elastic Timelines
    Higuchi, Keita
    Yonetani, Ryo
    Sato, Yoichi
    SA'17: SIGGRAPH ASIA 2017 EMERGING TECHNOLOGIES, 2017,
  • [39] EgoScanning: Quickly Scanning First-Person Videos with Egocentric Elastic Timelines
    Higuch, Keita
    Yonetani, Ryo
    Sato, Yoichi
    PROCEEDINGS OF THE 2017 ACM SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'17), 2017, : 6536 - 6546
  • [40] Neural Network Based Scope Positioning System for First-Person Shooters
    Izmailov, Ruslan S.
    Voronov, Igor A.
    PROCEEDINGS OF THE 2021 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (ELCONRUS), 2021, : 429 - 433