Multistream Gaze Estimation with Anatomical Eye Region Isolation by Synthetic to Real Transfer Learning

被引:1
|
作者
Mahmud Z. [1 ,3 ]
Hungler P. [3 ]
Etemad A. [1 ,3 ]
机构
[1] s University, Kingston, Ontario
[2] s University, Kingston, Ontario
来源
关键词
deep neural network; domain randomization; Estimation; eye region segmentation; Feature extraction; Gaze estimation; Head; Iris; Lighting; multistream network; Synthetic data; Training; transfer learning;
D O I
10.1109/TAI.2024.3366174
中图分类号
学科分类号
摘要
We propose a novel neural pipeline, MSGazeNet, that learns gaze representations by taking advantage of the eye anatomy information through a multistream framework. Our proposed solution comprises two components, first a network for isolating anatomical eye regions, and a second network for multistream gaze estimation. The eye region isolation is performed with a U-Net style network which we train using a synthetic dataset that contains eye region masks for the visible eyeball and the iris region. The synthetic dataset used in this stage is procured using the UnityEyes simulator, and consists of 80,000 eye images. Successive to training, the eye region isolation network is then transferred to the real domain for generating masks for the real-world eye images. In order to successfully make the transfer, we exploit domain randomization in the training process, which allows for the synthetic images to benefit from a larger variance with the help of augmentations that resemble artifacts. The generated eye region masks along with the raw eye images are then used together as a multistream input to our gaze estimation network, which consists of wide residual blocks. The output embeddings from these encoders are fused in the channel dimension before feeding into the gaze regression layers. We evaluate our framework on three gaze estimation datasets and achieve strong performances. Our method surpasses the state-of-the-art by 7.57% and 1.85% on two datasets, and obtains competitive results on the other. We also study the robustness of our method with respect to the noise in the data and demonstrate that our model is less sensitive to noisy data. Lastly, we perform a variety of experiments including ablation studies to evaluate the contribution of different components and design choices in our solution. IEEE
引用
收藏
页码:1 / 15
页数:14
相关论文
共 50 条
  • [1] Real Time Eye Gaze Estimation
    Anwar, Suzan
    Milanova, Mariofanna
    Svetleff, Zvetomira
    Abdulla, Shereen
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 526 - 531
  • [2] Learning to Find Eye Region Landmarks for Remote Gaze Estimation in Unconstrained Settings
    Park, Seonwook
    Zhang, Xucong
    Bulling, Andreas
    Hilliges, Otmar
    2018 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS (ETRA 2018), 2018,
  • [3] Eye Gaze Region Estimation via Multi-scale Sparse Dictionary Learning
    Yuan, Guoliang
    Wang, Yafei
    Zhao, Tongtong
    Ding, Xueyan
    Mi, Zetian
    Fu, Xianping
    PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019, : 1459 - 1463
  • [4] Driver Gaze Region Estimation without Use of Eye Movement
    Fridman, Lex
    Langhans, Philipp
    Lee, Joonbum
    Reimer, Bryan
    IEEE INTELLIGENT SYSTEMS, 2016, 31 (03) : 49 - 56
  • [5] EM-Gaze: eye context correlation and metric learning for gaze estimation
    Zhou, Jinchao
    Li, Guoan
    Shi, Feng
    Guo, Xiaoyan
    Wan, Pengfei
    Wang, Miao
    VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2023, 6 (01)
  • [6] EM-Gaze: eye context correlation and metric learning for gaze estimation
    Jinchao Zhou
    Guoan Li
    Feng Shi
    Xiaoyan Guo
    Pengfei Wan
    Miao Wang
    Visual Computing for Industry, Biomedicine, and Art, 6
  • [7] Cascaded learning with transformer for simultaneous eye landmark, eye state and gaze estimation
    Gou, Chao
    Yu, Yuezhao
    Guo, Zipeng
    Xiong, Chen
    Cai, Ming
    PATTERN RECOGNITION, 2024, 156
  • [8] A 3D Morphable Eye Region Model for Gaze Estimation
    Wood, Erroll
    Baltrusaitis, Tadas
    Morency, Louis-Philippe
    Robinson, Peter
    Bulling, Andreas
    COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 297 - 313
  • [9] Driver's Gaze Zone Estimation by Transfer Learning
    Tayibnapis, Iman Rahmansyah
    Choi, Min-Kook
    Kwon, Soon
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [10] Real-time estimation of eye gaze by in-ear electrodes
    Favre-Felix, A.
    Graversen, C.
    Dau, T.
    Lunner, T.
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 4086 - 4089