Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video

被引:107
|
作者
Li, Jia [2 ,3 ]
Tian, Yonghong [1 ]
Huang, Tiejun [1 ]
Gao, Wen [1 ]
机构
[1] Peking Univ, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
关键词
Visual saliency; Probabilistic framework; Visual search tasks; Multi-task learning; ATTENTION; MODEL;
D O I
10.1007/s11263-010-0354-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a probabilistic multi-task learning approach for visual saliency estimation in video. In our approach, the problem of visual saliency estimation is modeled by simultaneously considering the stimulus-driven and task-related factors in a probabilistic framework. In this framework, a stimulus-driven component simulates the low-level processes in human vision system using multi-scale wavelet decomposition and unbiased feature competition; while a task-related component simulates the high-level processes to bias the competition of the input features. Different from existing approaches, we propose a multi-task learning algorithm to learn the task-related "stimulus-saliency" mapping functions for each scene. The algorithm also learns various fusion strategies, which are used to integrate the stimulus-driven and task-related components to obtain the visual saliency. Extensive experiments were carried out on two public eye-fixation datasets and one regional saliency dataset. Experimental results show that our approach outperforms eight state-of-the-art approaches remarkably.
引用
收藏
页码:150 / 165
页数:16
相关论文
共 50 条
  • [41] Multi-task learning とmulti-stream monocular depth estimation using integrated model with multi-task learning and multi-stream
    Takamine, Michiru
    Endo, Satoshi
    Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (05): : 1 - 9
  • [42] A multi-task learning framework for gas detection and concentration estimation
    Liu, Huixiang
    Li, Qing
    Gu, Yu
    NEUROCOMPUTING, 2020, 416 : 28 - 37
  • [43] Enhancing Direction-of-Arrival Estimation with Multi-Task Learning
    Bianco, Simone
    Celona, Luigi
    Crotti, Paolo
    Napoletano, Paolo
    Petraglia, Giovanni
    Vinetti, Pietro
    SENSORS, 2024, 24 (22)
  • [44] MTLM: a multi-task learning model for travel time estimation
    Xu, Saijun
    Zhang, Ruoqian
    Cheng, Wanjun
    Xu, Jiajie
    GEOINFORMATICA, 2022, 26 (02) : 379 - 395
  • [45] MTLM: a multi-task learning model for travel time estimation
    Saijun Xu
    Ruoqian Zhang
    Wanjun Cheng
    Jiajie Xu
    GeoInformatica, 2022, 26 : 379 - 395
  • [46] Simultaneous Estimation of Dish Locations and Calories with Multi-Task Learning
    Ege, Takumi
    Yanai, Keiji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (07) : 1240 - 1246
  • [47] A Multi-task Learning Method for Direct Estimation of Spinal Curvature
    Wang, Jiacheng
    Wang, Liansheng
    Liu, Changhua
    COMPUTATIONAL METHODS AND CLINICAL APPLICATIONS FOR SPINE IMAGING, CSI 2019, 2020, 11963 : 113 - 118
  • [48] Semi-Supervised Depth Estimation by Multi-Task Learning
    Fu, Qingshun
    Dong, Xuan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3765 - 3771
  • [49] Group LASSO with Asymmetric Structure Estimation for Multi-Task Learning
    Oliveira, Saullo H. G.
    Goncalves, Andre R.
    Von Zuben, Fernando J.
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3202 - 3208
  • [50] Visual Person Understanding Through Multi-task and Multi-dataset Learning
    Pfeiffer, Kilian
    Hermans, Alexander
    Sarandi, Istvan
    Weber, Mark
    Leibe, Bastian
    PATTERN RECOGNITION, DAGM GCPR 2019, 2019, 11824 : 551 - 566