Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video

被引:107
|
作者
Li, Jia [2 ,3 ]
Tian, Yonghong [1 ]
Huang, Tiejun [1 ]
Gao, Wen [1 ]
机构
[1] Peking Univ, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
关键词
Visual saliency; Probabilistic framework; Visual search tasks; Multi-task learning; ATTENTION; MODEL;
D O I
10.1007/s11263-010-0354-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a probabilistic multi-task learning approach for visual saliency estimation in video. In our approach, the problem of visual saliency estimation is modeled by simultaneously considering the stimulus-driven and task-related factors in a probabilistic framework. In this framework, a stimulus-driven component simulates the low-level processes in human vision system using multi-scale wavelet decomposition and unbiased feature competition; while a task-related component simulates the high-level processes to bias the competition of the input features. Different from existing approaches, we propose a multi-task learning algorithm to learn the task-related "stimulus-saliency" mapping functions for each scene. The algorithm also learns various fusion strategies, which are used to integrate the stimulus-driven and task-related components to obtain the visual saliency. Extensive experiments were carried out on two public eye-fixation datasets and one regional saliency dataset. Experimental results show that our approach outperforms eight state-of-the-art approaches remarkably.
引用
收藏
页码:150 / 165
页数:16
相关论文
共 50 条
  • [31] Deep multi-task learning for image/video distortions identification
    Ameur, Zoubida
    Fezza, Sid Ahmed
    Hamidouche, Wassim
    Neural Computing and Applications, 2022, 34 (24) : 21607 - 21623
  • [32] Multi-label Annotation for Visual Multi-Task Learning Models
    Sharma, Gaurang
    Angleraud, Alexandre
    Pieters, Roel
    2023 SEVENTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC 2023, 2023, : 31 - 34
  • [33] ONLINE MULTI-TASK LEARNING FOR SEMANTIC CONCEPT DETECTION IN VIDEO
    Markatopoulou, Foteini
    Mezaris, Vasileios
    Patras, Ioannis
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 186 - 190
  • [34] Deep multi-task learning for image/video distortions identification
    Ameur, Zoubida
    Fezza, Sid Ahmed
    Hamidouche, Wassim
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 21607 - 21623
  • [35] MULTI-TASK LEARNING OF GENERALIZABLE REPRESENTATIONS FOR VIDEO ACTION RECOGNITION
    Yao, Zhiyu
    Wang, Yunbo
    Long, Mingsheng
    Wang, Jianmin
    Yu, Philip S.
    Sun, Jiaguang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [36] Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning
    Nguyen, A. Tuan
    Jeong, Hyewon
    Yang, Eunho
    Hwang, Sung Ju
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9081 - 9091
  • [37] Task-conditioned adaptation of visual features in multi-task policy learning
    Marza, Pierre
    Matignon, Laetitia
    Simonin, Olivier
    Wolf, Christian
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 17847 - 17856
  • [38] Robust Visual Tracking via Multi-Task Sparse Learning
    Zhang, Tianzhu
    Ghanem, Bernard
    Liu, Si
    Ahuja, Narendra
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2042 - 2049
  • [39] Curriculum learning of visual attribute clusters for multi-task classification
    Sarafianos, Nikolaos
    Giannakopoulos, Theodoros
    Nikou, Christophoros
    Kakadiaris, Ioannis A.
    PATTERN RECOGNITION, 2018, 80 : 94 - 108
  • [40] Learning Multi-Task Correlation Particle Filters for Visual Tracking
    Zhang, Tianzhu
    Xu, Changsheng
    Yang, Ming-Hsuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) : 365 - 378