Incorporating Audio Signals into Constructing a Visual Saliency Map

Cited by: 0
Authors
Nakajima, Jiro [1 ]
Sugimoto, Akihiro [2 ]
Kawamoto, Kazuhiko [1 ]
Affiliations
[1] Chiba Univ, Chiba, Japan
[2] Natl Inst Informat, Tokyo, Japan
Source
Keywords
gaze; visual attention; visual saliency; auditory saliency; audio signal; video; sound source feature; AUDITORY ATTENTION; MODEL;
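DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract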
The saliency map has been proposed to identify regions that draw human visual attention. Feature differences from the surroundings are computed hierarchically at multiple resolutions for an image or an image sequence, and they are fused in a fully bottom-up manner to obtain a saliency map. A video usually contains sound, and auditory stimuli as well as visual stimuli attract human attention. Nevertheless, most conventional methods discard auditory information and use image information alone to compute a saliency map. This paper presents a method for constructing a visual saliency map by integrating image features with auditory features. We assume a single moving sound source in a video and introduce a sound source feature. Our method detects the sound source feature using the correlation between audio signals and sound source motion, and computes its importance in each frame of the video using an auditory saliency map. The importance is used to fuse the sound source feature with image features to construct a visual saliency map. Experiments with human subjects demonstrate that the saliency map produced by the proposed method reflects human visual attention more accurately than that produced by a conventional method.
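The abstract gives no formulas, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes the sound source can be localized by the per-pixel Pearson correlation between a per-frame audio energy signal and a per-pixel motion magnitude, and that the fusion is a convex combination weighted by a scalar importance in [0, 1]. The function names `sound_source_map` and `fuse`, the choice of Pearson correlation, and the convex weighting are all illustrative assumptions; in the paper the importance comes from an auditory saliency map, whereas here it is simply passed in.

```python
import numpy as np


def sound_source_map(audio_energy, motion):
    """Hypothetical sound source feature.

    audio_energy: shape (T,), one audio energy value per frame.
    motion:       shape (T, H, W), motion magnitude per pixel per frame.
    Returns an (H, W) map of the Pearson correlation over time between
    the audio energy and each pixel's motion, clipped to [0, 1].
    """
    a = audio_energy - audio_energy.mean()
    m = motion - motion.mean(axis=0)
    num = np.tensordot(a, m, axes=(0, 0))                 # (H, W)
    den = np.sqrt((a ** 2).sum() * (m ** 2).sum(axis=0))  # (H, W)
    # Pixels with no temporal variation get correlation 0.
    corr = np.divide(num, den, out=np.zeros_like(num), where=den > 0)
    return np.clip(corr, 0.0, 1.0)


def fuse(image_saliency, source_map, importance):
    """Assumed fusion rule: convex combination, rescaled to peak at 1."""
    w = float(np.clip(importance, 0.0, 1.0))
    fused = (1.0 - w) * image_saliency + w * source_map
    peak = fused.max()
    return fused / peak if peak > 0 else fused
```

With an importance of 0 the result reduces to the normalized image-only saliency map, and with an importance of 1 it is driven entirely by the sound source feature, which mirrors the role the abstract assigns to the per-frame importance.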
Pages: 468-480
Number of pages: 13
Related papers
50 in total
  • [31] Blotch Detection in Archive Films Based on Visual Saliency Map
    Aydin, Yildiz
    Dizdaroglu, Bekir
    COMPLEXITY, 2020, 2020
  • [32] Fast Visual Saliency Map Extraction from Digital Video
    Xu, Shilin
    Lin, Weisi
    Kuo, C-C Jay
    2009 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2009, : 337 - +
  • [33] Fabric Defect Detection Based on Visual Saliency Map and SVM
    Zhang, Hao
    Hu, Jiajuan
    He, Zhiyong
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017, : 322 - 326
  • [34] Analysis and Realization of Saliency Map Based on Visual Attention Mechanism
    Hu, Yaqi
    Meng, Fang
    2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2012, : 415 - 419
  • [35] Synergetic object recognition based on visual attention saliency map
    Shao, Jing
    Gao, Jun
    Yang, Jing
    2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 660 - 665
  • [36] Visual contrast based saliency map generation and object detection
    Li, Deren
    Hu, Xiaoguang
    Zhu, Xinyan
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2012, 37 (04): : 379 - 383
  • [37] An Improved Model of Producing Saliency Map for Visual Attention System
    Huang, Jingang
    Kong, Bin
    Cheng, Erkang
    Zheng, Fei
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2008, 15 : 423 - 431
  • [38] MSR: a Simple and Effective Metric for Visual Saliency Map Fusion
    Jiang, Qingzhu
    Wu, Zemin
    Tian, Chang
    Liu, Tao
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2015, : 432 - 435
  • [39] Saliency map estimation by constructing graphs of possible eye-tracking paths
    Szalai, Szilard
    Sziranyi, Tamas
    Vidnyanszky, Zoltan
    3RD IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM 2012), 2012, : 114 - 119
  • [40] Superior detection of audio-visual signals over visual signals: Are overt movements necessary?
    Doyle, M. C.
    Snowden, R. J.
    PERCEPTION, 1997, 26 : 103 - 104