Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval

被引:0
|
作者
Hong Zhang
Yan-yun Wang
Hong Pan
Fei Wu
机构
[1] Wuhan University of Science and Technology,College of Computer Science and Technology
[2] Zhejiang University,School of Computer Science and Technology
[3] Hangzhou Normal University,School of Elementary Education
[4] Hangzhou Normal University,School of Information Engineering
关键词
Heterogeneity; Cross-media retrieval; Subspace optimization; Dynamic correlation update; A; TP37; TP391;
D O I
暂无
中图分类号
学科分类号
摘要
Cross-media retrieval is an interesting research topic, which seeks to remove the barriers among different modalities. To enable cross-media retrieval, it is needed to find the correlation measures between heterogeneous low-level features and to judge the semantic similarity. This paper presents a novel approach to learn cross-media correlation between visual features and auditory features for image-audio retrieval. A semi-supervised correlation preserving mapping (SSCPM) method is described to construct the isomorphic SSCPM subspace where canonical correlations between the original visual and auditory features are further preserved. Subspace optimization algorithm is proposed to improve the local image cluster and audio cluster quality in an interactive way. A unique relevance feedback strategy is developed to update the knowledge of cross-media correlation by learning from user behaviors, so retrieval performance is enhanced in a progressive manner. Experimental results show that the performance of our approach is effective.
引用
收藏
页码:241 / 249
页数:8
相关论文
共 50 条
  • [2] Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval
    Zhang, Hong
    Wang, Yan-yun
    Pan, Hong
    Wu, Fei
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2008, 9 (02): : 241 - 249
  • [3] Boosting Cross-media Retrieval via Visual-Auditory Feature Analysis and Relevance Feedback
    Zhang, Hong
    Yuan, Junsong
    Gao, Xingyu
    Chen, Zhenyu
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 953 - 956
  • [4] Nonnegative cross-media recoding of visual-auditory content for social media analysis
    Hong Zhang
    Xin Xu
    Multimedia Tools and Applications, 2015, 74 : 577 - 593
  • [5] Nonnegative cross-media recoding of visual-auditory content for social media analysis
    Zhang, Hong
    Xu, Xin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (02) : 577 - 593
  • [6] Mining semantic correlation of heterogeneous multimedia data for cross-media retrieval
    Zhuang, Yue-Ting
    Yang, Yi
    Wu, Fei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (02) : 221 - 229
  • [7] Bridging the gap between visual and auditory feature spaces for cross-media retrieval
    Hong Zhang
    Fei Wu
    ADVANCES IN MULTIMEDIA MODELING, PT 1, 2007, 4351 : 596 - 605
  • [8] Structural Fusion of Heterogeneous Visual-Auditory Features for Multimedia Analysis
    Zhang, Hong
    Nie, Jiamei
    Chen, Li
    2013 10TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2013, : 821 - 825
  • [9] An Approach for Mining Heterogeneous Data for Cross-Media Retrieval
    Pavan, K. Madhu
    Ananthanarayana, V. S.
    2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [10] CROSS-MODALITY CORRELATION PROPAGATION FOR CROSS-MEDIA RETRIEVAL
    Zhai, Xiaohua
    Peng, Yuxin
    Xiao, Jianguo
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2337 - 2340