Efficient Supervised Discrete Multi-View Hashing for Large-Scale Multimedia Search

被引:60
|
作者
Lu, Xu [1 ]
Zhu, Lei [1 ]
Li, Jingjing [2 ]
Zhang, Huaxiang [1 ]
Shen, Heng Tao [2 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Binary codes; Semantics; Optimization; Training; Quantization (signal); Kernel; Search problems; Hashing; multi-view; multimedia search; CODES;
D O I
10.1109/TMM.2019.2947358
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hashing has recently received substantial attention in large-scale multimedia search for its extremely low-cost storage cost and high retrieval efficiency. However, most existing hashing techniques focus on learning hash codes for single-view or cross-view retrieval. It is still an unsolved problem that how to efficiently learn discriminative binary codes for multi-view data that is common in real world multimedia search. In this paper, we propose an efficient Supervised Discrete Multi-view Hashing (SDMH) to solve the problem. SDMH first properly detects the shared binary hash codes, with an integrated multi-view feature mapping and latent hash coding, by exploiting the complementarity of different view-specific features and removing the involved inter-view redundancy. To further enhance the discriminative capability of hash codes, SDMH directly represses the explicit semantic labels of data samples with their corresponding binary codes. Different from most existing multi-view hashing methods that adopt "relaxing+rounding" hash optimization strategy or the discrete optimization method based on discrete cyclic coordinate descent, an efficient augmented Lagrangian multiplier (ALM) based discrete hash optimization method is developed in this paper to optimize the hash codes within a single step. Experimental results on four benchmark datasets demonstrate the superior performance of the proposed approach over state-of-the-art hashing techniques, in terms of both learning efficiency and retrieval accuracy.
引用
收藏
页码:2048 / 2060
页数:13
相关论文
共 50 条
  • [31] Contextual Hashing for Large-Scale Image Search
    Liu, Zhen
    Li, Houqiang
    Zhou, Wengang
    Zhao, Ruizhen
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (04) : 1606 - 1614
  • [32] Large-scale image retrieval with supervised sparse hashing
    Xu, Yan
    Shen, Fumin
    Xu, Xing
    Gao, Lianli
    Wang, Yuan
    Tan, Xiao
    NEUROCOMPUTING, 2017, 229 : 45 - 53
  • [33] Efficient large-scale multi-view stereo for ultra high-resolution image sets
    Tola, Engin
    Strecha, Christoph
    Fua, Pascal
    MACHINE VISION AND APPLICATIONS, 2012, 23 (05) : 903 - 920
  • [34] Efficient large-scale multi-view stereo for ultra high-resolution image sets
    Engin Tola
    Christoph Strecha
    Pascal Fua
    Machine Vision and Applications, 2012, 23 : 903 - 920
  • [35] Highly-efficient Incomplete Large-scale Multi-view Clustering with Consensus Bipartite Graph
    Wang, Siwei
    Liu, Xinwang
    Liu, Li
    Tu, Wenxuan
    Zhu, Xinzhong
    Liu, Jiyuan
    Zhou, Sihang
    Zhu, En
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9766 - 9775
  • [36] Efficient Supervised Graph Embedding Hashing for large-scale cross-media retrieval
    Yao, Tao
    Wang, Ruxin
    Wang, Jintao
    Li, Ying
    Yue, Jun
    Yan, Lianshan
    Tian, Qi
    PATTERN RECOGNITION, 2024, 145
  • [37] NSDH: A Nonlinear Supervised Discrete Hashing framework for large-scale cross-modal retrieval
    Yang, Zhan
    Yang, Liu
    Raymond, Osolo Ian
    Zhu, Lei
    Huang, Wenti
    Liao, Zhifang
    Long, Jun
    KNOWLEDGE-BASED SYSTEMS, 2021, 217
  • [38] Learning the consensus and complementary information for large-scale multi-view clustering
    Liu, Maoshan
    Palade, Vasile
    Zheng, Zhonglong
    NEURAL NETWORKS, 2024, 172
  • [39] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks
    Yao, Yao
    Luo, Zixin
    Li, Shiwei
    Zhang, Jingyang
    Ren, Yufan
    Zhou, Lei
    Fang, Tian
    Quan, Long
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1787 - 1796
  • [40] Triplets-based large-scale multi-view spectral clustering
    Yang, Tianchuan
    Wang, Chang-Dong
    Guo, Jipeng
    Li, Xiangcheng
    Chen, Man-Sheng
    Dang, Shuping
    Chen, Haiqiang
    INFORMATION FUSION, 2025, 121