Bayes pooling of visual phrases for object retrieval

Cited by: 2
Authors
Jiang, Wenhui [1]
Zhao, Zhicheng [1]
Su, Fei [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
Keywords
Visual phrases; Unified framework; Bayes pooling; Burstiness; Similarity
DOI
10.1007/s11042-015-2939-0
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Object retrieval remains an open problem. A promising approach is based on matching visual phrases. However, this approach is often corrupted by visual phrase burstiness, i.e., the repetitive occurrence of certain visual phrases. Burstiness leads to over-counting of the visual patterns that co-occur in two images and thus degrades the accuracy of image similarity measurement. In addition, existing methods cannot capture the complete geometric variation between images. In this paper, we propose a novel strategy to address both problems. First, we propose a unified framework for matching geometry-constrained visual phrases. This framework makes it possible to combine the optimal geometry constraints to improve the validity of matched visual phrases. Second, we address visual phrase burstiness from a probabilistic view. This approach effectively filters out bursty visual phrases by explicitly modelling their distribution. Experiments on five benchmark datasets demonstrate that our method outperforms other approaches consistently and significantly.
Pages: 9095-9119
Page count: 25
Related papers
50 in total
  • [31] Approximate object location deep visual representations for image retrieval
    Liao, Kaiyang
    Huang, Gang
    Zheng, Yuanlin
    Lin, Guangfeng
    Cao, Congjun
    DISPLAYS, 2023, 77
  • [32] Query-adaptive asymmetrical dissimilarities for visual object retrieval
    Zhu, Cai-Zhi
    Jegou, Herve
    Satoh, Shin'ichi
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1705 - 1712
  • [33] Object-based image retrieval beyond visual appearances
    Zheng, Yan-Tao
    Neo, Shi-Yong
    Chua, Tat-Seng
    Tian, Qi
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2008, 4903 : 13 - +
  • [34] Fine-grained visual classification via multilayer bilinear pooling with object localization
    Ming Li
    Lin Lei
    Hao Sun
    Xiao Li
    Gangyao Kuang
    The Visual Computer, 2022, 38 : 811 - 820
  • [35] Fine-grained visual classification via multilayer bilinear pooling with object localization
    Li, Ming
    Lei, Lin
    Sun, Hao
    Li, Xiao
    Kuang, Gangyao
    VISUAL COMPUTER, 2022, 38 (03): : 811 - 820
  • [36] Mining Visual Phrases for Visual Robot Localization
    Tanaka, Kanji
    Chokushi, Yuuto
    Ando, Masatoshi
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (01) : 57 - 65
  • [37] Visual object retrieval via block-based visual-pattern matching
    Cheng, Shyi-Chyi
    Kuo, Chen-Tsung
    Chen, Hong-Jay
    PATTERN RECOGNITION, 2007, 40 (06) : 1695 - 1710
  • [38] Recognition Using Visual Phrases
    Sadeghi, Mohammad Amin
    Farhadi, Ali
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1745 - 1752
  • [39] Online Multiple Object Tracking Using Spatial Pyramid Pooling Hashing and Image Retrieval for Autonomous Driving
    Wei, Hongjian
    Huang, Yingping
    MACHINES, 2022, 10 (08)
  • [40] Visual-based and object-conscious image retrieval by block reallocation into object region
    Mochizuki, Takahiro
    Kawai, Yoshihiko
    Sano, Masanori
    Sumiyoshi, Hideki
    Fujii, Mahito
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2016, 11 : S44 - S52