Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval

被引:3
|
作者
Zhu, Yunquan [1 ]
Gao, Xinkai [1 ]
Ke, Bo [1 ]
Qiao, Ruizhi [1 ]
Sun, Xing [1 ]
机构
[1] Tencent, YouTu Lab, Shenzhen, Peoples R China
关键词
DESCRIPTORS; MODEL;
D O I
10.1109/ICCV51070.2023.01034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image retrieval targets to find images from a database that are visually similar to the query image. Two-stage methods following retrieve-and-rerank paradigm have achieved excellent performance, but their separate local and global modules are inefficient to real-world applications. To better trade-off retrieval efficiency and accuracy, some approaches fuse global and local feature into a joint representation to perform single-stage image retrieval. However, they are still challenging due to various situations to tackle, e.g., background, occlusion and viewpoint. In this work, we design a Coarse-to-Fine framework to learn Compact Discriminative representation (CFCD) for end-to-end single- stage image retrieval-requiring only imagelevel labels. Specifically, we first design a novel adaptive softmax-based loss which dynamically tunes its scale and margin within each mini-batch and increases them progressively to strengthen supervision during training and intraclass compactness. Furthermore, we propose a mechanism which attentively selects prominent local descriptors and infuse fine-grained semantic relations into the global representation by a hard negative sampling strategy to optimize inter-class distinctiveness at a global scale. Extensive experimental results have demonstrated the effectiveness of our method, which achieves state-of-the-art single-stage image retrieval performance on benchmarks such as Revisited Oxford and Revisited Paris. Code is available at https://github.com/bassyess/CFCD.
引用
收藏
页码:11226 / 11235
页数:10
相关论文
共 50 条
  • [21] Coarse-to-Fine Image DeHashing Using Deep Pyramidal Residual Learning
    Wang, Yongwei
    Ward, Rabab
    Wang, Z. Jane
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (09) : 1295 - 1299
  • [22] Context-aware coarse-to-fine network for single image desnowing
    Cheng, Yunrui
    Ren, Hao
    Zhang, Rui
    Lu, Hong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 55903 - 55920
  • [23] Zero-shot visual grounding via coarse-to-fine representation learning
    Mi, Jinpeng
    Jin, Shaofei
    Chen, Zhiqian
    Liu, Dan
    Wei, Xian
    Zhang, Jianwei
    NEUROCOMPUTING, 2024, 610
  • [24] Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search
    Huang, Wenxin
    Jia, Xuemei
    Zhong, Xian
    Wang, Xiao
    Jiang, Kui
    Wang, Zheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
  • [25] Two-stage coarse-to-fine image anomaly segmentation and detection model
    Shah, Rizwan Ali
    Urmonov, Odilbek
    Kim, Hyungwon
    IMAGE AND VISION COMPUTING, 2023, 139
  • [26] A Coarse-to-fine Method for Near-duplicate Image Retrieval with Matching Probability Model
    Shi, Jingshu
    Ma, Yong
    Zhou, Huabing
    2017 INTERNATIONAL CONFERENCE ON GREEN INFORMATICS (ICGI), 2017, : 185 - 190
  • [27] Coarse-to-Fine Imitation Learning: Robot Manipulation from a Single Demonstration
    Johns, Edward
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4613 - 4619
  • [28] Coarse-to-Fine Document Image Registration for Dewarping
    Zhang, Weiguang
    Wang, Qiufeng
    Huang, Kaizhu
    Gu, Xiaomeng
    Guo, Fengjun
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT IV, 2024, 14807 : 343 - 358
  • [29] Coarse-to-Fine Implicit Representation Learning for 3D Hand-Object Reconstruction from a Single RGB-D Image
    Liu, Xingyu
    Ren, Pengfei
    Wang, Jingyu
    Qi, Qi
    Sun, Haifeng
    Zhuang, Zirui
    Liao, Jianxin
    COMPUTER VISION - ECCV 2024, PT LI, 2025, 15109 : 74 - 92
  • [30] Coarse-to-fine Geometric and Photometric Image Registration
    Xu, Jieping
    Liu, Jin
    Huang, Zongfu
    Liang, Yonghui
    AOPC 2017: OPTICAL SENSING AND IMAGING TECHNOLOGY AND APPLICATIONS, 2017, 10462