Coarse-to-Fine: Learning Compact Discriminative Representation for Single-Stage Image Retrieval

被引：3

作者：

Zhu, Yunquan ^{[1
]}

Gao, Xinkai ^{[1
]}

Ke, Bo ^{[1
]}

Qiao, Ruizhi ^{[1
]}

Sun, Xing ^{[1
]}

机构：

[1] Tencent, YouTu Lab, Shenzhen, Peoples R China

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

关键词：

DESCRIPTORS; MODEL;

D O I：

10.1109/ICCV51070.2023.01034

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image retrieval targets to find images from a database that are visually similar to the query image. Two-stage methods following retrieve-and-rerank paradigm have achieved excellent performance, but their separate local and global modules are inefficient to real-world applications. To better trade-off retrieval efficiency and accuracy, some approaches fuse global and local feature into a joint representation to perform single-stage image retrieval. However, they are still challenging due to various situations to tackle, e.g., background, occlusion and viewpoint. In this work, we design a Coarse-to-Fine framework to learn Compact Discriminative representation (CFCD) for end-to-end single- stage image retrieval-requiring only imagelevel labels. Specifically, we first design a novel adaptive softmax-based loss which dynamically tunes its scale and margin within each mini-batch and increases them progressively to strengthen supervision during training and intraclass compactness. Furthermore, we propose a mechanism which attentively selects prominent local descriptors and infuse fine-grained semantic relations into the global representation by a hard negative sampling strategy to optimize inter-class distinctiveness at a global scale. Extensive experimental results have demonstrated the effectiveness of our method, which achieves state-of-the-art single-stage image retrieval performance on benchmarks such as Revisited Oxford and Revisited Paris. Code is available at https://github.com/bassyess/CFCD.

引用

页码：11226 / 11235

页数：10

共 50 条

[21] Coarse-to-Fine Image DeHashing Using Deep Pyramidal Residual Learning
Wang, Yongwei
Ward, Rabab
Wang, Z. Jane
IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (09) : 1295 - 1299
[22] Context-aware coarse-to-fine network for single image desnowing
Cheng, Yunrui
Ren, Hao
Zhang, Rui
Lu, Hong
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 55903 - 55920
[23] Zero-shot visual grounding via coarse-to-fine representation learning
Mi, Jinpeng
Jin, Shaofei
Chen, Zhiqian
Liu, Dan
Wei, Xian
Zhang, Jianwei
NEUROCOMPUTING, 2024, 610
[24] Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search
Huang, Wenxin
Jia, Xuemei
Zhong, Xian
Wang, Xiao
Jiang, Kui
Wang, Zheng
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
[25] Two-stage coarse-to-fine image anomaly segmentation and detection model
Shah, Rizwan Ali
Urmonov, Odilbek
Kim, Hyungwon
IMAGE AND VISION COMPUTING, 2023, 139
[26] A Coarse-to-fine Method for Near-duplicate Image Retrieval with Matching Probability Model
Shi, Jingshu
Ma, Yong
Zhou, Huabing
2017 INTERNATIONAL CONFERENCE ON GREEN INFORMATICS (ICGI), 2017, : 185 - 190
[27] Coarse-to-Fine Imitation Learning: Robot Manipulation from a Single Demonstration
Johns, Edward
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4613 - 4619
[28] Coarse-to-Fine Document Image Registration for Dewarping
Zhang, Weiguang
Wang, Qiufeng
Huang, Kaizhu
Gu, Xiaomeng
Guo, Fengjun
DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT IV, 2024, 14807 : 343 - 358
[29] Coarse-to-Fine Implicit Representation Learning for 3D Hand-Object Reconstruction from a Single RGB-D Image
Liu, Xingyu
Ren, Pengfei
Wang, Jingyu
Qi, Qi
Sun, Haifeng
Zhuang, Zirui
Liao, Jianxin
COMPUTER VISION - ECCV 2024, PT LI, 2025, 15109 : 74 - 92
[30] Coarse-to-fine Geometric and Photometric Image Registration
Xu, Jieping
Liu, Jin
Huang, Zongfu
Liang, Yonghui
AOPC 2017: OPTICAL SENSING AND IMAGING TECHNOLOGY AND APPLICATIONS, 2017, 10462

← 1 2 3 4 5 →