SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object Counting

被引：0

作者：

Zgaren, Ahmed ^{[1
,2
]}

Bouachir, Wassim ^{[2
]}

Bouguila, Nizar ^{[1
]}

机构：

[1] Concordia Univ, Concordia Inst Informat & Syst Engn CIISE, Montreal, PQ H3G 1M8, Canada

[2] Univ Quebec TELUQ, Data Sci Lab, Montreal, PQ H2S 3L5, Canada

来源：

JOURNAL OF IMAGING | 2025年 / 11卷 / 02期

基金：

加拿大自然科学与工程研究理事会;

关键词：

object counting; transformers; visual attention; zero-shot; class-agnostic;

D O I：

10.3390/jimaging11020052

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Zero-shot counting is a subcategory of Generic Visual Object Counting, which aims to count objects from an arbitrary class in a given image. While few-shot counting relies on delivering exemplars to the model to count similar class objects, zero-shot counting automates the operation for faster processing. This paper proposes a fully automated zero-shot method outperforming both zero-shot and few-shot methods. By exploiting feature maps from a pre-trained detection-based backbone, we introduce a new Visual Embedding Module designed to generate semantic embeddings within object contextual information. These embeddings are then fed to a Self-Attention Matching Module to generate an encoded representation for the head counter. Our proposed method has outperformed recent zero-shot approaches, achieving the best Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) results of 8.89 and 35.83, respectively, on the FSC147 dataset. Additionally, our method demonstrates competitive performance compared to few-shot methods, advancing the capabilities of visual object counting in various industrial applications such as tree counting, wildlife animal counting, and medical applications like blood cell counting.

引用

页数：21

共 50 条

[31] ZERO-SHOT OBJECT DETECTION WITH TRANSFORMERS
Zheng, Ye
Cui, Li
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 444 - 448
[32] A Survey of Zero-Shot Object Detection
Cao, Weipeng
Yao, Xuyang
Xu, Zhiwu
Liu, Ye
Pan, Yinghui
Ming, Zhong
BIG DATA MINING AND ANALYTICS, 2025, 8 (03): : 726 - 750
[33] Zero-Shot Camouflaged Object Detection
Li, Haoran
Feng, Chun-Mei
Xu, Yong
Zhou, Tao
Yao, Lina
Chang, Xiaojun
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5126 - 5137
[34] Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval
Ueki, Kazuya
20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 628 - 634
[35] Learning visual-and-semantic knowledge embedding for zero-shot image classification
Dehui Kong
Xiliang Li
Shaofan Wang
Jinghua Li
Baocai Yin
Applied Intelligence, 2023, 53 : 2250 - 2264
[36] Learning visual-and-semantic knowledge embedding for zero-shot image classification
Kong, Dehui
Li, Xiliang
Wang, Shaofan
Li, Jinghua
Yin, Baocai
APPLIED INTELLIGENCE, 2023, 53 (02) : 2250 - 2264
[37] Contrastive Embedding for Generalized Zero-Shot Learning
Han, Zongyan
Fu, Zhenyong
Chen, Shuo
Yang, Jian
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2371 - 2381
[38] Spatiotemporal visual-semantic embedding network for zero-shot action recognition
An, Rongqiao
Miao, Zhenjiang
Li, Qingyu
Xu, Wanru
Zhang, Qiang
JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (02)
[39] Transductive Unbiased Embedding for Zero-Shot Learning
Song, Jie
Shen, Chengchao
Yang, Yezhou
Liu, Yang
Song, Mingli
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1024 - 1033
[40] TDANet: Target-Directed Attention Network for Object-Goal Visual Navigation With Zero-Shot Ability
Lian, Shiwei
Zhang, Feitian
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 8075 - 8082

← 1 2 3 4 5 →