SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object Counting

被引：0

作者：

Zgaren, Ahmed ^{[1
,2
]}

Bouachir, Wassim ^{[2
]}

Bouguila, Nizar ^{[1
]}

机构：

[1] Concordia Univ, Concordia Inst Informat & Syst Engn CIISE, Montreal, PQ H3G 1M8, Canada

[2] Univ Quebec TELUQ, Data Sci Lab, Montreal, PQ H2S 3L5, Canada

来源：

JOURNAL OF IMAGING | 2025年 / 11卷 / 02期

基金：

加拿大自然科学与工程研究理事会;

关键词：

object counting; transformers; visual attention; zero-shot; class-agnostic;

D O I：

10.3390/jimaging11020052

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Zero-shot counting is a subcategory of Generic Visual Object Counting, which aims to count objects from an arbitrary class in a given image. While few-shot counting relies on delivering exemplars to the model to count similar class objects, zero-shot counting automates the operation for faster processing. This paper proposes a fully automated zero-shot method outperforming both zero-shot and few-shot methods. By exploiting feature maps from a pre-trained detection-based backbone, we introduce a new Visual Embedding Module designed to generate semantic embeddings within object contextual information. These embeddings are then fed to a Self-Attention Matching Module to generate an encoded representation for the head counter. Our proposed method has outperformed recent zero-shot approaches, achieving the best Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) results of 8.89 and 35.83, respectively, on the FSC147 dataset. Additionally, our method demonstrates competitive performance compared to few-shot methods, advancing the capabilities of visual object counting in various industrial applications such as tree counting, wildlife animal counting, and medical applications like blood cell counting.

引用

页数：21

共 50 条

[21] Manifold embedding for zero-shot recognition
Ji, Zhong
Yu, Xuejie
Yu, Yunlong
He, Yuqing
COGNITIVE SYSTEMS RESEARCH, 2019, 55 : 34 - 43
[22] Visual-guided attentive attributes embedding for zero-shot learning
Zhang, Rui
Zhu, Qi
Xu, Xiangyu
Zhang, Daoqiang
Huang, Sheng-Jun
NEURAL NETWORKS, 2021, 143 : 709 - 718
[23] Zero-Shot Visual Imitation
Pathak, Deepak
Mahmoudieh, Parsa
Luo, Guanghao
Agrawal, Pulkit
Chen, Dian
Shentu, Fred
Shelhamer, Evan
Malik, Jitendra
Efros, Alexei A.
Darrell, Trevor
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2131 - 2134
[24] Zero-shot Object Detection Through Vision-Language Embedding Alignment
Xie, Johnathan
Zheng, Shuai
2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 926 - 940
[25] Zero-Shot Object Counting With Vision-Language Prior Guidance Network
Zhai, Wenzhe
Xing, Xianglei
Gao, Mingliang
Li, Qilei
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2487 - 2498
[26] Enhancing Zero-Shot Many to Many Voice Conversion via Self-Attention VAE with Structurally Regularized Layers
Long, Ziang
Zheng, Yunling
Yu, Meng
Xin, Jack
2022 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE FOR INDUSTRIES, AI4I, 2022, : 59 - 63
[27] Self-Attention Generative Distribution Adversarial Network for Few- and Zero-Shot Face Anti-Spoofing
Son Minh Nguyen
Linh Duy Tran
Le, Duc Viet
Masayuki, Arai
2022 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB), 2022,
[28] Attribute-Based Classification for Zero-Shot Visual Object Categorization
Lampert, Christoph H.
Nickisch, Hannes
Harmeling, Stefan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) : 453 - 465
[29] Improved Visual-Semantic Alignment for Zero-Shot Object Detection
Rahman, Shafin
Khan, Salman
Barnes, Nick
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11932 - 11939
[30] Semantic Policy Network for Zero-Shot Object Goal Visual Navigation
Zhao, Qianfan
Zhang, Lu
He, Bin
Liu, Zhiyong
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (11) : 7655 - 7662

← 1 2 3 4 5 →