SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object Counting

被引：0

作者：

Zgaren, Ahmed ^{[1
,2
]}

Bouachir, Wassim ^{[2
]}

Bouguila, Nizar ^{[1
]}

机构：

[1] Concordia Univ, Concordia Inst Informat & Syst Engn CIISE, Montreal, PQ H3G 1M8, Canada

[2] Univ Quebec TELUQ, Data Sci Lab, Montreal, PQ H2S 3L5, Canada

来源：

JOURNAL OF IMAGING | 2025年 / 11卷 / 02期

基金：

加拿大自然科学与工程研究理事会;

关键词：

object counting; transformers; visual attention; zero-shot; class-agnostic;

D O I：

10.3390/jimaging11020052

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Zero-shot counting is a subcategory of Generic Visual Object Counting, which aims to count objects from an arbitrary class in a given image. While few-shot counting relies on delivering exemplars to the model to count similar class objects, zero-shot counting automates the operation for faster processing. This paper proposes a fully automated zero-shot method outperforming both zero-shot and few-shot methods. By exploiting feature maps from a pre-trained detection-based backbone, we introduce a new Visual Embedding Module designed to generate semantic embeddings within object contextual information. These embeddings are then fed to a Self-Attention Matching Module to generate an encoded representation for the head counter. Our proposed method has outperformed recent zero-shot approaches, achieving the best Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) results of 8.89 and 35.83, respectively, on the FSC147 dataset. Additionally, our method demonstrates competitive performance compared to few-shot methods, advancing the capabilities of visual object counting in various industrial applications such as tree counting, wildlife animal counting, and medical applications like blood cell counting.

引用

页数：21

共 50 条

[1] Zero-Shot Object Counting
Xu, Jingyi
Le, Hieu
Nguyen, Vu
Ranjan, Viresh
Samaras, Dimitris
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15548 - 15557
[2] Zero-Shot Object Counting with Good Exemplars
Zhu, Huilin
Yuan, Jingling
Yang, Zhengwei
Guo, Yu
Wang, Zheng
Zhong, Xian
He, Shengfeng
COMPUTER VISION - ECCV 2024, PT V, 2025, 15063 : 368 - 385
[3] LANGUAGE-GUIDED ZERO-SHOT OBJECT COUNTING
Wang, Mingjie
Yuan, Song
Li, Zhuohang
Zhu, Longlong
Buys, Eric
Gong, Minglun
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
[4] Zero-Shot Object Goal Visual Navigation
Zhao, Qianfan
Zhang, Lu
He, Bin
Qiao, Hong
Liu, Zhiyong
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2025 - 2031
[5] Hyperbolic Visual Embedding Learning for Zero-Shot Recognition
Liu, Shaoteng
Chen, Jingjing
Pan, Liangming
Ngo, Chong-Wah
Chua, Tat-Seng
Jiang, Yu-Gang
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9270 - 9278
[6] Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech
Yoon, Hyungchan
Kim, Changhwan
Song, Eunwoo
Yoon, Hyun-Wook
Kang, Hong-Goo
INTERSPEECH 2023, 2023, : 4299 - 4303
[7] Adaptive adjustment with semantic embedding for zero-shot object detection
Lv, Wen
Shi, Hongbo
Tan, Shuai
Song, Bing
Tao, Yang
JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
[8] Hierarchical-Dynamic Embedding for Zero-shot Object Recognition
Han, Xuebo
Li, Kan
PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 520 - 525
[9] Zero-Shot Object Detection via Learning an Embedding from Semantic Space to Visual Space
Zhang, Licheng
Wang, Xianzhi
Yao, Lina
Wu, Lin
Zheng, Feng
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 906 - 912
[10] Zero-Shot Visual Recognition via Bidirectional Latent Embedding
Qian Wang
Ke Chen
International Journal of Computer Vision, 2017, 124 : 356 - 383

← 1 2 3 4 5 →