SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object Counting

Cited by: 0
Authors
Zgaren, Ahmed [1, 2]
Bouachir, Wassim [2 ]
Bouguila, Nizar [1 ]
Affiliations
[1] Concordia Univ, Concordia Inst Informat & Syst Engn CIISE, Montreal, PQ H3G 1M8, Canada
[2] Univ Quebec TELUQ, Data Sci Lab, Montreal, PQ H2S 3L5, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
object counting; transformers; visual attention; zero-shot; class-agnostic;
DOI
10.3390/jimaging11020052
Chinese Library Classification number
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
Zero-shot counting is a subcategory of Generic Visual Object Counting, which aims to count objects from an arbitrary class in a given image. While few-shot counting relies on delivering exemplars to the model to count similar class objects, zero-shot counting automates the operation for faster processing. This paper proposes a fully automated zero-shot method outperforming both zero-shot and few-shot methods. By exploiting feature maps from a pre-trained detection-based backbone, we introduce a new Visual Embedding Module designed to generate semantic embeddings within object contextual information. These embeddings are then fed to a Self-Attention Matching Module to generate an encoded representation for the head counter. Our proposed method has outperformed recent zero-shot approaches, achieving the best Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) results of 8.89 and 35.83, respectively, on the FSC147 dataset. Additionally, our method demonstrates competitive performance compared to few-shot methods, advancing the capabilities of visual object counting in various industrial applications such as tree counting, wildlife animal counting, and medical applications like blood cell counting.
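The two metrics reported in the abstract, MAE and RMSE, can be illustrated with a minimal sketch. It assumes the standard definitions applied to per-image predicted and ground-truth counts. The function names and the sample counts are hypothetical and are not taken from the paper or from FSC147.

```python
import math


def mean_absolute_error(predicted, actual):
    """Average absolute difference between predicted and true counts."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(predicted)


def root_mean_square_error(predicted, actual):
    """Square root of the average squared count difference."""
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((p - a) ** 2 for p, a in zip(predicted, actual))
    return math.sqrt(squared / len(predicted))


# Hypothetical per-image counts, for illustration only.
predicted_counts = [10, 20, 30]
true_counts = [12, 20, 27]

mae = mean_absolute_error(predicted_counts, true_counts)
rmse = root_mean_square_error(predicted_counts, true_counts)
```

Because RMSE squares each error before averaging, a few images with large miscounts raise it far more than they raise MAE, which is one plausible reading of the gap between the reported 8.89 and 35.83.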
Pages: 21
Related papers
50 in total
  • [21] Manifold embedding for zero-shot recognition
    Ji, Zhong
    Yu, Xuejie
    Yu, Yunlong
    He, Yuqing
    COGNITIVE SYSTEMS RESEARCH, 2019, 55 : 34 - 43
  • [22] Visual-guided attentive attributes embedding for zero-shot learning
    Zhang, Rui
    Zhu, Qi
    Xu, Xiangyu
    Zhang, Daoqiang
    Huang, Sheng-Jun
    NEURAL NETWORKS, 2021, 143 : 709 - 718
  • [23] Zero-Shot Visual Imitation
    Pathak, Deepak
    Mahmoudieh, Parsa
    Luo, Guanghao
    Agrawal, Pulkit
    Chen, Dian
    Shentu, Fred
    Shelhamer, Evan
    Malik, Jitendra
    Efros, Alexei A.
    Darrell, Trevor
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2131 - 2134
  • [24] Zero-shot Object Detection Through Vision-Language Embedding Alignment
    Xie, Johnathan
    Zheng, Shuai
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW, 2022, : 926 - 940
  • [25] Zero-Shot Object Counting With Vision-Language Prior Guidance Network
    Zhai, Wenzhe
    Xing, Xianglei
    Gao, Mingliang
    Li, Qilei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2487 - 2498
  • [26] Enhancing Zero-Shot Many to Many Voice Conversion via Self-Attention VAE with Structurally Regularized Layers
    Long, Ziang
    Zheng, Yunling
    Yu, Meng
    Xin, Jack
    2022 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE FOR INDUSTRIES, AI4I, 2022, : 59 - 63
  • [27] Self-Attention Generative Distribution Adversarial Network for Few- and Zero-Shot Face Anti-Spoofing
    Son Minh Nguyen
    Linh Duy Tran
    Le, Duc Viet
    Masayuki, Arai
    2022 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB), 2022,
  • [28] Attribute-Based Classification for Zero-Shot Visual Object Categorization
    Lampert, Christoph H.
    Nickisch, Hannes
    Harmeling, Stefan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) : 453 - 465
  • [29] Improved Visual-Semantic Alignment for Zero-Shot Object Detection
    Rahman, Shafin
    Khan, Salman
    Barnes, Nick
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11932 - 11939
  • [30] Semantic Policy Network for Zero-Shot Object Goal Visual Navigation
    Zhao, Qianfan
    Zhang, Lu
    He, Bin
    Liu, Zhiyong
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (11) : 7655 - 7662