Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Cited by: 0
Authors
Behera, Ardhendu [1]
Wharton, Zachary [1]
Hewage, Pradeep R. P. G. [1]
Bera, Asish [1]
Affiliations
[1] Edge Hill Univ, Dept Comp Sci, St Helen Rd, Ormskirk L39 4QP, Lancs, England
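Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract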
Deep convolutional neural networks (CNNs) have shown a strong ability to mine discriminative object pose and part information for image recognition. For fine-grained recognition, a rich, context-aware feature representation of the object or scene plays a key role, since objects exhibit significant variance within the same subcategory and only subtle variance among different subcategories. Finding the subtle variance that fully characterizes the object or scene is not straightforward. To address this, we propose a novel context-aware attentional pooling (CAP) that effectively captures subtle changes via sub-pixel gradients, and learns to attend to informative integral regions and their importance in discriminating different subcategories without requiring bounding-box and/or distinguishable part annotations. We also introduce a novel feature encoding that considers the intrinsic consistency between the informativeness of the integral regions and their spatial structures to capture the semantic correlation among them. Our approach is simple yet extremely effective and can be easily applied on top of a standard classification backbone network. We evaluate our approach using six state-of-the-art (SotA) backbone networks and eight benchmark datasets. Our method significantly outperforms the SotA approaches on six datasets and is very competitive on the remaining two.
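The idea of weighting regions by their importance before pooling can be illustrated with a generic softmax attention pooling step. The sketch below is not the CAP method from the paper: the function name `attentional_pool`, the `query` vector, and the toy region features are all hypothetical, and the scoring rule (scaled dot product) is a common stand-in chosen only for illustration.

```python
import math

def attentional_pool(regions, query):
    """Pool region feature vectors into one vector using softmax attention.

    regions: list of R feature vectors, each of length D (hypothetical inputs).
    query:   a length-D vector scoring how informative each region is
             (an assumption for this sketch, not taken from the paper).
    Returns (pooled, weights): a length-D vector and R attention weights.
    """
    dim = len(query)
    # Scaled dot-product score for each region.
    scores = [
        sum(r_i * q_i for r_i, q_i in zip(region, query)) / math.sqrt(dim)
        for region in regions
    ]
    # Numerically stable softmax over the region scores.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Attention-weighted sum of the region features.
    pooled = [
        sum(w * region[d] for w, region in zip(weights, regions))
        for d in range(dim)
    ]
    return pooled, weights

# Toy example: three 2-D region features and a query that favors the third.
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 2.0]
pooled, weights = attentional_pool(regions, query)
```

With an all-zero query every region receives the same weight, so the result reduces to plain average pooling; a learned query (or a learned scoring network) is what lets the pooling emphasize some regions over others.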
Pages: 929-937
Number of pages: 9