An Attention-Based Architecture for Hierarchical Classification With CNNs

被引:5
|
作者
Pizarro, Ivan [1 ]
Nanculef, Ricardo [1 ]
Valle, Carlos [2 ]
机构
[1] Univ Tecn Federico Santa Maria, Dept Informat, Valparaiso 2390123, Chile
[2] Univ Playa Ancha, Dept Data Sci & Informat, Valparaiso 2360072, Chile
关键词
Taxonomy; Measurement; Computer architecture; Training; Convolutional neural networks; Classification algorithms; Predictive models; Attention mechanisms; deep learning; hierarchical classification; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2023.3263472
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Branch Convolutional Neural Nets have become a popular approach for hierarchical classification in computer vision and other areas. Unfortunately, these models often led to hierarchical inconsistency: predictions for the different hierarchy levels do not necessarily respect the class-subclass constraints imposed by the hierarchy. Several architectures to connect the branches have arisen to overcome this limitation. In this paper, we propose a more straightforward and flexible method: let the neural net decide how these branches must be connected. We achieve this by formulating an attention mechanism that dynamically determines how branches influence each other during training and inference. Experiments on image classification benchmarks show that the proposed method can outperform state-of-the-art models in terms of hierarchical performance metrics and consistency. Furthermore, although sometimes we found a slightly lower performance at the deeper level of the hierarchy, the model predicts much more accurately the ground-truth path between a concept and its ancestors in the hierarchy. This result suggests that the model does learn not only local class memberships but also hierarchical dependencies between concepts.
引用
收藏
页码:32972 / 32995
页数:24
相关论文
共 50 条
  • [21] Attention-based hierarchical denoised deep clustering network
    Yongfeng Dong
    Ziqiu Wang
    Jiapeng Du
    Weidong Fang
    Linhao Li
    World Wide Web, 2023, 26 : 441 - 459
  • [22] Hierarchical Attention-Based Age Estimation and Bias Analysis
    Hiba, Shakediel
    Keller, Yosi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14682 - 14692
  • [23] Hierarchical attention-based multimodal fusion for video captioning
    Wu, Chunlei
    Wei, Yiwei
    Chu, Xiaoliang
    Weichen, Sun
    Su, Fei
    Wang, Leiquan
    NEUROCOMPUTING, 2018, 315 : 362 - 370
  • [24] A hierarchical contextual attention-based network for sequential recommendation
    Cui, Qiang
    Wu, Shu
    Huang, Yan
    Wang, Liang
    NEUROCOMPUTING, 2019, 358 : 141 - 149
  • [25] A hierarchical attention-based neural network architecture, based on human brain guidance, for perception, conceptualisation, action and reasoning
    Taylor, J. G.
    Hartley, M.
    Taylor, N.
    Panchev, C.
    Kasderidis, S.
    IMAGE AND VISION COMPUTING, 2009, 27 (11) : 1641 - 1657
  • [26] Triplet attention-based deep learning model for hierarchical image classification of household items for robotic applications
    Bhayana, Divya Arora
    Verma, Om Prakash
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 489 - 498
  • [27] HiViT: Hierarchical attention-based Transformer for multi-scale whole slide histopathological image classification
    Yu, Jinze
    Li, Shuo
    Tan, Luxin
    Zhou, Haoyi
    Li, Zhongwu
    Li, Jianxin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
  • [28] Relation Extraction via Attention-Based CNNs using Token-Level Representations
    Wang, Yan
    Xin, Xin
    Guo, Ping
    2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 113 - 117
  • [29] Attention-based Convolutional Neural Networks for Sentence Classification
    Zhao, Zhiwei
    Wu, Youzheng
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 705 - 709
  • [30] An Attention-Based Lattice Network for Hyperspectral Image Classification
    Nikzad, Mohammad
    Gao, Yongsheng
    Zhou, Jun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60