An Attention-Based Architecture for Hierarchical Classification With CNNs

Cited by: 5
Authors
Pizarro, Ivan [1]
Nanculef, Ricardo [1]
Valle, Carlos [2]
Affiliations
[1] Univ Tecn Federico Santa Maria, Dept Informat, Valparaiso 2390123, Chile
[2] Univ Playa Ancha, Dept Data Sci & Informat, Valparaiso 2360072, Chile
Keywords
Taxonomy; Measurement; Computer architecture; Training; Convolutional neural networks; Classification algorithms; Predictive models; Attention mechanisms; deep learning; hierarchical classification; CONVOLUTIONAL NEURAL-NETWORKS
DOI
10.1109/ACCESS.2023.3263472
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Branch Convolutional Neural Nets have become a popular approach for hierarchical classification in computer vision and other areas. Unfortunately, these models often lead to hierarchical inconsistency: predictions for the different hierarchy levels do not necessarily respect the class-subclass constraints imposed by the hierarchy. Several architectures that connect the branches have arisen to overcome this limitation. In this paper, we propose a more straightforward and flexible method: let the neural net decide how these branches must be connected. We achieve this by formulating an attention mechanism that dynamically determines how branches influence each other during training and inference. Experiments on image classification benchmarks show that the proposed method can outperform state-of-the-art models in terms of hierarchical performance metrics and consistency. Furthermore, although we sometimes found slightly lower performance at the deeper level of the hierarchy, the model predicts the ground-truth path between a concept and its ancestors in the hierarchy much more accurately. This result suggests that the model learns not only local class memberships but also hierarchical dependencies between concepts.
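The notion of hierarchical consistency mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' code or their exact metric definitions: the two-level taxonomy `PARENT`, the sample predictions, and the helper names `is_consistent`, `consistency_rate`, and `exact_path_rate` are all hypothetical, chosen only to show what it means for per-level predictions to respect class-subclass constraints and to match the ground-truth path.

```python
from typing import Dict, Sequence

# Hypothetical two-level taxonomy: each fine class maps to its coarse parent.
PARENT: Dict[str, str] = {
    "cat": "animal",
    "dog": "animal",
    "car": "vehicle",
    "truck": "vehicle",
}


def is_consistent(path: Sequence[str], parent: Dict[str, str]) -> bool:
    """True if every label in the path is the parent of the label that follows.

    `path` is ordered from the coarsest level to the finest level.
    """
    return all(parent.get(child) == ancestor
               for ancestor, child in zip(path, path[1:]))


def consistency_rate(paths: Sequence[Sequence[str]],
                     parent: Dict[str, str]) -> float:
    """Fraction of predicted paths that respect the class-subclass constraints."""
    return sum(is_consistent(p, parent) for p in paths) / len(paths)


def exact_path_rate(predicted: Sequence[Sequence[str]],
                    truth: Sequence[Sequence[str]]) -> float:
    """Fraction of samples whose whole predicted path equals the true path."""
    hits = sum(tuple(p) == tuple(t) for p, t in zip(predicted, truth))
    return hits / len(truth)


# Made-up per-level predictions (coarse label, fine label) for four images.
predicted = [("animal", "cat"), ("vehicle", "dog"),
             ("vehicle", "truck"), ("animal", "dog")]
truth = [("animal", "cat"), ("animal", "dog"),
         ("vehicle", "car"), ("animal", "dog")]

print(consistency_rate(predicted, PARENT))  # 0.75
print(exact_path_rate(predicted, truth))    # 0.5
```

In this toy data the second prediction pairs "vehicle" with "dog", which violates the taxonomy, so it counts against consistency. The third prediction is consistent but wrong at the fine level, so it counts only against the exact-path rate.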
Pages: 32972-32995
Page count: 24