A Multi-Group Multi-Stream attribute Attention network for fine-grained zero-shot learning

被引:1
|
作者
Song, Lingyun [1 ,2 ]
Shang, Xuequn [1 ,2 ]
Zhou, Ruizhi [1 ,2 ]
Liu, Jun [3 ]
Ma, Jie [3 ]
Li, Zhanhuai [1 ,2 ]
Sun, Mingxuan [4 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] Northwestern Polytech Univ, Key Lab Big Data Storage & Management, Minist Ind & Informat Technol, Xian 710129, Peoples R China
[3] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, SPKLSTN Lab, Xian 710049, Peoples R China
[4] Louisiana State Univ, Sch Elect Engn & Comp Sci, Div Comp Sci & Engn, Baton Rouge, LA 70803 USA
关键词
Fine-grained classification; Convolutional neural network; Attribute prediction; Zero-shot learning;
D O I
10.1016/j.neunet.2024.106558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization in zero-shot setting is a challenging problem in the computer vision community. It requires algorithms to accurately identify fine-grained categories that do not appear during the training phase and have high visual similarity to each other. Existing methods usually address this problem by using attribute information as intermediate knowledge, which provides sufficient fine-grained characteristics of categories and can be transferred from seen categories to unseen categories. However, the learning of attribute visual features is not trivial due to the following two reasons: (i) The visual information about attributes of different types may interfere with the visual feature learning of each other. (ii) The visual characteristics of the same attribute may vary in different categories. To solve these issues, we propose a Multi-Group Multi- Stream attribute Attention network (MGMSA), which not only separates the feature learning of attributes of different types, but also isolates the learning of attribute visual features for categories with big differences in attribute appearance. This avoids the interference between uncorrelated attributes and helps to learn category- specific attribute-related visual features. This is beneficial for distinguishing fine-grained categories with subtle visual differences. Extensive experiments on benchmark datasets show that MGMSA achieves state-of-the-art performance on attribute prediction and fine-grained zero-shot learning.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Fine-grained recognition of diseases and pests based on multi-stream Gaussian probability fusion network
    Kong J.
    Jin X.
    Tao Z.
    Wang X.
    Lin S.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2020, 36 (13): : 148 - 157
  • [22] A Multi-Stream Bi-Directional Recurrent Neural Network for Fine-Grained Action Detection
    Singh, Bharat
    Marks, Tim K.
    Jones, Michael
    Tuzel, Oncel
    Shao, Ming
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1961 - 1970
  • [23] LEARNING SIMILARITY-SPECIFIC DICTIONARY FOR ZERO-SHOT FINE-GRAINED RECOGNITION
    Chen, Hong
    Cao, Liujuan
    Ji, Rongrong
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3697 - 3701
  • [24] Attribute Attention for Semantic Disambiguation in Zero-Shot Learning
    Liu, Yang
    Guo, Jishun
    Cai, Deng
    He, Xiaofei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6697 - 6706
  • [25] Relational Proxies: Fine-Grained Relationships as Zero-Shot Discriminators
    Chaudhuri, Abhra
    Mancini, Massimiliano
    Akata, Zeynep
    Dutta, Anjan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8652 - 8664
  • [26] Fine-Grained Representation Alignment for Zero-Shot Domain Adaptation
    Liu, Yabo
    Wang, Jinghua
    Zhong, Shenghua
    Ma, Lianyang
    Xu, Yong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6285 - 6296
  • [27] Attribute-Guided Multi-Level Attention Network for Fine-Grained Fashion Retrieval
    Xiao, Ling
    Yamasaki, Toshihiko
    IEEE ACCESS, 2024, 12 (48068-48080) : 48068 - 48080
  • [28] Webly Supervised Learning Meets Zero-shot Learning: A Hybrid Approach for Fine-grained Classification
    Niu, Li
    Veeraraghavan, Ashok
    Sabharwal, Ashu
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7171 - 7180
  • [29] Multi-attention Meta Learning for Few-shot Fine-grained Image Recognition
    Zhu, Yaohui
    Liu, Chenlong
    Jiang, Shuqiang
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1090 - 1096
  • [30] Fine-Grained Age Estimation With Multi-Attention Network
    Hu, Chunlong
    Gao, Junbin
    Chen, Jianjun
    Jiang, Dengbiao
    Shu, Yucheng
    IEEE ACCESS, 2020, 8 : 196013 - 196023