A Multi-Group Multi-Stream attribute Attention network for fine-grained zero-shot learning

Cited by: 1
Authors
Song, Lingyun [1, 2]
Shang, Xuequn [1, 2]
Zhou, Ruizhi [1, 2]
Liu, Jun [3]
Ma, Jie [3]
Li, Zhanhuai [1, 2]
Sun, Mingxuan [4]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] Northwestern Polytech Univ, Key Lab Big Data Storage & Management, Minist Ind & Informat Technol, Xian 710129, Peoples R China
[3] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, SPKLSTN Lab, Xian 710049, Peoples R China
[4] Louisiana State Univ, Sch Elect Engn & Comp Sci, Div Comp Sci & Engn, Baton Rouge, LA 70803 USA
Keywords
Fine-grained classification; Convolutional neural network; Attribute prediction; Zero-shot learning
DOI
10.1016/j.neunet.2024.106558
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained visual categorization in the zero-shot setting is a challenging problem in the computer vision community. It requires algorithms to accurately identify fine-grained categories that do not appear during the training phase and are visually very similar to each other. Existing methods usually address this problem by using attribute information as intermediate knowledge, which provides sufficient fine-grained characteristics of categories and can be transferred from seen categories to unseen categories. However, learning attribute visual features is not trivial, for two reasons: (i) the visual information about attributes of different types may interfere with each other's visual feature learning; (ii) the visual characteristics of the same attribute may vary across categories. To solve these issues, we propose a Multi-Group Multi-Stream attribute Attention network (MGMSA), which not only separates the feature learning of attributes of different types, but also isolates the learning of attribute visual features for categories with large differences in attribute appearance. This avoids interference between uncorrelated attributes and helps to learn category-specific attribute-related visual features, which is beneficial for distinguishing fine-grained categories with subtle visual differences. Extensive experiments on benchmark datasets show that MGMSA achieves state-of-the-art performance on attribute prediction and fine-grained zero-shot learning.
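The abstract describes attributes as intermediate knowledge that transfers from seen to unseen categories. The snippet below is a minimal sketch of that general idea only, not of MGMSA itself: it assumes an attribute predictor has already produced per-attribute scores for an image, and it assigns the image to the unseen class whose attribute signature is most similar under cosine similarity. The function names, attribute meanings, class names, and all numbers are hypothetical and chosen purely for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def classify_by_attributes(predicted_attributes, class_signatures):
    """Return the class whose attribute signature best matches the predicted attributes."""
    return max(
        class_signatures,
        key=lambda name: cosine(predicted_attributes, class_signatures[name]),
    )


# Hypothetical attribute order: [red head, long beak, striped wings].
# These classes are "unseen": no training images, only attribute signatures.
unseen_class_signatures = {
    "class_a": [1.0, 0.0, 1.0],
    "class_b": [0.0, 1.0, 1.0],
}

# Hypothetical output of an attribute predictor trained on seen classes.
predicted = [0.9, 0.2, 0.7]
label = classify_by_attributes(predicted, unseen_class_signatures)
print(label)
```

In this toy setup the quality of the final label depends entirely on the attribute scores, which is why the abstract focuses on learning attribute visual features without interference between attribute types or categories.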
Pages: 17
Related papers
50 in total
  • [31] Attribute Propagation Network for Graph Zero-Shot Learning
    Liu, Lu
    Zhou, Tianyi
    Long, Guodong
    Jiang, Jing
    Zhang, Chengqi
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4868 - 4875
  • [32] A Shared Multi-Attention Framework for Multi-Label Zero-Shot Learning
    Huynh, Dat
    Elhamifar, Ehsan
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 8773 - 8783
  • [33] Description-Based Zero-shot Fine-Grained Entity Typing
    Obeidat, Rasha
    Fern, Xiaoli
    Shahbazi, Hamed
    Tadepalli, Prasad
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 807 - 814
  • [34] Fine-Grained Feature Generation for Generalized Zero-Shot Video Classification
    Hong, Mingyao
    Zhang, Xinfeng
    Li, Guorong
    Huang, Qingming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1599 - 1612
  • [35] Multi-Interactive Attention Network for Fine-grained Feature Learning in CTR Prediction
    Zhang, Kai
    Qian, Hao
    Cui, Qing
    Liu, Qi
    Li, Longfei
    Zhou, Jun
    Ma, Jianhui
    Chen, Enhong
    WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 984 - 992
  • [36] Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition
    Zheng, Heliang
    Fu, Jianlong
    Mei, Tao
    Luo, Jiebo
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5219 - 5227
  • [37] Knowledge Graph Enhancement for Fine-Grained Zero-Shot Learning on ImageNet21K
    Chen, Xingyu
    Liu, Jiaxu
    Liu, Zeyang
    Wan, Lipeng
    Lan, Xuguang
    Zheng, Nanning
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9090 - 9101
  • [38] Self-supervised learning of pseudo classes for generalized zero-shot fine-grained recognition
    Chen, Y.-H.
    Yeh, M.-C.
    Multimedia Tools and Applications, 2025, 84 (10) : 7915 - 7930
  • [39] Multi-Depth Learning with Multi-Attention for fine-grained image classification
    Dai, Zuhua
    Li, Hongyi
    Li, Kelong
    Zhou, Anwei
    2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020, : 206 - 212
  • [40] Low-complexity Multicast Beamforming for Multi-stream Multi-group Communications
    Mahmoodi, Hamidreza Bakhshzad
    Gouda, Bikshapathi
    Salehi, MohammadJavad
    Tolli, Antti
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,