A Multi-Group Multi-Stream attribute Attention network for fine-grained zero-shot learning

被引:1
|
作者
Song, Lingyun [1 ,2 ]
Shang, Xuequn [1 ,2 ]
Zhou, Ruizhi [1 ,2 ]
Liu, Jun [3 ]
Ma, Jie [3 ]
Li, Zhanhuai [1 ,2 ]
Sun, Mingxuan [4 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] Northwestern Polytech Univ, Key Lab Big Data Storage & Management, Minist Ind & Informat Technol, Xian 710129, Peoples R China
[3] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, SPKLSTN Lab, Xian 710049, Peoples R China
[4] Louisiana State Univ, Sch Elect Engn & Comp Sci, Div Comp Sci & Engn, Baton Rouge, LA 70803 USA
关键词
Fine-grained classification; Convolutional neural network; Attribute prediction; Zero-shot learning;
D O I
10.1016/j.neunet.2024.106558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization in zero-shot setting is a challenging problem in the computer vision community. It requires algorithms to accurately identify fine-grained categories that do not appear during the training phase and have high visual similarity to each other. Existing methods usually address this problem by using attribute information as intermediate knowledge, which provides sufficient fine-grained characteristics of categories and can be transferred from seen categories to unseen categories. However, the learning of attribute visual features is not trivial due to the following two reasons: (i) The visual information about attributes of different types may interfere with the visual feature learning of each other. (ii) The visual characteristics of the same attribute may vary in different categories. To solve these issues, we propose a Multi-Group Multi- Stream attribute Attention network (MGMSA), which not only separates the feature learning of attributes of different types, but also isolates the learning of attribute visual features for categories with big differences in attribute appearance. This avoids the interference between uncorrelated attributes and helps to learn category- specific attribute-related visual features. This is beneficial for distinguishing fine-grained categories with subtle visual differences. Extensive experiments on benchmark datasets show that MGMSA achieves state-of-the-art performance on attribute prediction and fine-grained zero-shot learning.
引用
收藏
页数:17
相关论文
共 50 条
  • [11] Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics
    Ao-Xue Li
    Ke-Xin Zhang
    Li-Wei Wang
    International Journal of Automation and Computing, 2019, 16 : 563 - 574
  • [12] Fine-grained Human Action Recognition Based on Zero-Shot Learning
    Zhao, Yahui
    Shi, Ping
    You, Jian
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 294 - 297
  • [13] Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics
    Li, Ao-Xue
    Zhang, Ke-Xin
    Wang, Li-Wei
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2019, 16 (05) : 563 - 574
  • [14] Multi-stream I3D Network for Fine-grained Action Recognition
    You, Jian
    Shi, Ping
    Bao, Xiaojie
    PROCEEDINGS OF 2018 IEEE 4TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2018), 2018, : 611 - 614
  • [15] Neural Zero-Shot Fine-Grained Entity Typing
    Ren, Yankun
    Lin, Jianbin
    Zhou, Jun
    WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 846 - 847
  • [16] Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
    Lin, Haoqiang
    Wen, Haokun
    Song, Xuemeng
    Liu, Meng
    Hu, Yupeng
    Nie, Liqiang
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 240 - 250
  • [17] Correction to: Zero-shot Fine-grained Classification by Deep Feature Learning with Semantics
    Ao-Xue Li
    Ke-Xin Zhang
    Li-Wei Wang
    International Journal of Automation and Computing, 2021, 18 : 1045 - 1045
  • [18] Fine-Grained Object Recognition and Zero-Shot Learning in Remote Sensing Imagery
    Sumbul, Gencer
    Cinbis, Ramazan Gokberk
    Aksoy, Selim
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (02): : 770 - 779
  • [19] Hybrid-order and Multi-stream Convolutional Neural Network for Fine-grained Visual Recognition
    Liu, Yang
    Gu, Hengrui
    Li, Chunguo
    Xu, Qinzhen
    Yang, Luxi
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [20] Integrate d generalize d zero-shot learning for fine-grained classification
    Shermin, Tasfia
    Teng, Shyh Wei
    Sohel, Ferdous
    Murshed, Manzur
    Lu, Guojun
    PATTERN RECOGNITION, 2022, 122