A Multi-Group Multi-Stream attribute Attention network for fine-grained zero-shot learning

Cited by: 1
Authors
Song, Lingyun [1, 2]
Shang, Xuequn [1, 2]
Zhou, Ruizhi [1, 2]
Liu, Jun [3]
Ma, Jie [3]
Li, Zhanhuai [1, 2]
Sun, Mingxuan [4]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] Northwestern Polytech Univ, Key Lab Big Data Storage & Management, Minist Ind & Informat Technol, Xian 710129, Peoples R China
[3] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, SPKLSTN Lab, Xian 710049, Peoples R China
[4] Louisiana State Univ, Sch Elect Engn & Comp Sci, Div Comp Sci & Engn, Baton Rouge, LA 70803 USA
Keywords
Fine-grained classification; Convolutional neural network; Attribute prediction; Zero-shot learning
DOI
10.1016/j.neunet.2024.106558
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained visual categorization in the zero-shot setting is a challenging problem in computer vision. It requires algorithms to accurately identify fine-grained categories that do not appear during training and that are visually very similar to each other. Existing methods usually address this problem by using attribute information as intermediate knowledge, which provides sufficient fine-grained characteristics of categories and can be transferred from seen categories to unseen ones. However, learning attribute visual features is not trivial, for two reasons: (i) the visual information of attributes of different types may interfere with each other's feature learning; (ii) the visual characteristics of the same attribute may vary across categories. To solve these issues, we propose a Multi-Group Multi-Stream attribute Attention network (MGMSA), which not only separates the feature learning of attributes of different types, but also isolates the learning of attribute visual features for categories with large differences in attribute appearance. This avoids interference between uncorrelated attributes and helps to learn category-specific, attribute-related visual features, which is beneficial for distinguishing fine-grained categories with subtle visual differences. Extensive experiments on benchmark datasets show that MGMSA achieves state-of-the-art performance on attribute prediction and fine-grained zero-shot learning.
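The general idea behind the abstract (attributes as intermediate knowledge, with feature learning separated per attribute group) can be illustrated with a small toy sketch. This is not the MGMSA architecture: the record gives no implementation details, so the function names, the two-region features, the group queries, the attribute weights, and the class signatures below are all hypothetical. The sketch covers only the "multi-group" idea (one attention query per attribute group) and omits the "multi-stream" part entirely.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def group_attention(regions, query):
    # Attention-pool the region features using one group's query vector,
    # so each attribute group attends to its own image regions.
    weights = softmax([dot(r, query) for r in regions])
    dim = len(regions[0])
    return [sum(w * r[d] for w, r in zip(weights, regions)) for d in range(dim)]

def predict_attributes(regions, groups):
    # Score every attribute from the pooled feature of its own group only.
    scores = {}
    for group in groups.values():
        pooled = group_attention(regions, group["query"])
        for name, weight in group["attributes"].items():
            scores[name] = dot(pooled, weight)
    return scores

def zero_shot_classify(scores, class_signatures):
    # Pick the class whose attribute signature is most compatible with the
    # predicted attribute scores; no images of these classes are required.
    def compatibility(signature):
        return sum(scores[attr] * value for attr, value in signature.items())
    return max(class_signatures, key=lambda c: compatibility(class_signatures[c]))

# Hypothetical toy data: two region features and two attribute groups.
regions = [[1.0, 0.0], [0.0, 1.0]]
groups = {
    "head": {"query": [5.0, 0.0], "attributes": {"red_head": [1.0, -1.0]}},
    "wing": {"query": [0.0, 5.0], "attributes": {"striped_wing": [-1.0, 1.0]}},
}
# Hypothetical attribute signatures of two unseen classes.
class_signatures = {
    "class_a": {"red_head": 1.0, "striped_wing": 1.0},
    "class_b": {"red_head": 1.0, "striped_wing": -1.0},
}

scores = predict_attributes(regions, groups)
predicted = zero_shot_classify(scores, class_signatures)
print(scores, predicted)
```

In this toy setup each group's query attends almost exclusively to its own region, so the score of one group's attribute is barely affected by the other group's region. That is a simplified stand-in for the interference-avoidance goal described in the abstract.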
Pages: 17