A Multi-Group Multi-Stream attribute Attention network for fine-grained zero-shot learning

Cited by: 1

Authors:

Song, Lingyun [1,2]

Shang, Xuequn [1,2]

Zhou, Ruizhi [1,2]

Liu, Jun [3]

Ma, Jie [3]

Li, Zhanhuai [1,2]

Sun, Mingxuan [4]

Affiliations:

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China

[2] Northwestern Polytech Univ, Key Lab Big Data Storage & Management, Minist Ind & Informat Technol, Xian 710129, Peoples R China

[3] Xi An Jiao Tong Univ, Dept Comp Sci & Technol, SPKLSTN Lab, Xian 710049, Peoples R China

[4] Louisiana State Univ, Sch Elect Engn & Comp Sci, Div Comp Sci & Engn, Baton Rouge, LA 70803 USA

Source:

NEURAL NETWORKS | 2024 / Volume 179

Keywords:

Fine-grained classification; Convolutional neural network; Attribute prediction; Zero-shot learning;

DOI:

10.1016/j.neunet.2024.106558

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Fine-grained visual categorization in the zero-shot setting is a challenging problem in the computer vision community. It requires algorithms to accurately identify fine-grained categories that do not appear during the training phase and have high visual similarity to each other. Existing methods usually address this problem by using attribute information as intermediate knowledge, which provides sufficient fine-grained characteristics of categories and can be transferred from seen categories to unseen categories. However, learning attribute visual features is not trivial, for two reasons: (i) the visual information about attributes of different types may interfere with each other's visual feature learning; (ii) the visual characteristics of the same attribute may vary across categories. To solve these issues, we propose a Multi-Group Multi-Stream attribute Attention network (MGMSA), which not only separates the feature learning of attributes of different types, but also isolates the learning of attribute visual features for categories with large differences in attribute appearance. This avoids interference between uncorrelated attributes and helps to learn category-specific attribute-related visual features, which is beneficial for distinguishing fine-grained categories with subtle visual differences. Extensive experiments on benchmark datasets show that MGMSA achieves state-of-the-art performance on attribute prediction and fine-grained zero-shot learning.
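
The abstract describes attributes as intermediate knowledge that transfers from seen to unseen categories. The sketch below illustrates only that general idea and is not the MGMSA architecture: it assumes some upstream model has already produced attribute scores for an image, then picks the unseen class whose attribute description matches best. The function names, class names, attribute names, and numbers are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def zero_shot_classify(predicted_attributes, class_attributes):
    """Return the class whose attribute vector is most similar to the prediction.

    class_attributes maps a class name to its attribute vector. The classes
    need no training images, only an attribute description.
    """
    return max(
        class_attributes,
        key=lambda name: cosine(predicted_attributes, class_attributes[name]),
    )


# Hypothetical attribute order: [red_crown, striped_wing, long_beak]
unseen_classes = {
    "class_a": [1.0, 0.0, 1.0],
    "class_b": [1.0, 1.0, 0.0],
}

# Assumed output of an attribute predictor trained on seen classes only.
predicted = [0.9, 0.2, 0.8]

label = zero_shot_classify(predicted, unseen_classes)
print(label)
```

In this toy setup the two classes share the first attribute and differ only in the other two, so the decision depends entirely on how well those distinguishing attributes are predicted. That is the situation in which the abstract argues that cleaner, category-specific attribute features matter.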


Pages: 17
