On the Imaginary Wings: Text-Assisted Complex-Valued Fusion Network for Fine-Grained Visual Classification

Cited by: 10

Authors:

Guan, Xiang [1]

Yang, Yang [1]

Li, Jingjing [1]

Zhu, Xiaofeng [1]

Song, Jingkuan [1]

Shen, Heng Tao [1,2]

Affiliations:

[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu 611731, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023 / Vol. 34 / No. 08

Funding:

National Natural Science Foundation of China;

Keywords:

Complex values; fine-grained visual classification (FGVC); graph convolutional networks (GCNs); multimodal;

DOI:

10.1109/TNNLS.2021.3126046

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Fine-grained visual classification (FGVC) is challenging due to the interclass similarity and intraclass variation in datasets. In this work, we explore the merit of complex values, whose imaginary part helps model data uncertainty (e.g., different points on the complex plane can describe the same state), and of graph convolutional networks (GCNs), which learn the interdependencies among classes, to tackle these two major challenges simultaneously. To this end, we propose a novel approach, termed text-assisted complex-valued fusion network (TA-CFN). Specifically, we expand each feature from 1-D real values to 2-D complex values by disassembling feature maps, thereby enabling the extension of traditional deep convolutional neural networks to the complex domain. Then, we fuse the real and imaginary parts of the complex features through a complex projection and a modulus operation. Finally, we build an undirected graph over the object labels with the assistance of a text corpus, and a GCN is learned to map this graph into a set of classifiers. The benefits are twofold: 1) complex features allow for a richer algebraic structure that better models the large variation within the same category and 2) the interclass dependencies brought by the GCN help capture the key factors behind the slight variation among different categories. We conduct extensive experiments to verify that our proposed model achieves state-of-the-art performance on two widely used FGVC datasets.
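
The three steps named in the abstract (real-to-complex expansion, fusion by projection and modulus, and GCN-generated classifiers) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. It assumes that "disassembling feature maps" means pairing the first half of a channel vector (real parts) with the second half (imaginary parts), that the complex projection is a plain complex-valued linear map, and that the GCN is a single standard layer with symmetric normalization and no nonlinearity. All function names, weights, and toy inputs are hypothetical.

```python
import math

def split_to_complex(features):
    # Assumption: first half of the channels = real parts, second half = imaginary parts.
    half = len(features) // 2
    return [complex(re, im) for re, im in zip(features[:half], features[half:])]

def complex_project(z, weights):
    # Complex-valued linear projection: each output is a weighted sum of the inputs.
    return [sum(w * v for w, v in zip(row, z)) for row in weights]

def modulus_fuse(z):
    # Fuse real and imaginary parts into one real value per feature via |z|.
    return [abs(v) for v in z]

def matmul(a, b):
    # Plain matrix product on nested lists.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def normalize_adjacency(adj):
    # Standard GCN normalization: D^{-1/2} (A + I) D^{-1/2}.
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def gcn_classifiers(adj, label_embeddings, weight):
    # One linear GCN layer mapping label embeddings to one classifier vector per class.
    return matmul(matmul(normalize_adjacency(adj), label_embeddings), weight)

def class_scores(fused, classifiers):
    # Score each class as the dot product of its classifier with the fused feature.
    return [sum(c * f for c, f in zip(row, fused)) for row in classifiers]

if __name__ == "__main__":
    # Toy image feature with 4 channels, expanded to 2 complex values.
    z = split_to_complex([3.0, 0.0, 4.0, 1.0])
    fused = modulus_fuse(complex_project(z, [[1, 0], [0, 1]]))  # identity projection
    # Toy undirected label graph over 3 classes (a chain), with 2-D label embeddings.
    adj = [[0.0, 1.0, 0.0], [1.0, 0.0, 1.0], [0.0, 1.0, 0.0]]
    embeddings = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
    classifiers = gcn_classifiers(adj, embeddings, [[1.0, 0.0], [0.0, 1.0]])
    print(class_scores(fused, classifiers))
```

In a trained model the projection weights, label embeddings (derived from a text corpus in the paper), and GCN weights would be learned; the identity matrices here only keep the arithmetic easy to follow.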


Pages: 5112-5121

Page count: 10

