Multi-Task Collaborative Attention Network for Pedestrian Attribute Recognition

Cited by: 1

Authors:

Cao, Junliang [1]

Wei, Hua [1]

Sun, Yongli [1]

Zhao, Zhifeng [1]

Wang, Wei [1]

Sun, Guangze [1]

Wang, Gang [1]

Affiliations:

[1] Xian Fiberhome Software Tech, Xian, Peoples R China

Source:

2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023

Keywords:

Pedestrian Attribute Recognition; Feature Division Module; Spatial and Channel Collaborative Attention Module; Multi-Task Collaborative Attention Network; Adaptive-Soups;

DOI:

10.1109/IJCNN54540.2023.10191574

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Pedestrian Attribute Recognition (PAR) is a multi-task attribute learning problem. Research on person attribute recognition has focused on approaches that describe a person in terms of their appearance. Some attribute combinations reinforce each other's learning, such as upper clothing style and upper clothing length, while others do not, such as hair style and upper clothing length. Thus, how to effectively combine different tasks is the key challenge in PAR. To effectively utilize the relationships between attributes and further improve PAR performance, we propose a novel Multi-Task Collaborative Attention Network (MTCAN), which consists of three modules. First, we design a Feature Division Module (FDM) to focus on reliable and flexible attribute-related regions. Second, based on the precise attribute-related locations, we construct a Spatial and Channel Collaborative Attention Module (SCCAM) to strengthen mutually beneficial features and weaken mutually suppressed features. Third, we propose a new weight fusion strategy named adaptive-soups, which is universal for deep learning models in all fields, to mine the optimal model. Experiments on two pedestrian attribute recognition datasets show that our proposed method achieves superior performance against other state-of-the-art methods.
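
The abstract names adaptive-soups as a weight fusion strategy but does not describe how it works. As a rough illustration of the general idea of fusing the weights of several trained checkpoints into one model, the sketch below computes a weighted average of parameter values. It is not the authors' adaptive-soups method: the function name `fuse_weights`, the dictionary layout of a checkpoint, and the `coeffs` parameter are all assumptions made for this example.

```python
from typing import Dict, List, Optional, Sequence

# Hypothetical checkpoint layout: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def fuse_weights(
    checkpoints: Sequence[Weights],
    coeffs: Optional[Sequence[float]] = None,
) -> Weights:
    """Return a weighted average of several checkpoints.

    This is plain weight averaging, shown only as an illustration.
    With coeffs=None every checkpoint contributes equally.
    """
    if not checkpoints:
        raise ValueError("at least one checkpoint is required")
    if coeffs is None:
        coeffs = [1.0] * len(checkpoints)
    if len(coeffs) != len(checkpoints):
        raise ValueError("one coefficient per checkpoint is required")
    total = float(sum(coeffs))
    if total == 0.0:
        raise ValueError("coefficients must not sum to zero")
    # Normalize so the fused weights stay on the same scale as the inputs.
    norm = [c / total for c in coeffs]

    fused: Weights = {}
    for name, first in checkpoints[0].items():
        acc = [0.0] * len(first)
        for ckpt, c in zip(checkpoints, norm):
            values = ckpt[name]
            if len(values) != len(first):
                raise ValueError(f"shape mismatch for parameter {name!r}")
            for i, v in enumerate(values):
                acc[i] += c * v
        fused[name] = acc
    return fused


if __name__ == "__main__":
    a = {"w": [1.0, 3.0]}
    b = {"w": [3.0, 5.0]}
    print(fuse_weights([a, b]))            # equal contribution
    print(fuse_weights([a, b], [3.0, 1.0]))  # checkpoint a weighted 3:1
```

How the coefficients would be chosen (for example, from validation accuracy) is exactly the part an adaptive scheme would define, and the abstract leaves it unspecified.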


Pages: 6
