Multi-Task Collaborative Attention Network for Pedestrian Attribute Recognition

被引:1
|
作者
Cao, Junliang [1 ]
Wei, Hua [1 ]
Sun, Yongli [1 ]
Zhao, Zhifeng [1 ]
Wang, Wei [1 ]
Sun, Guangze [1 ]
Wang, Gang [1 ]
机构
[1] Xian Fiberhome Software Tech, Xian, Peoples R China
关键词
Pedestrian Attribute Recognition; Feature Division Module; Spatial and Channel Collaborative Attention Module; Multi-Task Collaborative Attention Network; Adaptive-Soups;
D O I
10.1109/IJCNN54540.2023.10191574
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian Attribute Recognition (PAR) is a multi-task attribute leaning problem. Research into person attributes recognition has focused on approaches to describe a person in terms of their appearance. Combination of some attributes is helpful to strengthen each other's learning such as upper clothing style and upper clothing length, while others are not, such as hair style and upper clothing length. Thus, how to effectively combine different task is the key challenges in PAR. To effectively utilizing the relationship between attributes and further improve the effects of PAR, we propose a novel Multi-Task Collaborative Attention Network (MTCAN), which consists of three modules. Specifically, we first design a Feature Division Module (FDM) to focus on reliable and flexible attribute-related regions. Based on the precise attribute-related locations, we further construct a Spatial and Channel Collaborative Attention Module (SCCAM) to facilitate the beneficial features and weaken mutually suppressed features. Thirdly, a newly weights fusion strategy named adaptive-soups is proposed to mine the optimal model which is universal for deep learning models in all fields. Experiments on two pedestrian attribute recognition datasets show that our proposed method achieves superior performance against other state-of-the-art methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Adaptively Weighted Multi-task Deep Network for Person Attribute Classification
    He, Keke
    Wang, Zhanxiong
    Fu, Yanwei
    Feng, Rui
    Jiang, Yu-Gang
    Xue, Xiangyang
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1636 - 1644
  • [42] Face Attribute Estimation Using Multi-Task Convolutional Neural Network
    Kawai, Hiroyarr
    Ito, Koichi
    Aoki, Takafumi
    JOURNAL OF IMAGING, 2022, 8 (04)
  • [43] ATTRIBUTE-AWARE NETWORK FOR PEDESTRIAN ATTRIBUTE RECOGNITION
    Wu, Zesen
    Ye, Mang
    Chen, Shuoyi
    Du, Bo
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
  • [44] Pedestrian Attributes Recognition in Surveillance Scenarios with Hierarchical Multi-Task CNN Models
    Fang, Wenhua
    Chen, Jun
    Hu, Ruimin
    CHINA COMMUNICATIONS, 2018, 15 (12) : 208 - 219
  • [45] Multi-Task Multi-Attention Transformer for Generative Named Entity Recognition
    Mo, Ying
    Liu, Jiahao
    Tang, Hongyin
    Wang, Qifan
    Xu, Zenglin
    Wang, Jingang
    Quan, Xiaojun
    Wu, Wei
    Li, Zhoujun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4171 - 4183
  • [46] Pedestrian Attributes Recognition in Surveillance Scenarios with Hierarchical Multi-task CNN Models
    Fang, Wenhua
    Chen, Jun
    Lu, Tao
    Hu, Ruimin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 758 - 767
  • [47] MACC Net: Multi-task attention crowd counting network
    Aldhaheri, Sahar
    Alotaibi, Reem
    Alzahrani, Bandar
    Hadi, Anas
    Mahmood, Arif
    Alhothali, Areej
    Barnawi, Ahmed
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9285 - 9297
  • [48] MACC Net: Multi-task attention crowd counting network
    Sahar Aldhaheri
    Reem Alotaibi
    Bandar Alzahrani
    Anas Hadi
    Arif Mahmood
    Areej Alhothali
    Ahmed Barnawi
    Applied Intelligence, 2023, 53 : 9285 - 9297
  • [49] MTSAN: Multi-Task Semantic Attention Network for ADAS Applications
    Lai, Chun-Yu
    Wu, Bo-Xun
    Shivanna, Vinay Malligere
    Guo, Jiun-In
    IEEE ACCESS, 2021, 9 (09): : 50700 - 50714
  • [50] Pedestrian Attributes Recognition in Surveillance Scenarios with Hierarchical Multi-Task CNN Models
    Wenhua Fang
    Jun Chen
    Ruimin Hu
    中国通信, 2018, 15 (12) : 208 - 219