Pedestrian Attribute Recognition Based on Multimodal Transformer

被引:0
|
作者
Liu, Dan [1 ]
Song, Wei [1 ,2 ,3 ]
Zhao, Xiaobing [1 ,3 ]
机构
[1] Minzu Univ China, Sch Informat Engn, Beijing 100081, Peoples R China
[2] Minzu Univ China, Key Lab Ethn Language Intelligent Anal & Secur Go, MOE, Beijing 100081, Peoples R China
[3] Minzu Univ China, Natl Lauguage Resource Monitoring & Res Ctr Minor, Beijing 100081, Peoples R China
关键词
Pedestrian Attribute Recognition; Multimodal Learning; Transformer;
D O I
10.1007/978-981-99-8429-9_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian attribute recognition (PAR) is susceptible to variable shooting angles, lighting, and occlusions. Improving recognition accuracy to suit its application in various complex scenarios is one of the most important tasks. In this paper, based on the Image-Text Multimodal Transformer, the intra-modal and inter-modal correlations are learned from pedestrian images and attribute labels. The applicability of six different multimodal fusion frameworks for attribute recognition is explored. The impact of different frameworks' fused feature division methods on recognition accuracy is compared and analyzed. The comparative experiments verify the robustness and efficiency of the Early Concatenate framework, which has achieved multiple best metric scores on the two major public PAR datasets, PA100k and RAP. This paper not only proposes a new Transformer-based high-accuracy multimodal network, but also provides feasible ideas and directions for further research on PAR. The comparative discussion based on various multimodal frame-works also provides a perspective that can be learned for other multimodal tasks.
引用
收藏
页码:422 / 433
页数:12
相关论文
共 50 条
  • [31] Research and Implementation of Pedestrian Attribute Recognition Algorithm Based on Deep Learning
    Fang, Weilan
    Lu, ZhengQing
    Wang, ChaoWei
    Zhou, Zhihong
    Shi, Guoliang
    Yin, Ying
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2024, 17 (01)
  • [32] Pedestrian Attribute Recognition Based on Dual Self-attention Mechanism
    Fan, Zhongkui
    Guan, Ye-peng
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2023, 20 (02) : 793 - 812
  • [33] PEDESTRIAN ATTRIBUTE RECOGNITION BASED ON MTCNN WITH ONLINE BATCH WEIGHTED LOSS
    He, Xingting
    Shi, Qiuyue
    Su, Fei
    Zhao, Zhicheng
    Zhuang, Bojin
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2461 - 2465
  • [34] A More Efficient Approach for Pedestrian Attribute Recognition
    Hu, Yang
    Wang, Jiaxing
    Tian, Qing
    Wan, Genxun
    Sun, Weichen
    Wang, Ning
    2022 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB), 2022,
  • [35] Recurrent Attention Model for Pedestrian Attribute Recognition
    Zhao, Xin
    Sang, Liufang
    Ding, Guiguang
    Han, Jungong
    Di, Na
    Yan, Chenggang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9275 - 9282
  • [36] Explicit Attention Modeling for Pedestrian Attribute Recognition
    Fang, Jinyi
    Zhu, Bingke
    Chen, Yingying
    Wang, Jinqiao
    Tang, Ming
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2075 - 2080
  • [37] Pedestrian Attribute Recognition in Surveillance Scenes: A Survey
    Jia J.
    Chen X.-T.
    Huang K.-Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (08): : 1765 - 1793
  • [38] Hierarchical Reasoning Network for Pedestrian Attribute Recognition
    An, Haoran
    Hu, Hai-Miao
    Guo, Yuanfang
    Zhou, Qianli
    Li, Bo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 268 - 280
  • [39] Multi-attribute Learning for Pedestrian Attribute Recognition in Surveillance Scenarios
    Li, Dangwei
    Chen, Xiaotang
    Huang, Kaiqi
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 111 - 115
  • [40] UPAR Challenge: Pedestrian Attribute Recognition and Attribute-based Person Retrieval - Dataset, Design, and Results
    Cormier, Mickael
    Specker, Andreas
    Jacques, Julio C. S., Jr.
    Florin, Lucas
    Metzler, Juergen
    Moeslund, Thomas B.
    Nasrollahi, Kamal
    Escalera, Sergio
    Beyerer, Juergen
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2023, : 166 - 175