Clothing Parsing Based on Multi-Scale Fusion and Improved Self-Attention Mechanism

Authors
陈诺 [1]
王绍宇 [1]
陆然 [1]
李文萱 [1]
覃志东 [1]
石秀金 [1]
Affiliation
[1] College of Computer Science and Technology, Donghua University
DOI
10.19884/j.1672-5220.202303008
Chinese Library Classification (CLC) number
TS941 [Clothing industry]; TP391.41; TP18 [Artificial intelligence theory]
Discipline classification code
080203; 081104; 0812; 0835; 1405
Abstract
Existing deep learning-based methods lack long-range association and spatial location information, so they cannot always recover fine details and accurate boundaries in complex clothing images. This paper presents a convolutional structure with multi-scale fusion to optimize clothing feature extraction, together with a self-attention module to capture long-range association information. Through the down-scaling projection operation of the multi-scale framework, the structure lets the self-attention mechanism participate directly in the information exchange. In addition, the improved self-attention module extracts 2-dimensional relative position information to compensate for its limited ability to extract spatial position features from clothing images. Experimental results on the Colorful Fashion Parsing Dataset (CFPD) show that the proposed network achieves 53.68% mean intersection over union (mIoU) and performs better on the clothing parsing task.
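For readers unfamiliar with the mIoU metric quoted in the abstract, the following is a minimal sketch of how it is commonly computed from a predicted and a ground-truth label map. It is not the authors' evaluation code: the function name `mean_iou`, the toy label maps, and the choice to skip classes that appear in neither map are assumptions made for illustration, and the paper's exact protocol on CFPD may differ.

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target (hypothetical helper)."""
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # Confusion matrix: rows are ground-truth classes, columns are predicted classes.
    conf = np.bincount(target * num_classes + pred, minlength=num_classes ** 2)
    conf = conf.reshape(num_classes, num_classes)
    intersection = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - intersection
    # Assumption: classes absent from both maps are excluded from the mean.
    valid = union > 0
    return float((intersection[valid] / union[valid]).mean())

# Toy 2x2 label maps with two classes (illustrative data only).
pred = np.array([[0, 1], [1, 1]])
target = np.array([[0, 1], [0, 1]])
score = mean_iou(pred, target, num_classes=2)
print(round(score, 4))  # 0.5833
```

In this toy case class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12.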
Pages: 661-666
Number of pages: 6